G8 Research Output Database - Documentation
G8 Research Output Database — Methodology (User Documentation)

1) Data Sources & Joins

University Staff Directories

Journal Quality Lists (ABDC/JQL)

Journal Impact Factor (Clarivate JIF)

Database Linking (journal matching pass)

2) Standardisation Pipeline (Exactly How We Clean & Normalize)

2.1 Required fields & null handling

2.2 Type cleanup

2.3 Year coercion

2.4 Role blacklist (exclusions)

2.5 Title normalization (canonical roles)

Normalize raw titles to canonical forms. We search Job Title first, then fall back to Researcher Name if no recognized role is found there.

Raw / Variant                                      | Canonical
Associate Lecturer                                 | Associate Lecturer
Lecturer (A)                                       | Associate Lecturer
Lecturer                                           | Lecturer
Fellow                                             | Fellow
Senior Lecturer                                    | Senior Lecturer
Senior Fellow                                      | Senior Fellow
Associate Professor / Associate Prof / AsPr        | Associate Professor
Professor / Prof                                   | Professor
Professorial Fellow                                | Professorial Fellow
Professor Emeritus / Emeritus Professor / Emeritus | Professor Emeritus
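
The lookup described above can be sketched in Python. This is a minimal illustration, not the pipeline's actual code: the names VARIANT_TO_CANONICAL and canonical_role are hypothetical, and the matching rules (case-insensitive, whole-word, longest variant tried first) are assumptions, since the documentation only states the search order and the variant list.

```python
import re
from typing import Optional

# Variant -> canonical role, copied from the table in section 2.5.
VARIANT_TO_CANONICAL = {
    "Associate Lecturer": "Associate Lecturer",
    "Lecturer (A)": "Associate Lecturer",
    "Lecturer": "Lecturer",
    "Fellow": "Fellow",
    "Senior Lecturer": "Senior Lecturer",
    "Senior Fellow": "Senior Fellow",
    "Associate Professor": "Associate Professor",
    "Associate Prof": "Associate Professor",
    "AsPr": "Associate Professor",
    "Professor": "Professor",
    "Prof": "Professor",
    "Professorial Fellow": "Professorial Fellow",
    "Professor Emeritus": "Professor Emeritus",
    "Emeritus Professor": "Professor Emeritus",
    "Emeritus": "Professor Emeritus",
}

# Assumption: longer variants are tried first, so "Senior Lecturer" is not
# read as "Lecturer", and matches must fall on word boundaries.
_PATTERNS = [
    (re.compile(r"(?<!\w)" + re.escape(variant) + r"(?!\w)", re.IGNORECASE), canonical)
    for variant, canonical in sorted(
        VARIANT_TO_CANONICAL.items(), key=lambda item: -len(item[0])
    )
]


def canonical_role(job_title: Optional[str],
                   researcher_name: Optional[str] = None) -> Optional[str]:
    """Return the canonical role, searching Job Title first, then Researcher Name."""
    for text in (job_title, researcher_name):
        if not text:
            continue
        for pattern, canonical in _PATTERNS:
            if pattern.search(text):
                return canonical
    return None  # no recognized role in either field
```

Under these assumptions, canonical_role("Senior Lecturer in Finance") returns "Senior Lecturer", and canonical_role(None, "Prof Jane Doe") falls back to the name field and returns "Professor".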

2.6 Name cleaning

2.7 Academic level mapping (A–E)

Canonical Title     | Level
Associate Lecturer  | A
Lecturer            | B
Fellow              | B
Senior Lecturer     | C
Senior Fellow       | C
Associate Professor | D
Professor           | E
Professorial Fellow | E
Professor Emeritus  | E
Exclude

Note: If role is None or unrecognized, the level is set to None.
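
As a rough sketch, the level mapping amounts to a dictionary lookup that yields None for a missing or unrecognized role. The names LEVEL_BY_ROLE and academic_level are hypothetical and chosen for illustration only; the title-to-level pairs come from the table above.

```python
from typing import Optional

# Canonical title -> academic level (A-E), copied from the table in section 2.7.
LEVEL_BY_ROLE = {
    "Associate Lecturer": "A",
    "Lecturer": "B",
    "Fellow": "B",
    "Senior Lecturer": "C",
    "Senior Fellow": "C",
    "Associate Professor": "D",
    "Professor": "E",
    "Professorial Fellow": "E",
    "Professor Emeritus": "E",
}


def academic_level(role: Optional[str]) -> Optional[str]:
    """Return the A-E level for a canonical role, or None if it is missing or unrecognized."""
    if role is None:
        return None
    return LEVEL_BY_ROLE.get(role)
```

For example, academic_level("Senior Fellow") returns "C", while academic_level("Dean") and academic_level(None) both return None.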

3) What Gets Written to the Database

4) Ranking Metrics (Exactly How They're Computed)

  1. Total Publications: count of journal articles retrieved (after cleaning).
  2. A*/A-Ranked Publications: count of publications whose journals map to ABDC A* or A.
  3. Average JIF: mean of JIF values across only those articles with valid JIF.
    Avg JIF = (Σ JIF_i) / N, where N = number of articles with JIF.
  4. Average 5-Year JIF: same as above, using 5-year JIF.
  5. Average Citation Percentile: mean of available OpenAlex citation percentiles for a researcher’s publications.

Notes: JIF and 5-year JIF are journal-level stats joined via ISSN → JCR. If none of a researcher's articles has a JIF, the JIF-based averages are undefined (not zero).
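
The five metrics can be sketched as follows. This is an illustrative sketch under stated assumptions, not the production code: the Article record, its field names, and the function names are hypothetical, and None stands in for both a missing value and an "undefined" average.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, Iterable, Optional, Sequence


@dataclass
class Article:
    # Hypothetical per-article record; None means the value is unavailable.
    abdc_rank: Optional[str] = None          # e.g. "A*", "A", "B", "C"
    jif: Optional[float] = None              # journal-level JIF
    jif_5yr: Optional[float] = None          # journal-level 5-year JIF
    citation_percentile: Optional[float] = None


def mean_of_available(values: Iterable[Optional[float]]) -> Optional[float]:
    """Average only the values that exist; None (undefined, not zero) if there are none."""
    present = [v for v in values if v is not None]
    return mean(present) if present else None


def ranking_metrics(articles: Sequence[Article]) -> Dict[str, Optional[float]]:
    return {
        "total_publications": len(articles),
        "a_star_a_publications": sum(1 for a in articles if a.abdc_rank in ("A*", "A")),
        "avg_jif": mean_of_available(a.jif for a in articles),
        "avg_jif_5yr": mean_of_available(a.jif_5yr for a in articles),
        "avg_citation_percentile": mean_of_available(
            a.citation_percentile for a in articles
        ),
    }


# Usage with the JIF values from the worked example in section 6,
# assuming the fifth article has no JIF.
articles = [Article(jif=v) for v in (7.5, 5.0, 2.5, 1.2, None)]
metrics = ranking_metrics(articles)
print(metrics["total_publications"], round(metrics["avg_jif"], 2))
```

Returning None rather than 0 keeps a researcher with no JIF data distinguishable from one whose journals genuinely have a low JIF.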

5) Known Limitations (Transparency for Users)

6) Worked Example (End-to-End)

Assume a researcher has 5 articles:

Totals: total publications = 5; A*/A publications = 3.

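Avg JIF: (7.5 + 5.0 + 2.5 + 1.2) / 4 = 4.05. Only four of the five articles have a JIF, so N = 4; the article without a JIF is left out rather than counted as zero.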

Avg 5-year JIF: computed the same way, using 5-year values only.

Avg Citation Percentile: mean of all available publication percentiles.

7) Complete Lists (for Auditing)

Accepted canonical roles

Raw variants recognized (mapped to above)

Excluded keywords (blacklist)