Suche

Wo soll gesucht werden?
Erweiterte Literatursuche

Ariadne Pfad:

Inhalt

Literaturnachweis - Detailanzeige

 
Autor/inn/enSchierholz, Malte; Schonlau, Matthias
TitelMachine learning for occupation coding - a comparison study.
Gefälligkeitsübersetzung: Maschinelles Lernen für die Berufskodierung - eine Vergleichsstudie.
QuelleIn: Journal of survey statistics and methodology, 9 (2020) 5, 22 S.
PDF als Volltext kostenfreie Datei  Link als defekt meldenVerfügbarkeit 
Spracheenglisch
Dokumenttyponline; Zeitschriftenaufsatz
ISSN2325-0992
DOI10.1093/jssam/smaa023
SchlagwörterKünstliche Intelligenz; Lernen; Algorithmus; Automatisierung; Ausgeübter Beruf; Berufsbezeichnung; Berufsklassifikation; BIBB/BAuA-Erhebung; Codierung; IAB-Haushaltspanel; Statistische Methode
Abstract"Asking people about their occupation is common practice in surveys and censuses around the world. The answers are typically recorded in textual form and subsequently assigned (coded) to categories, which have been defined in official occupational classifications. While this coding step is often done manually, substituting it with more automated workflows has been a longstanding goal, promising reduced data-processing costs and accelerated publication of key statistics. Although numerous researchers have developed different algorithms for automated occupation coding, the algorithms have rarely been compared with each other or tested on different data sets. We fill this gap by comparing some of the most promising algorithms found in the literature and testing them on five data sets from Germany. The first two algorithms we test exemplify a common practice in which answers are coded automatically according to a predefined list of job titles. Statistical learning algorithms - that is, regularized multinomial regression, tree boosting, or algorithms developed specifically for occupation coding (algorithms three to six) - can improve upon algorithms one and two, but only if a sufficient number of training observations from previous surveys is available. The best results are obtained by merging the list of job titles with coded answers from previous surveys before using this combined training data for statistical learning (algorithm 7). However, the differences between the algorithms are often small compared to the large variation found across different data sets, which we ascribe to systematic differences in the way the data were coded in the first place. Such differences complicate the application of statistical learning, which risks perpetuating questionable coding decisions from the training data to the future." (Author's abstract, IAB-Doku).
Erfasst vonInstitut für Arbeitsmarkt- und Berufsforschung, Nürnberg
Update2021/2
Literaturbeschaffung und Bestandsnachweise in Bibliotheken prüfen
 

Standortunabhängige Dienste
Bibliotheken, die die Zeitschrift "Journal of survey statistics and methodology" besitzen:
Link zur Zeitschriftendatenbank (ZDB)

Artikellieferdienst der deutschen Bibliotheken (subito):
Übernahme der Daten in das subito-Bestellformular

Tipps zum Auffinden elektronischer Volltexte im Video-Tutorial

Trefferlisten Einstellungen

Permalink als QR-Code

Permalink als QR-Code

Inhalt auf sozialen Plattformen teilen (nur vorhanden, wenn Javascript eingeschaltet ist)

Teile diese Seite: