Suche

Wo soll gesucht werden?
Erweiterte Literatursuche

Ariadne Pfad:

Inhalt

Literaturnachweis - Detailanzeige

 
Autor/inSoliman, Ashraf
TitelAn Unsupervised Linguistic-Based Model for Automatic Glossary Term Extraction from a Single PDF Textbook
QuelleIn: Education and Information Technologies, 28 (2023) 12, S.16089-16125 (37 Seiten)Infoseite zur Zeitschrift
PDF als Volltext Verfügbarkeit 
ZusatzinformationORCID (Soliman, Ashraf)
Spracheenglisch
Dokumenttypgedruckt; online; Zeitschriftenaufsatz
ISSN1360-2357
DOI10.1007/s10639-023-11818-1
SchlagwörterLinguistics; Automation; Glossaries; Textbooks; Electronic Books
AbstractTerm extraction from textbooks is the cornerstone of many different intelligent natural language processing systems, especially those that support learners and educators in the education system. This paper proposes a novel unsupervised domain-independent model that automatically extracts relevant and domain-related key terms from a single PDF textbook, without relying on a statistical technique or external knowledge base. It only relies on the basic linguistic techniques of the natural language processing: pattern recognition, sentence tokenization, part-of-speech tagging, and chunking. The model takes a PDF textbook as an input and produces a list of key terms as an output. Furthermore, the model proposes a novel classification of sentences from which the concept of defining sentences is proposed. The defining sentences are the main textual units that the model revolves around to identify the key terms. The architecture of the proposed work consists of 21 processes distributed across three phases. The first phase consists of five processes for extracting text from a PDF textbook and cleaning it for the next phases. The second phase consists of eight processes for identifying the defining sentences and extracting them from all the textbook's sentences. The last phase consists of eight processes for identifying and extracting the key terms from every defining sentence. The proposed work was evaluated by two experiments in which two PDF textbooks from different fields are used. The experimental evaluation showed that the results were promising. (As Provided).
AnmerkungenSpringer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://link.springer.com/
Erfasst vonERIC (Education Resources Information Center), Washington, DC
Update2024/1/01
Literaturbeschaffung und Bestandsnachweise in Bibliotheken prüfen
 

Standortunabhängige Dienste
Bibliotheken, die die Zeitschrift "Education and Information Technologies" besitzen:
Link zur Zeitschriftendatenbank (ZDB)

Artikellieferdienst der deutschen Bibliotheken (subito):
Übernahme der Daten in das subito-Bestellformular

Tipps zum Auffinden elektronischer Volltexte im Video-Tutorial

Trefferlisten Einstellungen

Permalink als QR-Code

Permalink als QR-Code

Inhalt auf sozialen Plattformen teilen (nur vorhanden, wenn Javascript eingeschaltet ist)

Teile diese Seite: