Skip to content
Scan a barcode
Scan
Paperback Machine Learning Features for Determining Article Use in English Book

ISBN: 1973226669

ISBN13: 9781973226666

Machine Learning Features for Determining Article Use in English

Native speakers of English know intuitively when to use articles ("a/an", "the" or zero-article), but deFining when to use these common words for non-native speakers or machine translation systems is anything but straightforward. This thesis explores machine learning approaches to determining when to use an article for a given noun phrase, focusing on the effect of different genres and features on a model's performance. We start with a theoretical overview of what articles are and how they're used, followed by a summary of previous rule-based and machine learning approaches. We then evaluate a neural network model on six different genres of text (EU legislation, Fiction, news, parallel web pages, subtitles and technical documentation) and Find that genres with smaller vocabularies and more exact repetition perform best. Four feature families are used (lexical, morphological, syntactic and discourse), combining features from previous rule-based and machine learning approaches. Using feature importance and feature ablation tests, we Find that lexical and morphological features are the most salient, while syntactic and discourse features contribute little to improve a model's performance.

Recommended

Format: Paperback

Temporarily Unavailable

We receive fewer than 1 copy every 6 months.

Save to List

Customer Reviews

0 rating
Copyright © 2026 Thriftbooks.com Terms of Use | Privacy Policy | Do Not Sell/Share My Personal Information | Cookie Policy | Cookie Preferences | Accessibility Statement
ThriftBooks® and the ThriftBooks® logo are registered trademarks of Thrift Books Global, LLC
GoDaddy Verified and Secured