ABSTRACT: Extensive research has been done on the analytical and empirical examination of financial data in annual reports to detect fraud; however, there is scant research on the analysis of text in annual reports to detect fraud. The basic premise of this research is that there are clues hidden in the text that can be detected to determine the likelihood of fraud. In this research, we examine both the verbal content and the presentation style of the qualitative portion of the annual reports using natural language processing tools and explore linguistic features that distinguish fraudulent annual reports from nonfraudulent annual reports. Our results indicate that employment of linguistic features is an effective means for detecting fraud. We were able to improve the prediction accuracy of our fraud detection model from initial baseline results of 56.75 percent accuracy, using a “bag of words” approach, to 89.51 percent accuracy when we incorporated linguistically motivated features inspired by our informed reasoning and domain knowledge.
Skip Nav Destination
Article navigation
1 December 2010
Research Article|
January 01 2010
Can Linguistic Predictors Detect Fraudulent Financial Filings?
Jagdish Gangolly;
Jagdish Gangolly
University at Albany, SUNY
Search for other works by this author on:
Ozlem Uzuner
Ozlem Uzuner
University at Albany, SUNY
Search for other works by this author on:
Online ISSN: 1558-7940
Print ISSN: 1554-1908
American Accounting Association
2010
Journal of Emerging Technologies in Accounting (2010) 7 (1): 25–46.
Citation
Sunita Goel, Jagdish Gangolly, Sue R. Faerman, Ozlem Uzuner; Can Linguistic Predictors Detect Fraudulent Financial Filings?. Journal of Emerging Technologies in Accounting 1 December 2010; 7 (1): 25–46. https://doi.org/10.2308/jeta.2010.7.1.25
Download citation file:
Pay-Per-View Access
$25.00