In recent years we have observed many data leaks such as the Luxleaks, the Panama papers, the Paradise papers, the Pandora papers, etc. These leaks represent enormous quantities of data. However, most of this data is in textual form, making it difficult to extract relevant information for the average analyst. In this course we want to teach skills relevant to reviewing text data with the purpose of uncovering evidence for illegal activity. We will focus on how to obtain raw data through web scraping. Then, we will format the raw data into a final dataset that can be used to answer relevant real-world questions.
This course could be useful for:
- Students who are interested in obtaining advance knowledge of the application in textual analysis.
- Fraud analysts who want to find evidence for links between different users.
- Journalists that are interested in obtaining skills in finding the story through the large data source.
This course is an extension to:
- BAN432 Applied Textual Data Analysis for Business and Finance.
- BUS465 Detecting Corporate Crime.