PM Martijn

Preprocessing your data

Often, the documents you have gathered are not directly in a usable state to apply computational analysis on them. For example, the documents might be in the wrong format: you might have received PDF-files, but you only need the plain text that is contained by the files. Or you want to look at specific segments of text and will thus split your documents into different segments. 

TODO: 

Last updated: 29-Nov-2024