PM Martijn
Preprocessing your data
Often, the documents you have gathered are not directly in a usable state to apply computational analysis on them. For example, the documents might be in the wrong format: you might have received PDF-files, but you only need the plain text that is contained by the files. Or you want to look at specific segments of text and will thus split your documents into different segments.
TODO:
- Uitwerken voorbeelden van soorten preprocessing
- Uitwerken illustratie van wanneer je wat voor preprocessing nodig zou kunnen hebben
- Links naar voorbeelden in de example projects of elders
Last updated: 29-Nov-2024