I would appreciate your help with the following:
I have several hundred documents, each about a hundred pages long. Some keywords appear in different paragraphs with different meanings, and the meaning can be determined from the section title the paragraph falls under. I therefore want to apply bag-of-words (BoW) separately to each section. Given that the files are in plain-text format, my questions are:
How can I locate the boundaries of each section by finding its section title? (Please note that the number of paragraphs within a section differs from document to document.)
Some of the documents may not have section titles. What should I do with these documents?
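To make the goal concrete, here is a minimal sketch of what I have in mind, not a definitive implementation. It assumes the section titles are known in advance and that each title sits on its own line, optionally preceded by a number such as "2.". The titles in `SECTION_TITLES`, the function names, and the `PREAMBLE` and `UNLABELED` labels are all hypothetical placeholders. Putting a document without titles into a single `UNLABELED` section is just one possible fallback, which is why I am asking whether there is a better one.

```python
import re
from collections import Counter

# Hypothetical section titles; replace with the titles used in the real documents.
SECTION_TITLES = ["Introduction", "Risk Factors", "Conclusion"]


def split_sections(text, titles=SECTION_TITLES):
    """Map each section title to the text between it and the next title."""
    # Assumption: a title fills a whole line, optionally numbered ("2. Risk Factors").
    alternatives = "|".join(re.escape(t) for t in titles)
    pattern = re.compile(
        r"^\s*(?:\d+(?:\.\d+)*\.?\s+)?(" + alternatives + r")\s*$",
        re.IGNORECASE | re.MULTILINE,
    )
    matches = list(pattern.finditer(text))
    if not matches:
        # No titles found: keep the whole document as one unlabeled section.
        return {"UNLABELED": text}

    canonical = {t.lower(): t for t in titles}
    sections = {}
    preamble = text[:matches[0].start()]
    if preamble.strip():
        sections["PREAMBLE"] = preamble
    for i, m in enumerate(matches):
        # A section runs until the next title, so its paragraph count can vary.
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        name = canonical[m.group(1).lower()]
        sections[name] = sections.get(name, "") + text[m.end():end]
    return sections


def bag_of_words(text):
    """Count lowercase word tokens in a piece of text."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def bow_by_section(text, titles=SECTION_TITLES):
    """Build one bag-of-words per section of a single document."""
    return {name: bag_of_words(body)
            for name, body in split_sections(text, titles).items()}
```

With this, `bow_by_section(open(path).read())` would give one word-count table per section for each file, so the same keyword is counted separately under each section title.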