Scan default directory

Extension	Tool	Install
`.docx`	`docx2txt`	`apt-get install docx2txt`
`.doc`	`catdoc`	`apt-get install catdoc`
`.pdf`	`pdftotext`	`apt-get install poppler-utils`

Column	Type	Description
Client	VARCHAR(255)	Client identifier (first-level folder name)
Folder	VARCHAR(512) PK	Source folder path
Filename	VARCHAR(255) PK	Document filename
DocType	VARCHAR(16) PK	File extension (docx, doc, pdf)
Loadtime	DATETIME	When the document was processed
Content	LONGTEXT	Extracted raw text
ContentSentence	LONGTEXT	JSON array of tokenized sentences
ContentTokens	LONGTEXT	JSON array of POS-tagged tokens
Module	VARCHAR(255)	ObjServiceFilefeed

Command	Context Keys	Result Key
`scan`	`scan_path` (optional), `client` (optional)	`_filefeed_result`

¶ (c) TechnoCore - All Rights Reserved.