Documents are ingested through /v1/artifacts/ingest. Once ingested, they are chunked, embedded, and stored in the vector store for retrieval.
Target a specific collection:
This deletes everything in local_data/, including the vector store. It cannot be undone.
To ingest an entire folder from the command line, enable local ingestion in your settings:
Then run:
Watch mode (re-ingest on file changes):
PrivateGPT handles plain text natively. The following formats are also supported with built-in parsers:
.pdf · .docx · .pptx · .ppt · .pptm · .hwp · .epub · .md · .csv · .json · .ipynb · .mbox · .jpg · .jpeg · .png · .mp3 · .mp4
Any other file type is read as plain text.
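The rule above (a fixed set of extensions goes to built-in parsers, everything else falls back to plain text) can be sketched as a small lookup. This is only an illustration for checking a folder before ingesting it, not PrivateGPT's own code: the helper name expected_handling is hypothetical, and treating extensions as case-insensitive is an assumption.

```python
from pathlib import Path

# Extensions listed above as having built-in parsers.
BUILT_IN_PARSER_EXTENSIONS = frozenset({
    ".pdf", ".docx", ".pptx", ".ppt", ".pptm", ".hwp", ".epub", ".md",
    ".csv", ".json", ".ipynb", ".mbox", ".jpg", ".jpeg", ".png",
    ".mp3", ".mp4",
})


def expected_handling(filename: str) -> str:
    """Hypothetical helper: predict how a file would be read.

    Assumption: extension matching is case-insensitive. The list above
    does not say either way.
    """
    suffix = Path(filename).suffix.lower()
    if suffix in BUILT_IN_PARSER_EXTENSIONS:
        return "built-in parser"
    # Unlisted extensions, and files with no extension, fall back to plain text.
    return "plain text"


if __name__ == "__main__":
    for name in ("report.pdf", "notes.txt", "Makefile"):
        print(f"{name} -> {expected_handling(name)}")
```

Only the final suffix is considered, so a name like archive.tar.gz is looked up as .gz and would be treated as plain text in this sketch.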