3 projects
pdf2markdown4llm
A library to convert PDF files to Markdown format.
LiteLLMJson
This library offers functionality to cleanly extract JSON from LLM responses and generate prompts for LLM that return JSON. It features a simple implementation while maintaining high versatility.
MainContentExtractor
A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.