Automated Document Parser

API Reference

  • API Reference
    • Core Module
      • DocumentParser
        • DocumentParser.__init__()
        • DocumentParser.parse()
        • DocumentParser.parse_multiple()
        • DocumentParser.get_loaded_files()
    • Configuration
    • Utilities
      • detect_file_type()
      • is_supported_file()
      • validate_file_path()
      • get_file_info()
    • Loaders
      • Main File Loaders
        • FileLoader
        • load_document()
      • File Load Module
        • Base File Loader
        • Text File Loader
        • CSV File Loader
        • JSON File Loader
        • DOCX File Loader
        • HTML File Loader
      • PDF Load Module
        • PDF Loader Classes
        • Base PDF Loader
        • PyPDF Loader
        • Unstructured PDF Loader
        • Amazon Textract Loader
        • Mathpix Loader
        • PDFPlumber Loader
        • PyPDFium2 Loader
        • PyMuPDF Loader
        • PyMuPDF4LLM Loader
        • OpenDataLoader PDF Loader
  • Modules
    • automated_document_parser
      • DocumentParser
        • DocumentParser.__init__()
        • DocumentParser.get_loaded_files()
        • DocumentParser.parse()
        • DocumentParser.parse_multiple()
      • FileLoader
        • FileLoader.__init__()
        • FileLoader.load()
      • load_document()

Additional Resources

  • GitHub Repository

© Copyright 2025, Pulkit Dhingra.