This guide provides a quick overview for getting started with the BeautifulSoup4 document loader. For detailed documentation of all BSHTMLLoader features and configurations, head to the API reference.
Overview
Integration details
| Class | Package | Local | Serializable | JS support |
|---|---|---|---|---|
| BSHTMLLoader | langchain-community | ✅ | ❌ | ❌ |
Loader features
| Source | Document Lazy Loading | Native Async Support |
|---|---|---|
| BSHTMLLoader | ✅ | ❌ |
Setup
To access the BSHTMLLoader document loader, you’ll need to install the langchain-community integration package and the bs4 Python package.
Credentials
No credentials are needed to use the BSHTMLLoader class.
To enable automated tracing of your model calls, set your LangSmith API key:
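As a minimal sketch, the tracing setup might look like the following. The helper name `configure_langsmith` and its `api_key` parameter are hypothetical and used only for illustration; `LANGSMITH_API_KEY` and `LANGSMITH_TRACING` are the environment variables LangSmith reads.

```python
import getpass
import os
from typing import Optional


def configure_langsmith(api_key: Optional[str] = None) -> None:
    """Set the environment variables that LangSmith reads for tracing.

    Hypothetical helper: prefers an explicit key, then an existing
    environment value, and only prompts interactively as a last resort.
    """
    key = api_key or os.environ.get("LANGSMITH_API_KEY")
    if not key:
        key = getpass.getpass("Enter your LangSmith API key: ")
    os.environ["LANGSMITH_API_KEY"] = key
    os.environ["LANGSMITH_TRACING"] = "true"
```

Call `configure_langsmith()` once before running the loader. Tracing is optional, and the loader works without it.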
Installation
Install langchain-community and bs4.
Initialization
Now we can instantiate our loader object and load documents:
Load
Lazy load
Adding separator to BS4
We can also pass a separator to use when calling get_text on the soup.
API reference
For detailed documentation of all BSHTMLLoader features and configurations head to the API reference: python.langchain.com/api_reference/community/document_loaders/langchain_community.document_loaders.html_bs.BSHTMLLoader.html