⚠️ Deprecated ⚠️

This feature is deprecated and will be removed in the future.

It is not recommended for use.

  • Import from "@langchain/community/document_loaders/web/playwright" instead. This entrypoint will be removed in 0.3.0.

Class representing a document loader for scraping web pages using Playwright. Extends the BaseDocumentLoader class and implements the DocumentLoader interface.

Constructors

Properties

options: undefined | PlaywrightWebBaseLoaderOptions
webPath: string

Methods

  • load(): Calls the scrape method and returns the scraped HTML content wrapped in a Document object.

    Returns Promise<Document[]>

    Promise that resolves to an array of Document objects.

  • scrape(): Calls the _scrape method to scrape the web page specified by the webPath property.

    Returns Promise<string>

    Promise that resolves to the scraped HTML content of the web page.

  • imports(): Static method that imports the necessary Playwright modules.

    Returns Promise<{
        chromium: BrowserType<{}>;
    }>

    Promise that resolves to an object containing the imported modules.
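
The delegation described in the method entries above (the Document-returning method calls scrape, which calls _scrape for the URL in webPath) can be sketched as follows. This is a minimal stand-in, not the library's implementation: `SketchWebLoader`, `SketchDocument`, `HtmlFetcher`, `fakeFetcher`, and `demo` are hypothetical names, `_scrape` is assumed to be static, the `source` metadata field is an assumption, and a fake fetcher replaces the Playwright browser so the sketch runs without a browser or network access.

```typescript
// Hypothetical stand-in for the Document type; the real class comes from LangChain.
interface SketchDocument {
  pageContent: string;
  metadata: Record<string, unknown>;
}

// Hypothetical signature for whatever fetches the rendered HTML.
// In the real loader this work is done through Playwright's chromium browser.
type HtmlFetcher = (url: string) => Promise<string>;

class SketchWebLoader {
  webPath: string;
  private fetchHtml: HtmlFetcher;

  constructor(webPath: string, fetchHtml: HtmlFetcher) {
    this.webPath = webPath;
    this.fetchHtml = fetchHtml;
  }

  // Assumed to be static: performs the scraping for a given URL.
  static async _scrape(url: string, fetchHtml: HtmlFetcher): Promise<string> {
    return fetchHtml(url);
  }

  // Scrapes the page identified by this.webPath.
  async scrape(): Promise<string> {
    return SketchWebLoader._scrape(this.webPath, this.fetchHtml);
  }

  // Wraps the scraped HTML in a single document and returns it in an array.
  async load(): Promise<SketchDocument[]> {
    const html = await this.scrape();
    // Assumption: the source URL is recorded in the metadata.
    return [{ pageContent: html, metadata: { source: this.webPath } }];
  }
}

// Fake fetcher so the sketch needs neither a browser nor a network connection.
const fakeFetcher: HtmlFetcher = async (url) =>
  `<html><body>stub for ${url}</body></html>`;

async function demo(): Promise<SketchDocument[]> {
  const loader = new SketchWebLoader("https://example.com", fakeFetcher);
  return loader.load();
}
```

With the actual class, the call shape is presumably `await new PlaywrightWebBaseLoader(url).load()`, with Playwright installed as a peer dependency; the injected fetcher here exists only to keep the sketch self-contained.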
