A transformer that converts HTML content to plain text.
const loader = new CheerioWebBaseLoader("https://example.com/some-page");const docs = await loader.load();const splitter = new RecursiveCharacterTextSplitter({ maxCharacterCount: 1000,});const transformer = new HtmlToTextTransformer();// The sequence of text splitting followed by HTML to text transformationconst sequence = splitter.pipe(transformer);// Processing the loaded documents through the sequenceconst newDocuments = await sequence.invoke(docs);console.log(newDocuments); Copy
const loader = new CheerioWebBaseLoader("https://example.com/some-page");const docs = await loader.load();const splitter = new RecursiveCharacterTextSplitter({ maxCharacterCount: 1000,});const transformer = new HtmlToTextTransformer();// The sequence of text splitting followed by HTML to text transformationconst sequence = splitter.pipe(transformer);// Processing the loaded documents through the sequenceconst newDocuments = await sequence.invoke(docs);console.log(newDocuments);
Protected
Generated using TypeDoc
A transformer that converts HTML content to plain text.
Example