Text Converter
The Sheetize Text Converter for .NET provides a straightforward way to turn files such as PDFs, DOCX, and HTML into plain‑text and back again, supporting use‑cases like data mining, long‑term storage, and interaction with text‑only platforms.
Key Features
Extract Text from Documents
Pull textual data from various file types, useful for processing pipelines and archival purposes.
Generate Documents from Text
Build structured files from raw text, simplifying the creation of reports or formatted outputs.
Detailed Guide
Turning a Document into Plain Text
Steps to convert a file to plain‑text:
- Prepare the Converter – use
TextConverter. - Configure Options – use
SaveOptionsto set conversion parameters. - Specify Input/Output – provide source and destination file locations.
- Run the Process – call
Processwith the prepared options.
Sample: PDF to Text Conversion
// Step 1: Create the converter
var loadOptions = new LoadOptions
{
InputFile = @"D:\Input.xlsx"
};
var saveOptions = new SaveOptions
{
OutputFile = @"D:\Output.txt"
};
TextConverter.Process(loadOptions, saveOptions);The converter is vital for scenarios requiring raw text extraction, such as building searchable archives or feeding data into text‑centric workflows.