Text Converter

The Sheetize Text Converter for .NET converts documents such as PDF, DOCX, and HTML files to plain text and back again. Typical uses include data mining, long-term archival, and integration with text-only platforms.

Key Features

Extract Text from Documents

Pull textual data from various file types, useful for processing pipelines and archival purposes.

Generate Documents from Text

Build structured files from raw text, simplifying the creation of reports or formatted outputs.

Detailed Guide

Turning a Document into Plain Text

Steps to convert a file to plain‑text:

  1. Prepare the Converter – use TextConverter.
  2. Configure Options – create LoadOptions and SaveOptions to set conversion parameters.
  3. Specify Input/Output – set InputFile on the load options and OutputFile on the save options.
  4. Run the Process – call TextConverter.Process with the prepared options.

Sample: XLSX to Text Conversion

// Describe the source document
var loadOptions = new LoadOptions
{
    InputFile = @"D:\Input.xlsx"
};

// Describe the plain-text destination
var saveOptions = new SaveOptions
{
    OutputFile = @"D:\Output.txt"
};

// Run the conversion with the prepared options
TextConverter.Process(loadOptions, saveOptions);
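
Before running a conversion like the one above, it can help to confirm that the input exists and that the output folder is in place. The following is a minimal sketch using only standard .NET file APIs. ConversionGuard and ConvertSafely are hypothetical names, not part of Sheetize, and the convert delegate stands in for whatever call performs the conversion.

```csharp
using System;
using System.IO;

static class ConversionGuard
{
    // Hypothetical helper (not a Sheetize API): validates the paths,
    // then hands them to the supplied conversion callback.
    public static void ConvertSafely(string inputFile, string outputFile, Action<string, string> convert)
    {
        if (!File.Exists(inputFile))
            throw new FileNotFoundException("Input file not found.", inputFile);

        // Create the destination folder if it does not exist yet.
        var outputDir = Path.GetDirectoryName(Path.GetFullPath(outputFile));
        if (!string.IsNullOrEmpty(outputDir))
            Directory.CreateDirectory(outputDir);

        convert(inputFile, outputFile);
    }
}
```

To use it with the converter, pass a lambda that builds LoadOptions and SaveOptions from the two paths and calls TextConverter.Process, as in the sample.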

The converter is well suited to scenarios that require raw text extraction, such as building searchable archives or feeding data into text-centric workflows.
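
As an illustration of a text-centric workflow, the sketch below searches extracted text for a keyword. It uses only standard .NET APIs; TextSearch and FindLines are hypothetical names introduced here, not part of Sheetize.

```csharp
using System;
using System.Collections.Generic;

static class TextSearch
{
    // Hypothetical helper: returns the 1-based numbers of the lines
    // that contain the term, ignoring case.
    public static List<int> FindLines(IEnumerable<string> lines, string term)
    {
        var hits = new List<int>();
        int lineNumber = 0;
        foreach (string line in lines)
        {
            lineNumber++;
            if (line.IndexOf(term, StringComparison.OrdinalIgnoreCase) >= 0)
                hits.Add(lineNumber);
        }
        return hits;
    }
}
```

For example, TextSearch.FindLines(File.ReadLines(@"D:\Output.txt"), "invoice") would scan the converted file line by line without loading it fully into memory.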
