Input data

You need to input data before you can transform it. Data can be input by:

dragging a file onto the Center pane; or

clicking the From File or From Clipboard button in the Left pane

 

Enter the file location in File or click the browse button. For Excel spreadsheets you also need to add a sheet name, e.g. 'MySpreadsheet.xlsx[Sheet1]'.
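
The 'file[sheet]' form above can be illustrated with a small Python sketch that separates the file path from the sheet name. This is only an illustration of the syntax; the function name split_excel_location is hypothetical and this is not necessarily how Easy Data Transform parses the location internally.

```python
import re

def split_excel_location(location):
    """Split 'Book.xlsx[Sheet1]' into ('Book.xlsx', 'Sheet1').

    Hypothetical helper: returns (location, None) when no trailing
    [sheet] part is present.
    """
    match = re.fullmatch(r"(?P<path>.+?)\[(?P<sheet>[^\[\]]+)\]", location)
    if match is None:
        return location, None
    return match.group("path"), match.group("sheet")

path, sheet = split_excel_location("MySpreadsheet.xlsx[Sheet1]")
print(path, sheet)  # MySpreadsheet.xlsx Sheet1
```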

 

Easy Data Transform can input data from files in the following formats:

delimited text file (e.g. CSV or TSV) with various delimiters

Excel .xlsx or .xls

Fixed width

Plain text

JSON

vCard

XML

 

You can select the input item in the Center pane and change any related options in the Right pane.

 

[Screenshot: input options in the Right pane]

 

Set Type to the file type. The default type will be set according to the file extension and the settings in the Input Extensions tab of the Preferences window.

 

For text files Easy Data Transform will treat CRLF, LF or CR control characters as line endings.
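
As a rough Python sketch of this rule (not Easy Data Transform's actual implementation; split_lines is a hypothetical name), text can be split so that CRLF, LF and CR each count as a single line ending:

```python
import re

def split_lines(text):
    """Split text on CRLF, LF or CR, treating each as one line ending."""
    # CRLF is listed first so that "\r\n" counts as one ending, not two.
    return re.split(r"\r\n|\n|\r", text)

mixed = "a,b\r\nc,d\ne,f\rg,h"
lines = split_lines(mixed)
print(lines)  # ['a,b', 'c,d', 'e,f', 'g,h']
```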

 

Easy Data Transform will make an intelligent guess at the:

column delimiter and quoting for delimited text files (e.g. Comma)

column widths for fixed width text files

text encoding for text files (e.g. UTF-8)

presence of a header row in the data

You can override any of these guesses by selecting the input item and changing the Delimiter, Columns, Encoding and has header fields in the Right pane.

 

Set Ignore to the number of rows you want to skip before you start inputting. Note that this takes place before any empty rows are removed by Ignore empty rows.
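
The order of operations matters: empty rows at the top of the file count towards the rows skipped by Ignore. A minimal Python sketch of that ordering, with hypothetical function and parameter names:

```python
def skip_then_drop_empty(rows, ignore, ignore_empty_rows=True):
    """Skip the first `ignore` rows, then optionally drop empty rows."""
    remaining = rows[ignore:]  # step 1: Ignore
    if ignore_empty_rows:      # step 2: Ignore empty rows
        # A row is empty only if every value is the empty string;
        # whitespace is not considered empty.
        remaining = [row for row in remaining if any(v != "" for v in row)]
    return remaining

rows = [
    ["report"],
    ["", ""],        # empty, but still counted as one of the 2 skipped rows
    ["id", "name"],
    ["1", "Ann"],
    ["", ""],        # removed in step 2
    ["2", "Bob"],
]
result = skip_then_drop_empty(rows, ignore=2)
print(result)  # [['id', 'name'], ['1', 'Ann'], ['2', 'Bob']]
```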

 

For delimited text you can:

Set Delimiter to the delimiter character.

Check ignore repeated delimiters if you want to treat 2 or more consecutive delimiters as a single delimiter.

Set Quoting according to whether " quoting is used so that the delimiter can appear within values.

See delimited text for more details.
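
The effect of the repeated delimiter and quoting options can be sketched in Python using the standard re and csv modules. This only illustrates the general behavior described above; it is not Easy Data Transform's own parser.

```python
import csv
import io
import re

# Repeated delimiters: each comma separates a value by default,
# or runs of commas can be treated as a single delimiter.
line = "a,,b,,,c"
plain = line.split(",")
collapsed = re.split(r",+", line)
print(plain)      # ['a', '', 'b', '', '', 'c']
print(collapsed)  # ['a', 'b', 'c']

# Quoting: with " quoting, a delimiter inside quotes is part of the value.
quoted_line = '1,"Smith, Ann",London\n'
with_quoting = next(csv.reader(io.StringIO(quoted_line), delimiter=",", quotechar='"'))
without_quoting = next(csv.reader(io.StringIO(quoted_line), delimiter=",", quoting=csv.QUOTE_NONE))
print(with_quoting)     # ['1', 'Smith, Ann', 'London']
print(without_quoting)  # ['1', '"Smith', ' Ann"', 'London']
```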

 

For fixed width set Columns to Manual to set column widths. See fixed width for more details.

 

For JSON or XML set Format to Long (more rows) or Wide (more columns) depending on how you want to treat arrays/repeat values. See JSON or XML for more details.
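
The difference between the two formats can be sketched in Python for a single record containing an array. The function names and the 'phones.1' style column names are assumptions made for illustration; the column names Easy Data Transform actually generates may differ.

```python
record = {"name": "Ann", "phones": ["111", "222"]}

def to_long(rec, key):
    """Long: one output row per array element (more rows)."""
    other = {k: v for k, v in rec.items() if k != key}
    return [{**other, key: item} for item in rec[key]]

def to_wide(rec, key):
    """Wide: one output column per array element (more columns)."""
    flat = {k: v for k, v in rec.items() if k != key}
    for i, item in enumerate(rec[key], start=1):
        flat[f"{key}.{i}"] = item  # hypothetical column naming
    return flat

long_rows = to_long(record, "phones")
wide_row = to_wide(record, "phones")
print(long_rows)  # [{'name': 'Ann', 'phones': '111'}, {'name': 'Ann', 'phones': '222'}]
print(wide_row)   # {'name': 'Ann', 'phones.1': '111', 'phones.2': '222'}
```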

 

Check has header if your dataset has a header row containing column names.

 

Set Uses schema to Yes if you want to use a schema to manage changes to columns in the input.

 

Check trim whitespace to trim any whitespace (e.g. tabs or spaces) off the start or end of header and data values.

 

Check simplify whitespace to perform the following in header and data values:

Replace any Tabs or Line Feeds with a Space.

Replace non-standard spaces (such as Non-breaking Space, Thin Space etc.) with standard spaces.

Remove Carriage Returns.
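
The three steps above can be sketched in Python as follows. The list of non-standard spaces is an assumption for illustration (the exact set of characters handled by Easy Data Transform is not listed here), and simplify_whitespace is a hypothetical name.

```python
import re

# Assumed examples: non-breaking space, thin space,
# narrow no-break space and figure space.
NON_STANDARD_SPACES = "\u00a0\u2009\u202f\u2007"

def simplify_whitespace(value):
    """Apply the three simplify whitespace steps to one value."""
    value = re.sub(r"[\t\n]", " ", value)                   # tabs and line feeds -> space
    value = re.sub(f"[{NON_STANDARD_SPACES}]", " ", value)  # non-standard spaces -> space
    value = value.replace("\r", "")                         # remove carriage returns
    return value

simplified = simplify_whitespace("a\tb\r\nc\u00a0d")
print(repr(simplified))  # 'a b c d'
```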

 

Check ignore empty rows to remove any rows that have only empty values (whitespace is not considered empty).

 

Check ignore empty columns to remove any columns that have only empty values (whitespace is not considered empty). This is carried out after any schema has been applied. Header values are ignored when deciding if a column is empty.
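
A minimal Python sketch of the empty column rule, assuming the data is held as a header list plus a list of rows (drop_empty_columns is a hypothetical name). Note that the header is not consulted when deciding whether a column is empty, and a value containing only whitespace keeps its column.

```python
def drop_empty_columns(header, rows):
    """Remove columns whose data values are all empty strings."""
    keep = [
        i for i in range(len(header))
        if any(row[i] != "" for row in rows)  # header value is ignored here
    ]
    new_header = [header[i] for i in keep]
    new_rows = [[row[i] for i in keep] for row in rows]
    return new_header, new_rows

header = ["id", "notes", "name"]
rows = [["1", "", "Ann"], ["2", "", "Bob"]]
new_header, new_rows = drop_empty_columns(header, rows)
print(new_header)  # ['id', 'name']
print(new_rows)    # [['1', 'Ann'], ['2', 'Bob']]
```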

 

Check ignore hidden rows to remove any hidden rows (Excel files only).

 

Check ignore hidden columns to remove any hidden columns (Excel files only). This is carried out before any schema has been applied, so these columns may be added back with default values by a schema.

 

Check watch file if you want Easy Data Transform to respond whenever it detects that the file has been changed:

Run>Auto Run selected: the input automatically reloads the file.

Run>Auto Run not selected: the input is marked as 'Needs update'.

 

Use Meta info if you wish to add some meta information about the input dataset, e.g. the name of an input file or the date it was created.

 

Use Alias to identify the input for batch processing or command line processing.

 

To change the file being used by an input, select the input item and change the file location in the Right pane (e.g. by clicking the '...' browse file button), rather than disconnecting the input and connecting a new one.