Import

The import module reads data from source files.

CSV/XLS/XLSX Import

CSV/XLS/XLSX module reads data from .csv (text files with delimeters) and spreadsheets .xlsx and .xls.

Limitations: The module reads only from the first sheet of .xlsx or .xls file. The module can't read from password-protected files.

Import module is responsible for detection of missing cells and their replacement with NULL values. Then in the Preprocessor module the NULL values will be replaced using one of the available methods of your choice.

To configure the import module:

  1. Click the Import button at Control panel. The file selection dialog opens.
  2. Select one of your data files, press OK. Then the Import configuration dialog opens.
  3. In the Import dialog set the importing parameters, press OK.
  1. If your project folder doesn't contain project settings yet, the Template selection dialog appears next to the Import configuration dialog.
  2. In that dialog, select one of suggested Templates and press OK.

If your project folder contains several data files, the Import module makes all of them available in the Data manager. Selection of just one file, already points importer module to the whole directory.

The file selected during the import procedure receives a special status Current in the Data manager. Only variables from the Current file can be used by the Data manager without filename prefixes, for example, var1 instead of filename.var1 for other files.

Import configuration

Import dialog

Read column labels from the 1st row

Reads column names from the first row of data file(s). The number of elements in the first row is used for detection of data table width.

Read row IDs from column

Optionally, if you have really unique identifiers of data rows, for example, calendar dates then you can tell the Importer in which column they are located and use them in visualization panels instead of default ID marks. In case of multiple data files, the column number belongs to the Current data file.

CSV delimiter

Sets a delimiter type. (Applicable to CSV files only)

Missing value mark

Replaces defined missing value marks in the dataset with regular NULL values.

Consider text cells as missing

Replaces any non-numeric values with regular NULL values.

After OK button of Import configuration window is pressed, the selected file path will appear in the title bar of GMDH Shell:

Data file format


Data files may consist of numeral or nominal (text) columns. Column names are allowed but not required.

Import module reads only the first sheet of XLS files. GMDH Shell does not provide tools for file editing but allows a user to keep the file opened for editing with MS Excel, OpenOffice Calc, LibreOffice Calc etc. It is not required to close the editor before pressing the Start button. After the data file is modified a user can save changes in its editor and start recalculation of results in GMDH Shell.

Data file examples

You are here: IntroductionImport
CC Attribution-Noncommercial 3.0 Unported
Valid CSS Driven by DokuWiki Recent changes RSS feed Valid XHTML 1.0