How Do I Process a Different Encoding

While Integrate.io ETL platform works with primarily UTF-8 encoded data, other character encodings can be processed with steps as shown in this example:

This dataflow used the functions as detailed in the following steps. These would need to be replaced with the relevant encoding, fields delimiters specific to your use case

Read data as raw and binary data type.
Convert the byte array data in the given encoding to a string type using
ByteArrayToString(body, 'UTF-16LE').
Split the data from step 2 using STRSPLITTOBAG(body,'\n') and then a Flatten() to get individual records or lines.
Remove headers as applicable (if it is from an API) with the filter transformation. Text matches(regex) options can be useful here.
Individual lines are split based on the relevant delimiter using CSVSPLIT(line, '\t').
Extract the required fields from the tuple as line.$0, line.$1,line.$2 and so on.

ETL & Reverse ETL
Knowledge Base

ETL & Reverse ETL Knowledge base

Getting started

5 Articles

How Do I ...

12 Articles

Connectivity And Security

49 Articles

Creating packages

55 Articles

Using clusters

4 Articles

Running and monitoring jobs

8 Articles

Configuring your Integrate.io ETL environment

13 Articles

Programming and API

5 Articles

Other

189 Articles

New Releases

18 Articles

How Do I Process a Different Encoding

Solutions

Support

Company

Language

ETL & Reverse ETL Knowledge Base

ETL & Reverse ETL Knowledge base

Getting started

5 Articles

How Do I ...

12 Articles

Connectivity And Security

49 Articles

Creating packages

55 Articles

Using clusters

4 Articles

Running and monitoring jobs

8 Articles

Configuring your Integrate.io ETL environment

13 Articles

Programming and API

5 Articles

Other

189 Articles

New Releases

18 Articles

How Do I Process a Different Encoding

See Also

Solutions

Support

Company

Language

ETL & Reverse ETL
Knowledge Base