Tailer Documentation
  • What is Tailer Platform?
  • Getting Started
    • Prepare your local environment for Tailer
    • Install Tailer SDK
    • Set up Google Cloud Platform
    • Encrypt your credentials
  • [Tutorial] Create a first data pipeline
    • Introduction
    • Prepare the demonstration environment
    • Copy files from one bucket to another
    • Load files into BigQuery tables
    • Prepare data
    • Build predictions
    • Export data
    • Congratulations!
    • [Video] Automatic Script
      • SQL script file
      • DDL script file
      • Tables to Tables script file
      • Launch configuration and furthermore
  • Data Pipeline Operations
    • Overview
    • Set constants with Context
      • Context configuration file
    • Move files with Storage to Storage
      • Storage to Storage configuration file
    • Load data with Storage to Tables
      • Storage to Tables configuration file
      • Storage to Tables DDL files
    • Stream incoming data with API To Storage
      • API To Storage configuration file
      • API To Storage usage examples
    • Transform data with Tables to Tables
      • Tables to Tables configuration file
      • Table to Table SQL and DDL files
    • Export data with Tables to Storage
      • [V3] Table to Storage configuration file
      • Table to Storage SQL file
      • [V1-V2: deprecated] Table to Storage configuration file
    • Orchestrate processings with Workflow
      • [V2] Workflow configuration file
      • [V1: deprecated] Workflow configuration file
    • Convert XML to CSV
      • Convert XML to CSV configuration file
    • Use advanced features with VM Launcher
      • Process code with VM Launcher
        • VM Launcher configuration file for code processing
      • Encrypt/Decrypt data with VM Launcher
        • VM Launcher configuration file for data encryption
        • VM Launcher configuration file for data decryption
    • Monitoring and Alerting
      • Monitoring and alerting parameters
    • Asserting Data quality with Expectations
      • List of Expectations
    • Modify files with File Utilities
      • Encrypt/Decrypt data with File Utilities
        • Configuration file for data encryption
        • Configuration file for data decryption
    • Transfer data with GBQ to Firestore
      • Table to Storage: configuration file
      • Table to Storage: SQL file
      • VM Launcher: configuration file
      • File-to-firestore python file
  • Tailer Studio
    • Overview
    • Check data operations' details
    • Monitor data operations' status
    • Execute data operations
    • Reset Workflow data operations
    • Archive data operations
    • Add notes to data operations and runs
    • View your data catalog
    • Time your data with freshness
  • Tailer API
    • Overview
    • Getting started
    • API features
  • Release Notes
    • Tailer SDK Stable Releases
    • Tailer Beta Releases
      • Beta features
      • Beta configuration
      • Tailer SDK API
    • Tailer Status
Powered by GitBook
On this page
  • 💡 What is the Convert XML to CSV operation?
  • ✅ Supported file types
  • Source files
  • Export files
  • ⚙️ How it works
  • 📋 How to deploy a Convert XML to CSV data operation

Was this helpful?

Edit on GitHub
  1. Data Pipeline Operations

Convert XML to CSV

Learn how to convert XML files into CSV files using a Convert XML to CSV data operation.

Previous[V1: deprecated] Workflow configuration fileNextConvert XML to CSV configuration file

Last updated 3 years ago

Was this helpful?

💡 What is the Convert XML to CSV operation?

The Convert XML to CSV data operation allows you to retrieve all the information contained in a possibly complex XML file into a set of CSV files, all located in a Google Cloud Storage bucket. You can later convert your CSV files into database tables using a data operation.

Note that the XML file provided must be well-formed and valid against a matching XSD file (defining its elements and attributes, and the rules that apply to them).

The XML and XSD file must have the same name (suffix excluded). Refer to for more details.

If the XML file contains entities not set in the XSD, then the corresponding data will not be exported to CSV files.

✅ Supported file types

Source files

  • XML + XSD file pair(s)

Export files

  • Multiple TSV (Tab Separated Values) + DDL file pairs

⚙️ How it works

Every time a new file matching the specified XML file name pattern appears in a given directory of a Google Cloud Storage bucket:

  • The XML file is checked against the matching XSD file.

  • If the XML file is valid, the conversion process is launched.

  • A set of CSV files with their matching DDL files (describing their schema) is generated in the working directory.

  • The source XML files are deleted from the working directory.

  • If set, a filtering occurs at the end of the process to remove unwanted CSV files.

📋 How to deploy a Convert XML to CSV data operation

  1. Create a working folder as you want, and create a JSON file for your data operation inside.

  2. Place the XSD file in the same location as your JSON file.

  3. Access your working folder by running the following command:

    cd "[path to your working folder]"
  4. To deploy the data operation, run the following command:

    tailer deploy configuration your-configuration.json
  5. For your Convert XML to CSV data operation to be executed, you need to place a file into the source folder.

  6. Access the GCS bucket to check your output files (CSV and DDL files).

Access your tailer folder (created during ).

Prepare your JSON configuration file. Refer to this page to learn about all the .

Log in to to check the status and details of your data operation.

Storage to Tables
this page
installation
parameters
Tailer Studio