What is a workspace?
What is a workspace?
Workspace is a set of services. There 3 workspaces available:
Data-Prep - This workspace is visible to all IB Studio users.
Data Profiling, Stitching, and Cleaning services.
Analytics - This workspace is visible to all IB Studio users
Descriptive, Predictive, and Prescriptive services on Install Base Data.
Analytics & Setup- This workspace is visible only to IBStudio-Admin/IBStudio-SuperUser
Analytics: Descriptive, Predictive, and Prescriptive services on Install Base Data.
Setup: Configuration, Feature Toggle, and User management
How are the services organized?
How are the services organized?
Data-Prep workspace
Data-Prep(beta)
DataLoader
DeltaLoad
Dedupe
Analytics
FCR
Auto Segmentation
Customer Health
Propensity
Purchase Rate
SMO
Service Contract Opportunities
Whom To Call
Analytics&Setup
Analytics: All above services that are mentioned under Analytics.
Setup
Workflows
Configuration
Feature Toggle
User Management
What is an iteration?
What is an iteration?
An iteration is the execution of a set of steps in a recipe. If new steps are added or existing steps are modified, then a new iteration # is created.
What is a recipe?
What is a recipe?
A recipe has a description of the steps to be applied to a data set in order to prepare it for data analysis.
Which operations can I perform in a recipe?
Which operations can I perform in a recipe?
You can perform column-level operations like Delete, Split, Merge, and Rename columns. To modify rows in columns you can perform Find/Replace, Delete rows by conditions, Create columns by conditions, Drop duplicates, etc. You can also perform SQL-like join operations i.e Left, Right, Inner join.
What if my file has duplicate columns?
What if my file has duplicate columns?
A validation error will be thrown if the file has duplicate columns.
What if my files have blank columns?
What if my files have blank columns?
A validation error will be thrown if the file has blank columns. The user will have to remove blank column names from the header in the CSV files to run a job.
Which file formats are supported?
Which file formats are supported?
Only CSV files are supported. We are working on XLS and TXT files right now and this feature will be rolled out soon.
What is the sample size?
What is the sample size?
The sample size of 25000. If a dataset has greater than 25000 records, then a sample of 25000 will be created, else the sample size will be equal to that of the input dataset.
Which date formats do you process?
Which date formats do you process?
Any column having the below date formats shall be converted into this standard format.
1999-03-31, 1999-03-31 04:30:18, 1999-03-31 04:30:18.0, 1999-03-31T04:30:18.528Z
1999/03/31, 1999/03/31 04:30:18, 1999/03/31 04:30:18.0, 1999/03/31T04:30:18.528Z
1999.03.31, 1999.03.31 04:30:18, 1999.03.31 04:30:18.0, 1999.03.31T04:30:18.528Z
31-03-1999, 31-03-1999 04:30:18, 31-03-1999 04:30:18.0, 31-03-1999T04:30:18.528Z
31/03/1999, 31/03/1999 04:30:18, 31/03/1999 04:30:18.0, 31/03/1999T04:30:18.528Z
31.03.1999, 31.03.1999 04:30:18, 31.03.1999 04:30:18.0, 31.03.1999T04:30:18.528Z
What is the standard output date format?
What is the standard output date format?
The standard output format is YYYY-MM-DD.
What is the precision of decimal, and float columns?
What is the precision of decimal, and float columns?
Upto 4 decimal places. Even for comparison operations up to 4 decimal places will be considered.
What are the possible datatypes?
What are the possible datatypes?
Boolean, Number, String, Date.
How do I ingest directly to the data_loader database?
How do I ingest directly to the data_loader database?
You can choose the map to the destination table in the data-prep service and publish the job.
How to do a union of multiple files?
How to do a union of multiple files?
The user can select one or more CSV files from the list, map the columns and submit the job.
Which JOIN types are supported?
Which JOIN types are supported?
LEFT, RIGHT, INNER
How do I reuse my recipe?
How do I reuse my recipe?
To reuse a recipe, please let the engineering team know. We are working on functionality to import/export a recipe.
How to create a new column with a constant value?
How to create a new column with a constant value?
You can create a new column Conditions. Provide the same constant value in the IF and ELSE condition. This will create a new column with the constant value provided.
How many split operations can I perform?
How many split operations can I perform?
You can perform a maximum of 5 splits on a string column.
What is cardinality?
What is cardinality?
Cardinality tells us the uniqueness of data in a column.
The lower the cardinality, the more duplicate values are present in a column.
If the cardinality is Very High then it means that more than 75% of values are unique.
If the cardinality is High then it means that 50% to 75% of values are unique.
If the cardinality is Medium then it means that 25% to 75% of values are unique.
If the cardinality is Low then it means that less than 25% of values are unique.
If the cardinality is Undefined then it means that the column contains only null values.
When is a job marked FAILED_REOPEN?
When is a job marked FAILED_REOPEN?
A job is marked failed_reopen when one of the recipe iterations get failed. The user can reopen the failed job recipe steps that failed.
What does an IB Studio Job workflow look like?
What does an IB Studio Job workflow look like?
When an user starts a new job from a CSV file the flow is
Queued -> InProcess ->Complete/Failed
Once the job is Complete, the user can add recipe steps and run an iteration.
Queued(iteration#)-> Running(iteration#)->Complete(iteration#)/Failed_Reopen(iteration#)->Published(iteration#)
The iteration# starts with 1 would be incremented by 1 for every subsequent run.
How do I escape special characters in text inputs?
How do I escape special characters in text inputs?
If the source conditions for string comparison contain SINGLE QUOTES or DOUBLE QUOTES or a special character, then use backward slash \
The user should also escape the special character, with \ in places where the input box is of type Text for custom values. Also note, you cannot escape a \ as this would lead to expression evaluation issues. So to remove \ from your strings, use find/replace.
What are the best resolution size and browser requirements?
What are the best resolution size and browser requirements?
For laptops with a 14” display, it works best at 75-85% zoom, based on the user’s screen resolution. Supported only in Google Chrome
How do I reorder recipe steps?
How do I reorder recipe steps?
The drag and Drop function is available in the recipe section to help reorder steps
Which is the best-supported browser?
Which is the best-supported browser?
Google Chrome
How many records can be displayed in the data grid?
How many records can be displayed in the data grid?
A maximum of 1Million records per job can be displayed on the grid in a paginated format(page size of 1000 records). If your input dataset has more than 1M records, then a random of 1M records are imported into the grid.
Which conditional operations can I perform?
Which conditional operations can I perform?
Source Column Type | IF Conditions | Destination Column Type | Operations- As per the datatype | Set value from a Column | Constant | Note |
String | EQUALS | String | CONCAT | Pick one from all string columns | Some String value | If the source conditions for string comparison contains SINGLE QUOTES or DOUBLE QUOTES or a spcial character, user should escape it with a backward slash \ |
Number | EMPTY | Number | LENGTH | Pick one from all number columns | Some Number value. |
|
Date | EMPTY | Date | ADD DAYS | Pick one from all date columns | Some Date value |
|
Boolean | EQUALS | Boolean | DELETE ROWS | N/A | 0/1 value |
|