The Yasna platform can process external datasets of text responses structured in a question database format. Access to this functionality is available with all plans including Free plan.
You can analyse 5000 data cases a month for free. For any feedback, feature suggestions and bug reports as well as for requesting enterprise plan please contact us at [email protected]
Have a look at a quick interactive demo to see the main features of the text processing tool
Have a look at a quick interactive demo to see the main features of the text processing tool
Data format
Before downloading the source data and processing it is necessary to format the data as follows:
- The data file should be in xlsx or csv format, using the delimiter - ‘;’
- Each column should contain response data for 1 question, and the wording of the question itself should be in the column header (= first row of the table).
- Optionally, you can add the interview ID (key) as the first column. In that case, rows with the same keys will be read as one interview. If you don’t use keys (option ”The data has IDs in the first column)” is off), the platform will assign them automatically. In this case, the data from different columns in a row will be read as belonging to one interview.
- You can also optionally add columns with variables related to the respondents in the database, so that you can make subgroups for analysis later.
Example of a table with raw data ready for processing
ID |
What associations do you have with this name? |
What do you think this title is for? |
Title (cell) |
Gender |
1000001 |
classical music |
makeup brand |
Current product |
Female |
1000002 |
luxury, beauty |
jewelry collection |
New concept |
Male |
1000003 |
holiday, festive season |
clothing collection |
New concept |
Female |
1000004 |
ballet, opera |
beauty brand |
Current product |
Male |
… |
|
|
|
|
The file can have any number of lines, questions, and variables.
Limitations of the report functionality
- We support only one level of subgroups. For example, if you need to create subgroups Gender X Age, but in the data table they are different columns, the platform will only allow you to do it by manually filtering interviews by ID list. Therefore, for optimization, it is better to mark in advance in the file, which interview belongs to which subgroup.