The Yasna platform can process external datasets of text responses structured as a question database. This functionality is available on all plans, including the Free plan.

You can analyse up to 5000 data cases a month for free. For feedback, feature suggestions, and bug reports, or to request an enterprise plan, please contact us at [email protected]

Have a look at a quick interactive demo to see the main features of the text processing tool.

Data format

Before uploading the source data for processing, format it as follows:

Example of a table with raw data ready for processing

| ID | What associations do you have with this name? | What do you think this title is for? | Title (cell) | Gender |
|---|---|---|---|---|
| 1000001 | swan lake ballet | cosmetics brand | White swan | Female |
| 1000002 | song | I don't know | Black raven | Male |
| 1000003 | Russian folk songs | may be domestic clothing brand | Black Raven | Female |
| 1000004 | Chanson, park, shashlik | for vodka | White swan | Male |

The file can have any number of lines, questions, and variables.
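
Before uploading, it can help to check the table locally. The sketch below is only an illustration: it assumes the data is saved as CSV (the accepted file types are not stated here), that the first row is the header, and that every interview has a unique value in the `ID` column. The function name `check_table` is hypothetical and not part of the platform.

```python
import csv
import io


def check_table(rows, id_column="ID"):
    """Return a list of problems found in a header-plus-rows table.

    Assumed rules (not confirmed by the platform documentation):
    every row has as many cells as the header, and IDs are unique.
    """
    header, *records = rows
    if id_column not in header:
        return [f"missing column: {id_column}"]
    id_index = header.index(id_column)
    problems = []
    seen = set()
    # Data rows start on line 2 of the file, right after the header.
    for line_no, record in enumerate(records, start=2):
        if len(record) != len(header):
            problems.append(
                f"line {line_no}: expected {len(header)} cells, got {len(record)}"
            )
            continue
        respondent_id = record[id_index].strip()
        if not respondent_id:
            problems.append(f"line {line_no}: empty {id_column}")
        elif respondent_id in seen:
            problems.append(f"line {line_no}: duplicate {id_column} {respondent_id}")
        seen.add(respondent_id)
    return problems


# Sample data copied from the example table above, written as CSV text.
SAMPLE = """ID,What associations do you have with this name?,What do you think this title is for?,Title (cell),Gender
1000001,swan lake ballet,cosmetics brand,White swan,Female
1000002,song,I don't know,Black raven,Male
1000003,Russian folk songs,may be domestic clothing brand,Black Raven,Female
1000004,"Chanson, park, shashlik",for vodka,White swan,Male
"""

rows = list(csv.reader(io.StringIO(SAMPLE)))
print(check_table(rows) or "no problems found")
```

Answers that contain commas, such as the last row, need to be quoted in CSV so that they stay in a single cell.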

Limitations of the report functionality

  1. Only one level of subgroups is supported. For example, if you need Gender X Age subgroups but gender and age are stored in separate columns of the data table, the only way to build them on the platform is to filter interviews manually by a list of IDs. To save time, mark in the file which subgroup each interview belongs to before uploading it.
  2. You can create separate reports in Storytelling format (text summary) for any number of subgroups. The Scale mode, however, allows only one codifier per selected array. You cannot save multiple codifiers for the same question: a new codifier overwrites the data of the old one. If you know in advance that you will need different codifiers for different cells (e.g. by concept), split the data beforehand, either into separate files (processed as different projects on the platform) or into columns with a separate question heading per concept. See the example below:

| ID | What associations do you have with this name? (White swan) | What associations do you have with this name? (Black raven) |
|---|---|---|
| 1000001 | swan lake ballet | |
| 1000002 | | song |
| 1000003 | | Russian folk songs |
| 1000004 | Chanson, park, shashlik | |

  3. The coded response database is exported in R format only. If necessary, we can merge it with the numerical analysis data and convert it to any other format convenient for the client on request. Only the percentages displayed in the table can be copied directly from the platform - see below.
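
The preparation steps suggested in points 1 and 2 can be done in a few lines before uploading. The following is a minimal sketch, not a platform feature: the helper names `add_subgroup_column` and `split_by_concept` are hypothetical, and the records reuse the example rows from this article. Because the sample table has no Age column, the combined subgroup is built from Gender and Title (cell) instead; the same call would work for Gender and Age if both columns were present.

```python
def add_subgroup_column(records, parts, name):
    """Return copies of the records with one combined subgroup column added."""
    return [
        {**record, name: " / ".join(record[part] for part in parts)}
        for record in records
    ]


def split_by_concept(records, question, concept_column, id_column="ID"):
    """Spread one question into a separate column per concept."""
    # Normalise case so "Black raven" and "Black Raven" land in one column.
    concepts = sorted({r[concept_column].capitalize() for r in records})
    result = []
    for r in records:
        row = {id_column: r[id_column]}
        for concept in concepts:
            match = r[concept_column].capitalize() == concept
            # Only the respondent's own concept column receives the answer.
            row[f"{question} ({concept})"] = r[question] if match else ""
        result.append(row)
    return result


Q = "What associations do you have with this name?"
records = [
    {"ID": "1000001", Q: "swan lake ballet", "Title (cell)": "White swan", "Gender": "Female"},
    {"ID": "1000002", Q: "song", "Title (cell)": "Black raven", "Gender": "Male"},
    {"ID": "1000003", Q: "Russian folk songs", "Title (cell)": "Black Raven", "Gender": "Female"},
    {"ID": "1000004", Q: "Chanson, park, shashlik", "Title (cell)": "White swan", "Gender": "Male"},
]

# One pre-marked subgroup column, so no manual filtering by ID is needed.
marked = add_subgroup_column(records, ["Gender", "Title (cell)"], "Subgroup")
print(marked[0]["Subgroup"])

# One question column per concept, so each can get its own codifier.
wide = split_by_concept(records, Q, "Title (cell)")
for row in wide:
    print(row)
```

Normalising the case of the concept name is a design choice made here because the sample data spells one title as both "Black raven" and "Black Raven"; without it the split would produce three columns instead of two.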

Working with data step by step