Data and AI Training

Home | Power BI | Excel | Python | SQL | Generative AI | Visualising Data | Analysing Data

LBAG Survey Dataset

In May 2020, The London Business Analytics Group (LBAG) ran a survey of members to find out which topics would be most interesting for future talks. Four topics were proposed for data analysis: street level crime, COVID tests numbers, the performance of English schools, and results of the UK election in 2019.

Respondents to the survey were asked to provide a Likert style score to indicate their interest in the topic (Very boring, Dull, OK, Interesting, Fascinating) as a single answer to a multiple-choice question. There was a final question where respondents could add an overall comment. There were 81 respondents.

This data is in an Excel spreadsheet in a public web location here. The spreadsheet has three tables: Survey, Questions and ScoreSentiment.

The Survey table records the results. There is a column for the score of each topic (Crime, COVID, Schools, Elections). The survey tool assigned a numeric value, Response Id, for each answer: Boring -> 1, Dull -> 2, OK -> 3, Interesting -> 4, Fascinating -> 5. The table also has a Comment column where the respondent may have entered some free text.

The Question table has a row for each of the four multiple choice questions. This contains the Question Code, which is the column header in the table Survey table, and a full title and text of the question.

The Response table has a row for each of the possible responses to the multiple choice questions. The ResponseText lists the possible responses ( Boring, …, Fascinating). The Response Id column is the corresponding value recorded in the Survey table (1 to 5). The Sentiment column groups the scores into three categories (Negative, Neutral, Positive). The Points column reflects the survey designers’ view on any numerical weights to be assigned to each Response and penalises the topics that respondents consider boring more than it rewards topics that respondents find interesting.

Suggested Analysis Approach

Analyse and visualise the dataset to answer this question: which topic(s) are the most and (least) appealing to respondents?

You may want to:

Also, consider if you can apply a points scheme so it is possible to sum the results - if this makes sense at all

Useful resources during the exercise

During the case study, the prompt for an AI model may be useful. If this is a Power BI exercise, replace the reference to Excel in the prompt below.

In Excel, I have an Excel table named Survey with columns: RespondentId, A, B, C.
The column header and first two rows look like this.

RespondentId, A,B,C
1, 100, 200, 300
2, 101, 202, 303

I want to unpivot the data to achieve a result like this.

RespondentId, Category, Value
1, A, 100
1, B, 200
1, C, 300
2, A, 101
2, B, 202
2, C, 303

How can I achieve this?