Overview >> Submit a Challenge >> What we teach >> Previous Data Fairs >> Research >> Contact >>

Previous Data Fair Projects

Cultural Maps

Our study seeks to create a cultural map for tourists or locals in the City of Edinburgh that encompasses both the main cultural assets available and incorporates additional aspects such as demographic information, ethnicity and religious organisations, monuments and open spaces. A key challenge was to present (in a digestible way) what is an extremely rich dataset with enough depth to maintain the integrity of relationships but that could span the entire city to allow deep insights across different areas.

To experience the nodes that form the cultural sites in a human way, we therefore struck on the idea of using the transport network - the main arteries that help people travel through the city - as a way to effectively slice through and expose the depths of the dataset. Selecting a particular Lothian bus route (#16), we analysed religious, demographic data and cultural sites with proximity to bus stops along that line. This enables us to transcend the city and delve into the various flavours and influences appearing in very different but quite localised areas.

Shixuan Wang, Linrui He, Yifan Zhou and Karen Mair, 2019


Edwin Morgan’s Scapbooks

Edwin Morgan was the first national poet in Scotland, and he wrote many influential poems in his life. In addition to the poems he left to the public, he also produced 12 scrapbooks during his entire life and these scrapbooks are valuable biographical materials for people to learn more about Morgan and the time he lived. the metadata about the scrapbooks was collected and our project is to utilize the metadata to visualize the scrapbooks and allow the public to know what are in the scrapbooks without viewing the original books. The data we have for the project are 4 of overall scrapbooks that were produced from the 1930s to 1960s.

Kun Wang, Yongrao Du, Kehan Jiao, 2019

Web Video

Encyclopedia Britannica

The dataset is the Encyclopedia Britannica from edition 1 to edition 8, released by National Library of Scotland. The uncleaned OCR dataset contains 155,388 ALTO XML files, 155,388 image files, and METS metadata files. To show the data to general public who is our target audience, we built a fancy world of Encyclopedia in Minecraft containing four main parts of our finding in data analysis, enabling people have similar experience as they are exploring in a museum.

Bibo Tian, Xue Feng, Yifeng Gao, 2019

Cancer Data

This project is a data visualization project on cancer data in England. In this project, researchers analyzed data from Simulacrum, gained insights into cancer incidence, mortality rates, etc. Eventually, the project designed data visualizations to deliver these insights. The design outcome involves a data comics and paper engineering.

Traditional paper engineering works such as pop-up cards and books us the element of surprise to delight and amaze audiences at all age levels. Pop-up books could be printed in large numbers using cheap materials.

Chen Chen Jiayu Li Ziying Li, 2019

Diseases in British India from 1850 to 1950

The dataset is about visualizing the disease in British India from 1850 to 1950 and collects 468 official publications of short reports to multi-volume histories relating to disease, medical research, official institution report response to epidemics within a colonial context, consisting of 120,903 images, 469 TXT and 117,490 positional XML.

Our data visualization are supposed to be applied in a historical, or medical museum. It also can be used for introducing popular science to those who are interested in colonial India history. These visualisations of these historical data helps to have a better understanding of the prevalence of ancient Cholera.

Qbin Wu & Diwen Yu, 2019

Spriritualist Newspapers

Spiritualism is a religious movement started in the half of the nineteenth century and based on the belief that “human’s consciousness survives corporeal death by maintaining a transcendent existence in a spirit world” (Valente, 2008). It started in New York in 1848, spreading to British in the 1850s and reached its peak growth in membership from the 1840s to the 1920s, especially in English-speaking countries(Braude,2001). In 1969, the newspaper titled “ the spiritualist” was published in London and issued in other countries in the next 13 years.

Methods of Natural Language processing are used to conduct data analysis and the analyzed result will be visualized in the form of data comic.

Xin Jin, Hang Yang, Ruyuan Zhang, 2019

Inequality in Scotland

Our dataset is about social inequality in Scotland, including higher education, further education, training, employment, voluntary work and activity agreements. We used data comics to present our findings, and we found that the diversity of positive destination has increased.

Bowen Qian, Guanyu Chen, Shiyu Wang, 2019

Scottish Witchcraft

This project is about understanding and visualising the Survey of Scottish Witchcraft database. The database is a documentation of all known information about accused witches and the witchcraft belief in Scotland between 1563 and 1736 (The University of Edinburgh, n.d.). The database is a Microsoft Access database owned by the School of History, Classics and Archaeology of the University of Edinburgh.

Our group created a physicalisation which is an interactive physical map and objects that audiences can interact with. The piece aims to educate audiences with the history of the Scottish witchcraft including the general distribution of accused witches by gender and location as well as the different torture types by location. Our audiences can be anyone that is interested in the Scottish Witchcraft history.


International Trade Agreements

For this work we have been using data visualization as a means to gain insight into the underlying structure and relationships of international trade agreements. Our dataset is comprised of 450 preferential international trade agreements with the majority (424) being in english. The original dataset is open to the public and can be found here. Website, Download the DH2019 poster

Visualising NHS Scotland Patient Care Experiences

This project created an interactive visualisation that allows the user to explore differences in patient experiences across Scotland. The Scottish government collects data on healthcare experiences through a biannual survey, but they typically publish this data in lengthy reports and large tables, making it difficult for laypeople as well as policymakers to use the data. In our project, we tried to make this data more accessible by creating visualisations.


Utilising FIFA Data to Visualise Team Formations

The aim of this project is to analyse FIFA World Cup 2018 Qualifiers data and provide useful insights via data visualisations. We implemented this project by developing a web platform that allows users (soccer experts, managers, scouts, etc.) to generate customised visualisations in order to understand the data and extract non-trivial information. The idea behind our implementation is to produce clean and easy to follow plots that can be potentially used by the soccer domain experts to make analyses of opponents’ playing styles, formations, strengths and weaknesses, etc.

The main page of the visualisation displays a high-level overview of the data, using glyphs to symbolise inequality — the less round, the more difference in the reported experiences of patients. Clicking the glyphs takes the user to a heatmap displaying the reported experiences for overall themes and optionally for individual survey questions. This setup allows the user to control how much detail is being shown.



“I would never have been able to get data visualisations of my work without this project; now I have them, I can use them for future funding applications and demonstrate the implications of data-led historical study”

“We had a very positive experience working together as a team to build such a tool. We also benefited from our collaboration with Dr Campagnolo, as he provided useful and specialised knowledge around the given data set, which contributed to us creating a tool that can be very useful to the domain’s experts since Dr Campagnolo is one of them himself. Lastly, we were grateful for having the opportunity through this course to work on such a project, as this equipped us with experiences which will be definitely useful for our further academic or professional careers.”