Back to Sessions
Data

Data Cleaning: Tools and Techniques

11:45am - 1:00pm on Friday, November 21
KLCC Level 3 - Room 302

About This Session

This hands-on workshop helps investigative journalists understand the importance of preparing accurate, reliable, and consistent data for analysis. It introduces common data quality issues — such as missing values, duplicates, and formatting errors — and demonstrates step-by-step methods to detect, clean, and transform raw datasets. Explore both manual and automated tools, along with best practices and workflows, to ensure data is trustworthy and ready for effective analysis or storytelling.NOTE: Please bring your own laptop in order to follow along and get the most out of this session.Please take note of the following:Install the following tools in advance:https://openrefine.org/downloadhttps://tabula.technology/https://docs.google.com/spreadsheets/createhttps://support.microsoft.com/en-us/office/about-power-query-in-excel-7104fbee-9e62-4cb9-a02e-5bfb1a6c536aReview these two articles before the session:https://github.com/Quartz/bad-data-guidehttps://gijn.org/stories/data-cleaning-tools-and-techniques-for-non-coders/

Speaker

Pınar Dağ

Lecturer and GIJN Turkish Editor