Wikipedia Data - Extraction (Series 1)

Data extraction is a valuable skill for any student who wants to stand out when starting college. It is the process of bringing data into statistical programming software so that it can be transformed, visualized, and analyzed. Students benefit from learning to collect their own data, and it helps to start with data they already know or with a website they use often.

One way for students to get started is to extract data from the Wikipedia page on the counties of California. The page lists detailed information about each county, including its population, area, and the date it was established. Students can use this data to build visualizations and analyses that help them better understand the counties of California.
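As a rough illustration of what extracting a table like this involves, here is a minimal Python sketch that uses only the standard library. It parses a small HTML table embedded in the script rather than the live Wikipedia page, so the rows, the column names, and the names `TableParser`, `extract_table`, and `SAMPLE_HTML` are all made-up placeholders for this example, not real county figures or the method shown in the video.

```python
from html.parser import HTMLParser


class TableParser(HTMLParser):
    """Collect the text of every cell in every table row."""

    def __init__(self):
        super().__init__()
        self.rows = []     # finished rows, each a list of cell strings
        self._row = None   # row currently being built, or None
        self._cell = None  # text pieces of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def extract_table(html):
    """Return the table as a list of dicts keyed by the header row."""
    parser = TableParser()
    parser.feed(html)
    parser.close()
    if not parser.rows:
        return []
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


# Placeholder data (NOT real county figures), shaped like a simple HTML table.
SAMPLE_HTML = """
<table>
  <tr><th>County</th><th>Established</th><th>Population</th><th>Area (sq mi)</th></tr>
  <tr><td>Example County A</td><td>1850</td><td>1,000</td><td>500</td></tr>
  <tr><td>Example County B</td><td>1852</td><td>2,500</td><td>750</td></tr>
</table>
"""

counties = extract_table(SAMPLE_HTML)

# Clean the population column: drop thousands separators and convert to int.
for county in counties:
    county["Population"] = int(county["Population"].replace(",", ""))

for county in counties:
    print(county["County"], county["Established"], county["Population"])
```

To work with the real page, the HTML would first have to be downloaded (for example with `urllib.request.urlopen`) and passed to `extract_table`. Real Wikipedia tables usually contain footnote markers and nested markup, so more cleanup would likely be needed. In practice, many people use `pandas.read_html` for this step instead of a hand-written parser.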

Data extraction is a great way for high school students to gain hands-on experience with data and learn how to analyze it. With the right data and the right software (R or Python), students can create meaningful visualizations and analyses that help them stand out when applying to college and show admissions officers that they are prepared for a college-level data analysis course. The earlier students gain this skill, the better.

Watch the video demonstration