LAPD crime data privacy insecurity

 

This dataset describes the Crime in LA and includes 1,000,000 rows of data and 28 columns. the columns range from codes of crime, race, locations, and has columns for weapon, age, sex, arrest status, and even coordinates. This data was taken from 2020- Jan 3 2026.

Our project was made up of 5 people, a web specialist, project manager, data wizard, and a problem solver/floater. 

The methods we took for this data was denotatively looking at the columns and the data dictionary, as well as using data genesis format anatomy to derive a meaning from the data dictionary and how it could be problematic.

Our project details the LAPD crime data which contains potentially violations of privacy through public access of victim address, location, name, and what crime was committed. we went through the data denotatively looking through the data dictionary and the categories, as well as looking at the data genealogy to find exactly how this data could prove violations of privacy. 

The problem with this data being publicly available is that data scraping tools and AI can gather this data without any knowledge of it being scraped and can be thusly easily accessed and not controlled at all. This data simply shouldn't be publicly available because it is violation of 3 privacy theories and could be easily accessed and a lot of the information on there, namely name and address, could be easily harmful and could be cyber-stalked with nearly everyone having a social media account. 

Limited access theory is having the control to who can have access to your information, this theory is violated by this dataset and practice because the victim has no control over who can have access to this information. This could then be traced to who the individual is and the people they know and the places they frequent and almost everything in their life. Control theory is the ability to control exactly what information can be seen or publicly available, this theory is violated because the individual is not consented to this information being given to the public at all. Secrecy theory is the ability to avoid secret or personal information from being publicly available, such as name, address, etc. this theory is being violated by this dataset because all of these variables are publicly available and can be scraped by programs and AI tools.

Our findings were that this data is violating several privacy theories and could be a danger to security because of data scraping tools and online stalking through social media.

 

Term and Year
Winter 2026
Category
Privacy & Surveillance
Short Summary

Our project is about the 2020-2026 LAPD crime data and how it is a violation of privacy and security. For this we are going to specifically cite the data's easy access to location, name, and what crime is committed.