The Datafication of the Learner

Contemporary education systems generate and collect data at an unprecedented scale. Digital learning platforms log every click, pause, and error. Behavior management systems track disciplinary incidents in real time. Social-emotional learning tools prompt students to self-report emotional states. Attendance systems record movement patterns. Proctoring software monitors eye movements and keystrokes during assessments. The result is an increasingly comprehensive digital record of each child's educational life — a record that is being used to diagnose, predict, sort, and intervene in ways that raise profound ethical questions.

The dominant discourse around educational data is techno-optimistic: more data means better understanding of individual learners, which means more personalized instruction, which means better outcomes. This framing is not entirely wrong — thoughtful uses of data can indeed support learning. But it systematically obscures the power relations, the risks of harm, and the values at stake in the collection and use of data about children.

Privacy and Consent

The foundational ethical principle governing data collection is informed consent — the principle that individuals should be able to decide what information about them is collected and how it is used. Children cannot meaningfully consent to most forms of data collection, and parents' ability to exercise meaningful consent is constrained by the compulsory nature of schooling and the complexity of the data ecosystems involved.

Educational data is shared across a complex web of actors — school districts, state agencies, federal databases, and an expanding ecosystem of private vendors whose data practices are governed by contracts that few parents or educators ever read. FERPA and COPPA provide some legal protection, but they were designed for a simpler data environment and do not adequately address the risks posed by modern learning analytics and third-party data brokers.

Algorithmic Decision-Making and Bias

The use of algorithms to make or inform high-stakes decisions about students — placement in special education, identification for gifted programs, prediction of dropout risk, assessment of discipline — raises specific concerns about bias and due process. Algorithmic systems trained on historical data tend to reproduce the patterns of that data, including patterns that reflect past discrimination. A dropout prediction model trained on historical data from a school system with racially disparate discipline practices will encode those disparities in its predictions.

The opacity of many algorithmic systems compounds this problem. When a student is denied access to an advanced course on the basis of an algorithmic recommendation, neither the student, the parent, nor the teacher may have access to the information required to understand or contest the decision. This violates basic principles of procedural justice that have long been held to govern high-stakes educational decisions.

Toward an Ethical Framework for Educational Data

An ethical framework for educational data must begin from the principle that data about children exists to serve their learning and well-being — not the administrative convenience of institutions, the commercial interests of vendors, or the research agendas of external actors. This means applying strict purpose limitation, ensuring data minimization, building in transparency and contestability, and subjecting data practices to genuine democratic oversight.

It also means developing the professional capacity of educators to engage critically with data — to understand the assumptions embedded in data systems, to ask hard questions about how algorithmic recommendations are generated, and to exercise judgment that is informed by but not reduced to quantitative signals. Data literacy is not a technical skill but a civic one, and it belongs at the center of professional preparation for educators.

Bibliography

Eubanks, V. (2018). Automating Inequality: How High-Tech Tools Profile, Police, and Punish the Poor. St. Martin's Press. Gillborn, D., Warmington, P., & Demack, S. (2018). QuantCrit: Education, Policy, 'Big Data' and Principles for a Critical Race Theory of Statistics. Race Ethnicity and Education, 21(2), 158–179. Selwyn, N. (2015). Data Entry: Towards the Critical Study of Digital Data and Education. Learning, Media and Technology, 40(1), 64–82.