Huge Databases Offer a Research Gold Mine â€” and Privacy Worries
Last month several news organizations reported on the emergence of "fusion centers" — vast data clearinghouses, operated by state law-enforcement agencies, that can instantly call up key personal information on anyone: telephone numbers, insurance records, family ties, and much more.
Architects of the fusion centers say they are a long-overdue tool for combatting crime and terrorism. But critics warn that the centers are a menace to privacy and say they have been constructed at the state level to avoid the scrutiny that a single federal data system would attract.
"Being able to follow students longitudinally is the key to any sophisticated understanding of how colleges are doing and what's happening to students," says Thomas R. Bailey, director of the
Last month Mr. Bailey joined two dozen other scholars at a conference in
"We are far from having exhausted the important research questions that can be addressed with these types of data," Mr. Bailey said during the conference. "Even if we doubled or tripled our capacity at the Community College Research Center, we couldn't possibly deal with all of the issues that these data could be used for."
In an influential 2005 study, one of Mr. Bailey's Teachers College colleagues used a large database in
Despite this potential analytic power, many states have shied away from creating robust data systems. That has partly to do with a lack of resources and expertise, Mr. Bailey says. But it also has to do with nervousness about federal and state privacy laws.
Mr. Bailey hopes that the federal government will do more to prod states to take action — and especially to create better links between their school databases and their postsecondary databases. Since 2002 the Education Department has given states grants totaling around $40-million per year to improve their data systems. And since 2005, the Bill & Melinda Gates Foundation has supported a coalition known as the Data Quality Campaign, which encourages states to create unified databases of student achievement.
"Even if Ferpa did not exist, many of these challenges would still be with us," Mr. Bailey says. "Colleges' IT systems aren't set up to analyze this stuff. The data generally aren't stored in a way that's ideal for research, because that's not the purpose for which the system was designed. The resources and the time that it takes the staff of these places to comply with requests from researchers — those things are not necessarily Ferpa-related."
"We have not had the linkages to K-12 that we hoped we would have," says William E. Knight, director of institutional research at
Mr. Knight and Mr. Bailey both say they hope that the Ferpa revisions and the apparent success of the
A final barrier, cited by Mr. Bailey and several other scholars at the National Academies conference: No matter how strong or weak a state's data system might be, outside researchers need to gain the trust and respect of local officials before they can tap into the data.
"You have to realize that these are public officials, and it takes a lot of courage for them to make public some of these numbers," Mr. Bailey says. "We always try to explain that we're there as partners and we want to help them answer questions that are important to them." (Mr. Sellers says that he and his colleagues in
"Our job is to use these numbers to help colleges improve what they're doing," Mr. Bailey continues. "Not to judge them or to somehow expose them as incompetent. On the other hand, our job isn't to explain away bad numbers, either. So it's a balancing act."
the article "Huge Databases Offer a Research Gold Mine — and Privacy Worries" was published on may 5th in "The Chronicle of Higher Education" http://chronicle.com/free/v54/i35/35a01001.htmprevious page