

Better manage risks inherent in Big Data

By Ernest Davis (China Daily) Updated: 2017-02-13 08:36

In the last 15 years, we have witnessed an explosion in the amount of digital data available - from the Internet, social media, scientific equipment, smartphones, surveillance cameras, and many other sources - and in the computer technologies used to process it. "Big Data", as it is known, will undoubtedly deliver important scientific, technological, and medical advances. But Big Data also poses serious risks if it is misused or abused.

But having more data is no substitute for having high-quality data. For example, a recent article in Nature reports that election pollsters in the United States are struggling to obtain representative samples of the population, because they are legally permitted to call only landline telephones, whereas Americans increasingly rely on cellphones. And while one can find countless political opinions on social media, these aren't reliably representative of voters, either. In fact, a substantial share of tweets and Facebook posts about politics are computer-generated.

A Big Data program that used this search result to evaluate hiring and promotion decisions might penalize black candidates who resembled the pictures in the results for "unprofessional hairstyles," thereby perpetuating traditional social biases. And this isn't just a hypothetical possibility. Last year, a ProPublica investigation of "recidivism risk models" demonstrated that a widely used methodology to determine sentences for convicted criminals systematically overestimates the likelihood that black defendants will commit crimes in the future, and underestimates the risk that white defendants will do so.

Another hazard of Big Data is that it can be gamed. When people know that a data set is being used to make important decisions that will affect them, they have an incentive to tip the scales in their favor. For example, teachers who are judged according to their students' test scores may be more likely to "teach to the test," or even to cheat.

Similarly, college administrators who want to move their institutions up in the U.S. News & World Report rankings have made unwise decisions, such as investing in extravagant gyms at the expense of academics. Worse, they have made grotesquely unethical decisions, such as the effort by Mount Saint Mary's University to boost its "retention rate" by identifying and expelling weaker students in the first few weeks of school.

A third hazard is privacy violations, because so much of the data now available contains personal information. In recent years, enormous collections of confidential data have been stolen from commercial and government sites; and researchers have shown how people's political opinions or even sexual preferences can be accurately gleaned from seemingly innocuous online postings, such as movie reviews - even when they are published pseudonymously.

Finally, Big Data poses a challenge for accountability. Someone who feels that he or she has been treated unfairly by an algorithm's decision often has no way to appeal it, either because specific results cannot be interpreted, or because the people who have written the algorithm refuse to provide details about how it works. And while governments or corporations might intimidate anyone who objects by describing their algorithms as "mathematical" or "scientific," they, too, are often awed by their creations' behavior. The European Union recently adopted a measure guaranteeing people affected by algorithms a "right to an explanation"; but only time will tell how this will work in practice.

When people who are harmed by Big Data have no avenues for recourse, the results can be toxic and far-reaching, as data scientist Cathy O'Neil demonstrates in her recent book Weapons of Math Destruction.

The good news is that the hazards of Big Data can be largely avoided. But they won't be unless we zealously protect people's privacy, detect and correct unfairness, use algorithmic recommendations prudently, and maintain a rigorous understanding of algorithms' inner workings and the data that informs their decisions.

The author is a professor of computer science at the Courant Institute of Mathematical Sciences, New York University.

Project Syndicate
