你喜欢吃炸薯条吗?你在面谱网上给过它们“赞”吗?下面是小编为大家收集关于TED英语演讲:你以为你点的“赞”就是单纯的“赞”吗,欢迎借鉴参考。
演说题目:Your social media "likes" expose more than you think
演说者:Jennifer Golbeck
演讲稿
If you remember that first decade of the web, it was really a static place. You could go online, you could look at pages, and they were put up either by organizations who had teams to do it or by individuals who were really tech-savvy for the time.
如果你还记得网络时代的头十年,网络是一个水尽鹅飞的地方。你可以上网,你可以浏览网页,当时的网站要么是由某个组织的专门团队建立,要么就是由真正的技术行家所做,这就是当时情况。
And with the rise of social media and social networks in the early 20xxs, the web was completely changed to a place where now the vast majority of content we interact with is put up by average users, either in YouTube videos or blog posts or product reviews or social media postings. And it's also become a much more interactive place, where people are interacting with others, they're commenting, they're sharing, they're not just reading.
但在二十一世纪初随着社交媒体以及社交网络的兴起,网络发生了翻天覆地的变化:如今网络上大部分的互动内容都是由大众网络用户提供,既有Youtube视频,也有博客文章,既有产品评论,也有社交媒体发布。与此同时,互联网成为了一个有更多互动的地方,人们在这里互相交流、互相评论、互相分享,而不只是阅读信息。
So Facebook is not the only place you can do this, but it's the biggest, and it serves to illustrate the numbers. Facebook has 1.2 billion users per month. So half the Earth's Internet population is using Facebook. They are a site, along with others, that has allowed people to create an online persona with very little technical skill, and people responded by putting huge amounts of personal data online.
Facebook不是唯一一个你可以做这些事情的地方,但它确实是最大的一个,并且它用数字来证明这点。面谱网每个月有12亿用户。由此可见,地球上一半的互联网用户都在使用面谱网。这些都是网站,允许人们在网上创建不同的角色,但这些人又不需要有多少计算机技能,而人们的反应是在网上输入大量的个人信息。
So the result is that we have behavioral, preference, demographic data for hundreds of millions of people, which is unprecedented in history. And as a computer scientist, what this means is that I've been able to build models that can predict all sorts of hidden attributes for all of you that you don't even know you're sharing information about.
结果是,我们拥有数以亿计人的行为信息、喜好信息以及人口数据资料。这在历史上前所未有。对于作为计算机科学家的我来说,这意味着我能够建立模型来预测各种各样的你或许完全没有意识到的与你所分享的信息相关的隐藏信息。
As scientists, we use that to help the way people interact online, but there's less altruistic applications, and there's a problem in that users don't really understand these techniques and how they work, and even if they did, they don't have a lot of control over it. So what I want to talk to you about today is some of these things that we're able to do, and then give us some ideas of how we might go forward to move some control back into the hands of users.
作为科学家,我们利用这些信息来帮助人们在网上交流。但也有人用此来谋取自己的私欲,而问题是,用户并没有真正理解其中用到的技术和技术的应用方式。即便理解了,也不见得他们有话事权。所以,我今天想谈谈我们能够做的一些事情,也启发我们如何改善情况、让话事权回归用户。
So this is Target, the company. I didn't just put that logo on this poor, pregnant woman's belly. You may have seen this anecdote that was printed in Forbes magazine where Target sent a flyer to this 15-year-old girl with advertisements and coupons for baby bottles and diapers and cribs two weeks before she told her parents that she was pregnant.
这是塔吉特百货公司的商标。我并不单单把那个商标放在这个可怜的孕妇的肚子上。或许在福布斯杂志上你看过这么一则趣事:塔吉特百货公司给这个15岁女孩寄了一份传单,传单上都是婴儿奶瓶、尿布、婴儿床的广告和优惠券。这一切发生在她把怀孕消息告诉父母的两周前。
Yeah, the dad was really upset. He said, "How did Target figure out that this high school girl was pregnant before she told her parents?" It turns out that they have the purchase history for hundreds of thousands of customers and they compute what they call a pregnancy score, which is not just whether or not a woman's pregnant, but what her due date is. And they compute that not by looking at the obvious things, like, she's buying a crib or baby clothes, but things like, she bought more vitamins than she normally had, or she bought a handbag that's big enough to hold diapers.
没错,女孩的父亲很生气。他说:”塔吉特是如何在连这个高中女生的父母都尚未知情之前就知道她怀孕了?“ 原来,塔吉特有成千上万的顾客,并拥有他们的购买历史记录,他们用计算机推算出他们所谓的“怀孕分数”,不仅能知道一个女性是否怀孕,而且还能计算出她的分娩日期。他们计算出的结果不单单是基于一些显而易见的事情,比如说,她准备买个婴儿床或孩子的衣服,更是基于其他一些事情,例如她比平时多买了维他命,或她买了一个新的手提包大得可以放尿布。
And by themselves, those purchases don't seem like they might reveal a lot, but it's a pattern of behavior that, when you take it in the context of thousands of other people, starts to actually reveal some insights.So that's the kind of thing that we do when we're predicting stuff about you on social media. We're looking for little patterns of behavior that, when you detect them among millions of people, lets us find out all kinds of things.
单独来看这些消费记录或许并不能说明什么,但这确是一种行为模式,当你有大量人口背景作比较,这种行为模式就开始透露一些见解。当我们根据社交媒体来预测关于你的一些事情时,这便是我们常做的一类事情。我们着眼于零星的行为模式,当你在众人中发现这些行为模式时,会帮助我们发现各种各样的事情。
So in my lab and with colleagues, we've developed mechanisms where we can quite accurately predict things like your political preference, your personality score, gender, sexual orientation, religion, age, intelligence, along with things like how much you trust the people you know and how strong those relationships are. We can do all of this really well. And again, it doesn't come from what you might think of as obvious information.
在我的实验室,在同事们的合作下,我们已经开发了一些机制来较为准确地推测一些事情,比如你的政治立场、你的性格得分、性别、性取向、宗教信仰、年龄、智商,另外还有:你对认识的人的信任程度、你的人际关系程度。我们能够很好地完成这些推测。我在这里在强调一遍,这种推测并基于在你看来显而易见的信息。
So my favorite example is from this study that was published this year in the Proceedings of the National Academies. If you Google this, you'll find it. It's four pages, easy to read. And they looked at just people's Facebook likes, so just the things you like on Facebook, and used that to predict all these attributes,along with some other ones.
我最喜欢的例子是来自今年发表在美国国家论文集上的一个研究。你可以在谷歌搜索找到这篇文章。这篇文章总共四页,容易阅读。他们仅仅研究了人们在Facebook上的“赞”,也就是你在Facebook上喜欢的事情。他们利用这些数据来预测之前所说的所有特性,还有其他的一些特性。
And in their paper they listed the five likes that were most indicative of high intelligence. And among those was liking a page for curly fries. (Laughter) Curly fries are delicious, but liking them does not necessarily mean that you're smarter than the average person. So how is it that one of the strongest indicators of your intelligence is liking this page when the content is totally irrelevant to the attribute that's being predicted? And it turns out that we have to look at a whole bunch of underlying theories to see why we're able to do this.
在文章中列举了最能够显示高智商的五个“赞”。在这五项中赞“炸扭薯”页面的是其中之一。炸扭薯很好吃,但喜欢吃炸扭薯并不一定意味着你比一般人聪明。那么为什么喜欢某个页面就成为显示你智商的重要因素,尽管该页面的内容和所预测的属性与此毫不相干?事实是我们必须审视大量的基础理论,从而了解我们是如何做到准确推测的。
One of them is a sociological theory called homophily, which basically says people are friends with people like them. So if you're smart, you tend to be friends with smart people, and if you're young, you tend to be friends with young people, and this is well establishedfor hundreds of years. We also know a lot about how information spreads through networks. It turns out things like viral videos or Facebook likes or other information spreads in exactly the same way that diseases spread through social networks.
其中一个基础理论是社会学的同质性理论,主要意思是人们和自己相似的人交朋友。所以说,如果你很聪明,你倾向于和聪明的人交朋友。如果你还年轻,你倾向于和年轻人交朋友。这是数百年来公认的理论。我们很清楚信息在网络上传播的传播途径。结果是,流行的视频、脸书上得到很多“赞”的内容、或者其他信息的传播,同疾病在社交网络中蔓延的方式是相同的。
So this is something we've studied for a long time. We have good models of it. And so you can put those things together and start seeing why things like this happen.So if I were to give you a hypothesis, it would be that a smart guy started this page, or maybe one of the first people who liked it would have scored high on that test.
我们在这方面已经研究很久了,我们己经建立了很好的模型。你能够将所有这些事物放在一起,看看为什么这样的事情会发生。如果要我给你一个假说的话,我会猜测一个聪明的人建立了这个页面,或者第一个喜欢这个页面的人拥有挺高的智商得分。
And they liked it, and their friends saw it,and by homophily, we know that he probably had smart friends, and so it spread to them, and some of them liked it, and they had smart friends, and so it spread to them, and so it propagated through the network to a host of smart people, so that by the end, the action of liking the curly fries page is indicative of high intelligence, not because of the content, but because the actual action of liking reflects back the common attributes of other people who have done it.
他们喜欢了这个页面,然后他们的朋友看到了,根据同质性理论,我们知道这些人可能有聪明的朋友, 然后他们看到这类信息,他们中的一部分人也喜欢,他们也有聪明的朋友,所以这类信息也传到其他朋友那里,所以信息就在网络上在聪明人的圈子里流传开来了,因此到了最后,喜欢炸扭薯的这个页面就成了高智商的象征,而不是因为内容本身,而是“喜欢”这一个实际行动反映了那些也付诸同样行动的人的相同特征。
So this is pretty complicated stuff, right? It's a hard thing to sit down and explain to an average user, and even if you do, what can the average user do about it? How do you know that you've liked somethingthat indicates a trait for you that's totally irrelevant to the content of what you've liked? There's a lot of power that users don't have to control how this data is used. And I see that as a real problem going forward.
听起来很复杂,对吧?对于一般用户来说它比较难解释清楚,就算你解释清楚了,一般用户又能利用它来干嘛呢?你又怎么能知道你喜欢的事情反映了你什么特征,而且这个特征还和你喜欢的内容毫不相干呢?用户其实没有太多的能力去控制这些数据的使用。我把这个看作将来的真实问题。
So I think there's a couple paths that we want to look at if we want to give users some control over how this data is used, because it's not always going to be used for their benefit. An example I often give is that, if I ever get bored being a professor, I'm going to go start a company that predicts all of these attributes and things like how well you work in teams and if you're a drug user, if you're an alcoholic.
我认为,要是我们想让用户拥有使用这些数据的能力,那么有几条路径我们需要探究,因为这些数据并不总是用来为他们谋利益。这有一个我经常举的例子,如果我厌倦了当一名教授,我会选择自己开家公司这家公司能预测这些特性和事物,例如你在团队里的能力,例如你是否是一个吸毒者或酗酒者。
We know how to predict all that. And I'm going to sell reports to H.R. companies and big businesses that want to hire you. We totally can do that now. I could start that business tomorrow, and you would have absolutely no control over me using your data like that. That seems to me to be a problem.
我们知道如何去预测这些特性,然后我就会把这些报告卖给那些人力资源公司和想要雇佣你的大公司。我们完全可以做到这点。我明天就能开始这个项目,并且你对我这用使用你的数据是一点办法也没有的。这对我来说是一个问题。
So one of the paths we can go down is the policy and law path. And in some respects, I think that that would be most effective, but the problem is we'd actually have to do it. Observing our political process in action makes me think it's highly unlikely that we're going to get a bunch of representatives to sit down, learn about this, and then enact sweeping changes to intellectual property law in the U.S. so users control their data.
所以我们可选的其中一条路径是政策和法律这条途径。某程度上我觉得这可能是最有效的。但问题是,事实上我们将不得不这么做。观察我们目前的政治进程让我觉得在美国,把一帮代表们聚在一起,让他们坐下来理解这个问题,然后颁布有关知识产权法方面的颠覆性条例,让用户掌控自己的数据,这似乎是不可能的。
We could go the policy route, where social media companies say, you know what? You own your data.You have total control over how it's used. The problem is that the revenue models for most social media companies rely on sharing or exploiting users' data in some way. It's sometimes said of Facebook that the users aren't the customer, they're the product. And so how do you get a company to cede control of their main asset back to the users? It's possible, but I don't think it's something that we're going to see change quickly.
我们可以走政策途径,这样社交媒体公司就会告诉你,你知道吗?你的确拥有你的数据。你绝对能自己决定要怎么去用。但问题在于大部分的社交媒体公司,他们的盈利模式在某方面取决于分享或挖掘用户的数据资料。所以有时会说面谱网的用户并不是顾客,而是产品。那么你要怎样让一个公司将他们的主要资产控制权双手拱让给用户呢?这是可能的,但我不觉得我们能很快见证这种改变。
So I think the other path that we can go down that's going to be more effective is one of more science.It's doing science that allowed us to develop all these mechanisms for computing this personal data in the first place. And it's actually very similar research that we'd have to do if we want to develop mechanisms that can say to a user, "Here's the risk of that action you just took." By liking that Facebook page, or by sharing this piece of personal information, you've now improved my ability to predict whether or not you're using drugs or whether or not you get along well in the workplace.
所以我认为我们得走另一条途径,一条更有效的途径,一条更加科学的途径。这途径是开发一种技术让我们能够发展所有这些机制来首先处理自己的个人信息资料。而这很接近我们必须做的研究,要是我们想要发展这些机制跟用户说明,“这样做你需要承担那样的风险。” 你在Facebook上点“赞” 或者分享一些私人信息,就相当于增强了我的能力去预测你是不是在吸毒或者你在工作中是否顺利。
And that, I think, can affect whether or not people want to share something, keep it private, or just keep it offline altogether.We can also look at things like allowing people to encrypt data that they upload, so it's kind of invisible and worthless to sites like Facebook or third party services that access it, but that select users who the person who posted it want to see it have access to see it. This is all super exciting research from an intellectual perspective, and so scientists are going to be willing to do it. So that gives us an advantage over the law side.
我觉得,这样做能够影响人们分享的决定:是要保持私隐,还是在网上只字不提。我们也可以探究一些别的,例如,让人们去给上传的东西加密,那么像面谱网这样的网站或其他能获取信息的第三方来说,这些信息就隐秘很多,也少了很多意义,而且只有上传人指定的用户才有浏览的权限。从智能的角度来看,这是一个非常振奋人心的研究,而且科学家们也会乐意去做这样的事。这样在法律方面,我们就有优势了。
One of the problems that people bring up when I talk about this is, they say, you know, if people start keeping all this data private, all those methods that you've been developing to predict their traits are going to fail. And I say, absolutely, and for me, that's success, because as a scientist, my goal is not to infer information about users, it's to improve the way people interact online. And sometimes that involves inferring things about them, but if users don't want me to use that data, I think they should have the right to do that. I want users to be informed and consenting users of the tools that we develop.
当我谈论到这个话题时,人们提到的其中一个问题,就是如果当人们开始把这些数据进行保密,那些你研发的用来预测人们特性的手段都会作废。我会说,绝对会作废,但对我来说,这是成功,因为作为一个科学家,我的目标不是去推测出用户的信息,而是提高人们在网上互动的方式。虽然有时涉及到推测用户的资料,但如果用户不希望我们用他们的数据,我觉得他们应该有权去拒绝。我希望用户能被告知并且赞同我们开发的这种工具。
And so I think encouraging this kind of science and supporting researchers who want to cede some of that control back to users and away from the social media companies means that going forward, as these tools evolve and advance, means that we're going to have an educated and empowered user base,and I think all of us can agree that that's a pretty ideal way to go forward.
所以我认为,鼓励这类科学,支持这些研究者们这些愿意放弃部分控制,退还给用户们,并且不让社交媒体公司接触数据的研究者们。随着这些工具的进化和提高,这一切意味着向前的发展,意味着我们将会拥有一个有素质有权力的用户基础,我觉得我们都会同意这是一个理想的前进目标。
Thank you.(Applause)
谢谢。(掌声)