Wednesday, May 2, 2007

A Brief Introduction




Search engines are invaluable internet tools used by millions of people in order to obtain vast amounts of information about any topic imaginable. Many people were not aware though that search engines collect and save search logs from internet users. However, many people learned of this in August 2006, when America Online publicly released search log data from approximately 650,000 users -- without permission. The purpose of this was for research; however, America Online did not comprehend at the time the massive privacy violation that they had committed. They accidentally revealed many private details about their users. The search logs contained extremely personal and sensitive information that many users would not want public, such as family difficulties, depression issues, and medical concerns. America Online had used identification numbers rather than names and user identifications when listing the search logs, but in some cases it was still possible to identify the user based on their search results since some search logs included names, addresses, and social security numbers. Privacy advocates were outraged, and AOL soon withdrew the information and publicly apologized. Nevertheless, it was too late. The private information was all over the internet, and the damage was done. People became greatly concerned about their privacy and the issue of whether search engines should collect and save data from internet users arose.


So What's The Deal?

Most people in the world use search engines, and at the same time, most people greatly value their privacy. Most rational people want their business kept confidential. They want a sense of comfort and security. No one wants to be subjected to unnecessary harassment. Internet users such as ourselves must recognize this issue and decide a course of action. Let us examine this issue and the problems this issue causes.

When a person types and sends information through a search engine, that information is stored and later reviewed by a search engine employee. The reason for this is supposedly beneficial. Our data is used for research and developmental purposes in order to make improvements to search engine technology. We can assume that designers of search engines are simply trying to help internet users in their search engines needs. Markham Erickson, director of NetCoalition, states, “If companies cannot use search query information to improve their services for users, then consumers ultimately will pay the price if this data cannot be used to improve search technology.” Government officials can also subpoena search engine companies in order to obtain information for their own interrogative or research purposes. Companies such as MSN, Yahoo, and America Online have all complied with government requests for search logs. The government may request search logs in order to obtain evidence against criminals or simply to learn what the average internet user searches for. In January 2006, a subpoena was sent to Google by the government requesting data to "assist the government in its efforts to understand the behavior of current Web users, to estimate how often Web users encounter harmful-to-minors material in the course of their searches, and to measure the effectiveness of filtering software in screening that material.” The Department of Justice argues that this is beneficial because the government is trying to prevent crime and learn how people think.

The Expense of the People

I understand the need for search engine technology being improved – but at what cost? I believe this need for innovation and improvement is at the expense of the people. The major infraction here is the invasion of privacy, and I do not believe that any person or company has the right to pry themselves into another's personal business, regardless of what their intentions are. When the AOL search log incident occurred, hundreds of thousands of peoples' search logs were made public. These search logs contained peoples' darkest secrets. One of the search logs revealed a wife's secret affair, and another revealed a person's entire medical history. Several of these search logs contained full names and social security numbers as well, which made these people vulnerable to identity theft. These search logs represent more than just data; they represent a person's life. Kevin Bankston, lawyer from the Electronic Frontier Foundation, said it best when he stated that these search logs represent “the most secret hopes, deepest fears and dirtiest laundry of every user. They provide a snapshot of incredibly intimate events and ideas, often revealing personal problems, financial difficulties, medical ailments, sexual preferences, and more.” What also concerns me is if search engines are sending our data to outside sources, then perhaps their main reason for collecting and storing search logs is for marketing purposes and not for improvement purposes.

Here are some interesting examples and lists of search logs that were stored and leaked via the AOL search log incident:



Yeah, I can really see how this information would help advertisers. Thanks a lot, AOL!


My Opinion

I believe that search engines need to revise their privacy policies in order to limit and retain the data from outside sources which would prevent privacy infractions. Collected search logs that contain information that is traceable back to individual people should be deleted on the spot. Or perhaps the best solution to this issue would be for search engines to simply stop collecting our data completely. If search engines continue to collect our data, then our privacy will never be secure or confidential. Perhaps these search engine companies can use their own experimental search requests from themselves in order to improve their technology, instead of collecting and storing search logs from casual internet users. The government obtaining search logs to prosecute people is not something that I see as reasonable or legal either. With our current laws of freedom of expression and privacy, the government is not allowed to obtain our mail from post offices or record our phone conversations in outside life. Kevin Bankston responded to these government subpoenas and stated, “That's like asking the post office to keep copies of our mail, or phone companies to keep recordings of our phone calls, just in case investigators might find it useful." The laws of real, outside life should pertain to life on the internet as well. Currently the most effective way for internet users to conceal their privacy is through internet anonymizers. Internet anonymizers conceal your IP address, and without knowledge of your IP address, no one can identify you or locate you.


Here are some interesting YouTube videos that pertain to this controversy:

http://www.youtube.com/watch?v=HqD5zPqZ5Nk

http://www.youtube.com/watch?v=whanGlvZ5lc



A short list I made discussing the pros and cons:


Pros:

-Improvement of search engine technology

-Improves advertisements

-Assists Prosecuters

Cons:

-Assists stalkers

-Makes people susceptible to identity theft

-Major privacy invasion

To Finish

The issue of search engines storing our information is a controversy that pertains to everyone who uses the internet. The internet is a massive place with many risks, and information that a person sends over the internet can be intercepted. It is very likely that anyone that has researched a topic through a search engine has had their data stored and examined. We internet users have to take a stand and protect our privacy. People should have their freedoms of speech, expression, and privacy protected, and this controversy of search engines storing search logs is a blatant invasion of privacy. No one wants their private affairs to be seen by strangers that have no business looking into it. The America Online search log incident was the catalyst of this controversy as it concerned millions of people about their privacy on the internet. This incident demonstrated how personal and private matters can be seen by anyone, and in some cases, put people in serious danger. Some search logs contained critical identification information and opened people up for identify theft, spam, and stalkers. Currently, search engines still collect and store information from internet users, and we internet users should take a stand to protect our privacy. Perhaps changes can be made to this ongoing controversy if enough people become aware of this issue and make their concerns heard.


If you would like to read more on this controversy, here are links to some of the articles that I mentioned:
"Should Web Search Data Be Stored?" - Wall Street Journal Online Debate (Bankston and Erickson)
"No Place To Hide: Privacy Invasion and Censorship." - American Press Institute (McMasters)
"Google Rebuffs Feds On Search Requests." - The Boston Globe (Liedtke)
“Could Future Subpoenas Tie You to 'Britney Spears Nude'?” - Law.com (Lohmann)

Resolution List:
-Web anonymizers (Conceal yourself!)
-Search engines should revise their ways.
-Search engines should stop storing our information all together.
-Internet users can voice themselves by sending complaints to search engine companies.