Archive for the 'Search Engines' Category

Google’s Gatekeepers

Sunday, November 30th, 2008


These three Google employees may be the world’s most powerful censors

The New York Times Magazine today featured Google’s Gatekeepers - a look at the small unaccountable team within Google who decide whether and to what extent they will comply with the wishes of governments around the world who wish to regulate its operations. Encouragingly, Andrew McLaughlin, global public-policy director, is a Berkman Fellow, which is about as good a place as I can imagine to start from if you want to appreciate Internet regulation issues.

More disturbingly, Nicole Wong describes her role as finding an approach which “will allow our products to move forward in a country” (which should come as no surprise - as a publicly-held company it is legally obliged to maximise its profits).

Google formally enters the media business (in a quiet way)

Friday, October 10th, 2008

Google has long insisted that it wasn’t interested in or involved with news gathering that involved human intervention - “we just serve stuff up using algorithms”, they say. (Of course the algorithms at Google News are continually tweaked to ensure that people using them get the kind of results that Google believes that they want, and the selection of news sources themselves is done by humans…) But I just noticed a new programme off in a corner of Google - Power Readers in Politics - essentially a group blog run by a small and Google-selected set of politicians and journalists, attached to Google Reader. Also see their Canadian version.

Insight into the business of search

Monday, June 30th, 2008

While reading a New York Times article about voice recognition and speech synthesis I learned about a new audio and video search engine, Everyzing. While the search engine itself was interesting, I was struck by the fact that the first thing you find in searching for Everyzing and in visiting their website is not the search engine page itself, but a page about the company’s business. And it is not centred on the user - it’s centred on the content providers. Specifically, the company is touting its skills in search engine optimization of audio and video content. So even before it hits the mass market it is already planning to make money by helping deep-pocketed media companies to get their media found by searchers ahead of others who don’t have those resources.

I tend to think of search engine companies and search engine optimization companies as being enemies but this reminds me that the relationship is a lot more complex.

Bad news for online book content availability, academics

Monday, June 23rd, 2008

Google’s Book Search gets most of the press but Microsoft has also been active in the large-scale digitization of both in copyright and out of copyright books for their search engine. At least until recently. I hope Microsoft’s short-sighted decision to phase out their book digitization programme does not encourage Google to do likewise. We academics have also lost out - the same decision also put paid to Microsoft’s “Live Search Academic” engine which shadowed Google Scholar.

Hurray - essay writing services are to be banned from Google

Wednesday, May 23rd, 2007

I know it raises familiar awkward questions about Google’s market power but in this instance I have to agree. Google’s ban on advertising for essay writing services joins its existing bans on ads for “weapons, prostitution, drugs, tobacco, fake documents and miracle cures.”

My view on book digitisation or Kevin Kelly goes author-baiting

Tuesday, May 23rd, 2006

I thought I had written my own robust defence of Google Book Search and book digitisation in general but it seems I haven’t - at least I haven’t found any on this blog or my personal one (though I posted enthusastically about meeting Brewster Kahle who runs the Internet Archive and briefly mentioned my own experience of using Google Books).

Kevin Kelly has had a go at providing a popular account of the potential importance and utility of the widespread availability of books online in Scan This Book! in the New York Times. He rehearses many of the good arguments against the ever-lengthening text copyright regime and for the social utility of book scanning programmes but unfortunately his argument is somewhat spoiled by his need to “epater les bourgeois”.

First by hyperbolic statements: “The link and the tag may be two of the most important inventions of the last 50 years.” Inventions? Talmudic scholars (for one) would be surprised to learn we had just invented annotation.

Secondly, and more importantly, by un-necessarily sweeping and apocalyptic predictions about the way technology will (must?) change existing businesses (like publishing).

Copies of isolated books, bound between inert covers, soon won’t mean much. Copies of their texts, however, will gain in meaning as they multiply by the millions and are flung around the world, indexed and copied again. What counts are the ways in which these common copies of a creative work can be linked, manipulated, annotated, tagged, highlighted, bookmarked, translated, enlivened by other media and sewn together into the universal library.

Much of what he says is arguably valid for non-fiction and particularly scientific research but less so for fiction where (as Updike says) we actually seem to like reading (or viewing) self-contained narratives (though we may then go on to comment on them or construct other self-contained narratives based on them).
He usefully points out that academic science is heading faster than other fields towards the universal library but doesn’t think through the implications. Academics need to publish freely to advance and do so happily but only because there is a state system in place that pays them to be experts because society benefits from their creation and dissemination of knowledge. It is hard to imagine the same model being applied to the writers of cookbooks, say, but in the UK we pay authors a (very) modest sum when their books are checked out of the public library.

Could some form of super-UNESCO (or a number of national government initiatives) help to fund freely-available fiction (or other creative works) to be added to the universal Internet library of the future? Perhaps paid for through a levy on broadband subscriptions as suggested by some in France - the Global License? This is approximately the way the BBC works, for example (though it is not as free as it should be in sharing the content that our license fees have paid for). Surely this is a more attractive proposition for artists than having to individually flog “performances, access to the creator, personalization, add-on information, ads, sponsorship or periodic subscriptions” which KK suggests are the options that will be available to artists in the brave new world when their individual works themselves are no longer saleable.

KK anticipates some of the hostile reaction that followed from John Updike (speaking predominantly on behalf of fiction authors) and by Nicholas Carr (among others) more generally:

Some people alive today are surely hoping that they die before such things happen, and others, mostly the young, want to know what’s taking so long. (Could we get it up and running by next week? They have a history project due.)

I for one am in the young enthusiast camp but I don’t think it means that the way things work in the creative industries has to be swept away in order to bring the (near)-universal Internet library about.

If we could reduce the term of copyright to 14 years, renewable for another 14 (as Creative Commons suggests) and if copyright could be easily asserted at a central database but the default for works where the ownership was unclear was that such work would be in the public domain, authors would still be able to get paid for their works during their most valuable commercial life but we would have a huge public domain of useful information instead of the stunted one we have now.

The above suggestions still radical (more radical than many authors and publishers would like, I am sure) but are consistent with the new potentials technology offers without requiring the total restructuring of publishing…

I’ve not provided as fully thought-through or well-ordered set of arguments here as I’d like (and not perhaps a particularly original point of view either - it draws heavily on Lawrence Lessig’s thinking, for example) but there’s as much polishing here as I can spare considering the £0 I am getting paid to write this! What do you think?

Update: As if in answer to my wishes, I have just heard about a very promising bill in the US House of Representatives - the Orphan Works Act which would release into the public domain works where the owner is no longer known. Someone should set up a campaign to support Lamar Smith (the bill’s sponsor).

David Brake

What’s the big deal about the DOJ’s request for a random selection of ‘anonymised’ searches?

Saturday, January 21st, 2006

The US Department of Justice asked Google recently for a selection of the searches people made over a two month period, with the identities of the searchers removed. Google, to its credit, has resisted this while the other major US search engines have not. John Battelle and Danny Sullivan have further analysis of this issue.

There are four issues that arise from this which have not been prominently dealt with in discussions I’ve read:

1) If Google hadn’t resisted this request, would we have heard about the fact that AOL, Yahoo and MSN immediately complied?
2) How many people realise that Google (and others) frequently do keep track not just of what people search for but who searches for what?
3) Mightn’t there be some searches that tend to reveal the identity of the searcher even if the searcher’s IP address is concealed? Particularly if you examine them in time order? Update: Search Engine Watch just produced a lengthy analysis of this issue which suggests that this is probably not a big issue.
4) The DOJ doesn’t seem to have asked only for searches by Americans. Does it have any business looking at searches from international users who might (for whatever reason) have chosen to use yahoo.com instead of (for example) yahoo.co.uk?

Update: Tim Wu of Columbia Law School writes an excellent analysis of the whole issue on Slate, arguing that it’s simply excessive for search engines to collect these reams of personal data on what people are searching for.

Search engines and privacy

Friday, July 15th, 2005

A very interesting article has been produced outlining the privacy implications of Internet search engines. It isn’t just that they make it easy for others to get at publicly-held data - they also collect all sorts of data about your surfing habits and other information. Google and others say they don’t use such data or intend to use it in any way except to make your surfing easier, but will this always be true? And do people always realise the implications of their trading their privacy for convenience?

I have to say I use all of Google’s potentially privacy-intrusive functions, but I am addicted to new cool features…

P.S. I agree with Ray Everett-Mills who expresses surprise that a company like Google has no privacy officer. As far as I can tell they leave that stuff to the lawyers - lawyers can protect you from the consequences of mistakes but a good privacy officer would keep them from making mistakes. Here in Europe I like to think a company like Google would not think twice about having one…

Maybe you can find the answer to anything online?

Friday, May 20th, 2005

The UK government is trying out a new service - the People’s Network Enquiry Service where you can ask (it seems) just about any question you like and a librarian either from the UK or somewhere else in the world (if you mail outside 9-5 Mon to Fri) will try to answer it. Will this encourage those not online to use it by providing an easy to understand alternative to surfing? Time will tell…

counseling horizon credit a newmp3 808 azuraxanax free consultation 24hour shippinglg 8100 mp3der mp3 rauscht wald schon abendlichmp3 820 yh samsungxanax 5mg much toomp3 83090 enable Map

401k loans faqloan 4 militaryrating credit adverse loansloan home adjustableadvance software payday loanloan bankruptcy va afteradvance american payday loans123 day pay loan Map

porn clitclitoral pornclohe pornclose porn upclose ups porncloseups porn galleriesclothes on pornporn club 17 Map

Lots of good quality full text academic books now online

Wednesday, April 27th, 2005

Many librarians (and academics) are accustomed to being somewhat snobbish about the Internet and often rightly so. Historically it has been great at providing stuff dealing with current events and issues but it has tended to have a lousy “memory”. If you want to know what people wrote and thought prior to 1998 or so, you normally need to consult a library.
This is now starting to change. In 2003 there was a flurry of announcements from Amazon, Google and others about making the full text of books available online and searchable. I confess I assumed most of this would be out of copyright not very useful texts but Google Print just (covertly) made it possible to search just the books they have scanned in (thanks Google Blog!) and a search for major theorists like my old friend Bourdieu shows a lot of good, ‘major league’ academic books are now there. I also stumbled across some entertaining recreational reading

It’s harder to tell what Amazon has got as it mixes search by author and title etc with keyword search using the same form. Note - this only works if you search via Amazon.com (the US site) - it isn’t available through Amazon UK yet.

Of course even if you find a book you want to read via Google Print you have to read it on screen or print it out one page at a time. Amazon only lets you read a few pages from any one book as a taster.

P.S. If the book you want isn’t available in full text, a Yahoo or Google search for a book title may tell you which university or public libraries it is available at.

Update: I didn’t realise that Google like Amazon will only let you see a few pages at a time from any of the books you find if that book is still in copyright.

afg 1300 130 loan brokermp3 50cents et oliviaoak loan 1313 rdmp3 noots sum 41dumb 411 mp350s parent mp3 directoryrefinance 195 loandevotion mp3 415 Map

nudes movietekken moviebig movies tit freemovie nudestorture moviesthe movie glorymovie jjj post gallery thumbnailporn animal movies Map