This is 's TypePad Profile.
Join TypePad and start following 's activity
Scientist at Microsoft's MSN; co-creator of BlogPulse. I blog at http://datamining.typepad.com/data_mining.
Interests: strategy, computational linguistics, weblogs, artificial intelligence, data/text mining, gis, social media.
Recent Activity
Metric driven Agile for Big Data
Working in Bing Local Search brings together a number of interesting challenges. Firstly, we are in a moderately sized organization, which means that our org chart has some rough similarities to our high level system architecture. This means that we... Continue reading
Posted Apr 20, 2013 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
O Knowledge Graph, Where Art Thou?
Posted Feb 10, 2013 at Data Mining: Text Mining, Visualization and Social Media
Comment
1
Participation and Observation in Search
The early days of web search were essentially about observation. The web search engine observed the web (documents, links and user behaviours) and then delivered results based on those observations. In recent years we have started to see more of... Continue reading
Posted Feb 8, 2013 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Better Beaches
Posted Jan 26, 2013 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Google's Year in Local Search
Posted Dec 14, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Snorkel* and Surf* in Kauai and Maui
Posted Dec 8, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
3
Viliam - Thanks for the comments. I'm deliberately putting this out there with plenty of issues. In the online software world, one can wait for a product to be perfect and never release it, or get feedback early and release often. That is what I hope to do with this system. Regarding lemmatisation, I now have this working on my dev system and will push it out this week. Hopefully you can take another look then. Some interesting issues arise, such as while it is ok to stem snorkeling to snorkel, it is less clear if you want to return hits on 'surf' when searching for 'surfing'.
Beach Search Engine Demo
I've written recently about building a perfect beach search engine. Here is a brief example of using the site. Let's imagine you want to find a beach that offers snorkeling, but you want to find one that is shallow because you have small children with you. A query for 'snorkeling AND shallow' br...
Beach Search Engine Demo
Posted Nov 28, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
2
Techno Punditry with Data
Posted Nov 17, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
1
Building the Perfect Beach Search Engine
Posted Nov 11, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
4
How Apple's Local and Mapping Investment Will Lead to Web Search
[I work at Microsoft where I work on projects that drive data quality in our local search experiences on Bing and other clients.] Most of the civilized world, by this time, has heard about Apple's fumble with their new mapping... Continue reading
Posted Sep 29, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Local Search - How Hard Can It Be?
Posted Sep 20, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
The State of Hawai'i Demands a New Search Engine
Posted Aug 25, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
3
Google's Visual Approach to Local Search
Posted Aug 16, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Google's Evolving Approach to Local Search Development
Posted Aug 3, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Livehoods - Behavioural Neighborhood Mapping
Posted Aug 1, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Dear Web, Thanks For Using My Screen
Posted Jul 29, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
2
@Marc Machielse - using more screen doesn't mean having longer lines spanning a wider column. Think of the evolution of television - widescreen was adopted there and, amazingly, textual information and other non video data has been adapted to that presentation format (think of news programmes). I'm a big fan of negative space (what designers call the empty spaces that form part of design), but I would also like to know what *opportunities* are seen in these wider screens. For example, I find all the advertising that clusters up the flow of reading to be horrible from a design perspective. It is, in fact, designed to make for poor reading. Why not explore how the additional horizontal space could be used instead?
Dear Web, Please Use My Screen
We've all been using widescreen desktops and laptop for a while. When will the web catch up? Urbanspoon: BBC Bing Google Plus
Dear Web, Please Use My Screen
Posted Jul 23, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
5
Bing's Evolving Local Search
Posted Jun 23, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
1
Microsoft Surface : Bigger than Windows 8 and Bing
Posted Jun 18, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
5 Hidden Skills for Big Data Scientists
1. Be Clear: Is Your Problem Really A Big Data Problem? There are many big data problems out there requiring huge compute scale, innovations in computation paradigms, vast storage space and so on. But just because your data takes up... Continue reading
Posted May 26, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
2
Zero Tolerance Search : 24 year old neuroscientist
[The idea behind 'zero tolerance search' posts is to illustrate real life search interactions that show how far we have to go in leveraging the explicit and implicit data in the web and elsewhere.] Yesterday, I heard part of an... Continue reading
Posted May 12, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
Excellent Visualization of Network Effect
Continue reading
Posted May 12, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
2
Graphing Twitter Attention
Posted Apr 30, 2012 at Data Mining: Text Mining, Visualization and Social Media
Comment
0
More...
Subscribe to ’s Recent Activity


