Seven Interesting Google Projects that a Data Professional may not have heard about:

Here’s the list:

1. Google Refine

2. Google Prediction API

3. Google Trends

4. Google Chart Tools

5. Google Big Query

6. Google Correlate

7. Google Fusion Tables

Note: Some of these projects are in Beta/Experimental stages and may be deprecated in the future, so they may not be ready for use in your production environment.

Thanks: I thought of writing this blog post after a discussion I had with Parth Acharya about Google and its projects for Data Professionals. He pointed me to some of the most interesting samples that used Google Fusion Tables, and here’s one of his blog posts on a related topic: Google Fusion Table & Data Visualization

There’s been a growing interest in Hadoop & Big Data. Here’s the Proof:

I like to keep an eye on technology trends. One of the ways I do that is by subscribing to leading magazines. I may not always read an entire article, but I definitely read the headlines to see what the industry is talking about. During the last 12 months or so I have seen a lot of buzz around Big Data, and I thought to myself: it would be nice to see a trend line for Big Data. Taking it a step further, I was also interested in seeing whether there is a correlation between the growing trends in “Hadoop” and “Big Data”. I also wanted to see how they compare with terms like Business Intelligence, Business Analytics and Data Science. So I turned to Google Trends to quickly create a trend report.

Here’s the report:

Big Data Hadoop Business Intelligence

Here are some observations:

1) There’s a correlation between the trend lines for Big Data and Hadoop. In fact, it looks like growing interest in Hadoop fueled interest in “Big Data”.

2) The trend lines for Big Data and Hadoop overtook that of Business Intelligence in Oct 2012 and Sep 2012, respectively.

3) The trend line for Business Intelligence is declining.

4) There seems to be a steady increase in the trend lines for Business Analytics and Data Science.

And here’s the Google Trends report URL: http://www.google.com/trends/explore#q=Big%20Data%2C%20Hadoop%2C%20Business%20Intelligence%2C%20Business%20Analytics%2C%20Data%20Science&cmpt=q
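If you want to build a similar report URL for your own terms, here is a minimal Python sketch. It only reproduces the URL format shown above (terms joined by commas and percent-encoded, followed by `&cmpt=q`); the function name `trends_explore_url` is my own, and I am assuming Google keeps accepting this URL format, which it may not.

```python
from urllib.parse import quote

def trends_explore_url(terms):
    """Build a Google Trends explore URL in the format used in this post."""
    # Join the terms with ", " and percent-encode everything
    # (space becomes %20, comma becomes %2C).
    query = quote(", ".join(terms), safe="")
    return "http://www.google.com/trends/explore#q=" + query + "&cmpt=q"

terms = [
    "Big Data",
    "Hadoop",
    "Business Intelligence",
    "Business Analytics",
    "Data Science",
]
url = trends_explore_url(terms)
print(url)
```

Swapping the entries in `terms` is all it takes to compare a different set of search terms.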

What do you think about these trends?

Two ideas to make your social network activities “Searchable”:

Some time back, I wanted to find one of my own social network posts. It was a resource I had shared, and somehow I was not able to “google” it (again). I eventually found it, but it took me 15-odd minutes of scrolling through my Twitter feed. It was NOT fun! I thought to myself: there’s got to be a better way! And it would be great to solve this not just for Twitter but for all my social network activities, including LinkedIn, Facebook Pages and Google+. So here are a couple of things that are working for me; I hope they help someone out there too:

Now, before we begin: when I say “Searchable”, I mean searchable by YOU (or a human being) and not necessarily by search engines. But it turns out that both my ideas also increase the chances of your social media activities getting indexed! With that, here are the ideas:

1) Syndicate your Social Network Activities (Posts/Images/Updates) to Tumblr/Blogger

I use IFTTT to syndicate my Twitter, Facebook and LinkedIn activities to Blogger

2) Create a post about your social network activities on your blog:

Here’s an Example: Things I shared on Social Media Networks during Oct 19 – Nov 11

Though Idea #2’s main goal is to keep my blog readers updated about my social network activities, it also acts as a good way to make my social media posts “searchable”.

And remember I said earlier that the chances of your social network posts getting indexed by search engines increase? That’s because posts on WordPress, Tumblr & Blogger are accessible to Google (unless you choose to block it). So that’s about it for this post. If you like the idea(s), please let me know! And if you have other ideas, let me know too. I am always looking for ways to make my social media activities easily searchable for me as well as for anyone else.

Let’s connect and converse on any of these people networks!

Paras Doshi on Facebook, Twitter, Google+ and LinkedIn

I played with Twitter Firehose for a couple of hours, and here’s how you can do so too:

First up: what’s the Twitter Firehose?

It’s a real-time stream of all public tweets! I had pointed out in an earlier post that Twitter gets 340 million tweets per day!

Twitter Firehose: 340 million tweets per day (Image courtesy)

Why did I want access to the Firehose?

Curiosity.

I had heard it’s expensive. Is it?

For an individual: absolutely! For companies: not if they know how to create business value out of it.

Note the words “couple of hours” in the title. I’ll explain that part later.

How did I get access?

Via DataSift. They had a free trial with a $10 credit, and I tried that. Check them out if you want to play with the Twitter Firehose. It’s fun!

What did I do with it?

I collected 15,000 tweets containing the words “Google” OR “Microsoft” over a period of 2 hours.

Total cost for me: $3-4

Note: I added the cost just so that you get a general idea. Look at the pricing page of DataSift for more details.
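To make the numbers above a bit more concrete, here is a small Python sketch. It is plain Python, not DataSift’s own filter syntax: the regular expression is just my approximation of a “Google” OR “Microsoft” keyword filter, and the sample tweets are made up for illustration. The two helper functions simply redo the arithmetic from the figures in this post.

```python
import re

# Matches "google" or "microsoft" as whole words, ignoring case.
KEYWORDS = re.compile(r"\b(google|microsoft)\b", re.IGNORECASE)

def matches_keywords(text):
    """Return True if the text mentions Google or Microsoft."""
    return KEYWORDS.search(text) is not None

def tweets_per_minute(tweet_count, hours):
    """Average collection rate over the whole period."""
    return tweet_count / (hours * 60)

def cost_per_thousand(total_cost, tweet_count):
    """Cost in dollars for every 1,000 tweets collected."""
    return total_cost / (tweet_count / 1000)

# Made-up sample tweets, only to show the filter in action.
sample = [
    "Trying out the new Google Maps",
    "Microsoft just shipped an update",
    "Having coffee with friends",
]
kept = [t for t in sample if matches_keywords(t)]

print(len(kept), "of", len(sample), "sample tweets match")
print(tweets_per_minute(15000, 2), "tweets per minute")
print(round(cost_per_thousand(3, 15000), 2), "to",
      round(cost_per_thousand(4, 15000), 2), "dollars per 1,000 tweets")
```

So my 2-hour experiment works out to 125 tweets per minute, at very roughly 20 to 27 cents per thousand tweets.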

Are there other Twitter data resellers?

Yes. As of now, it’s DataSift, GNIP and Topsy. Search for “Twitter Certified Data Reseller Products” to find the list. I was able to find a free trial from DataSift, and that’s why I tried DataSift.

If I just want to play with Twitter Data, what are the alternatives?

You can work with their streaming API, which gives 1% of tweets. You can find an example here: Grab Twitter search data using R and export to a tab delimited file

Conclusion:

In this post, I discussed how you can try the Twitter Firehose. I also pointed you to an alternative: the streaming API, which gives 1% of tweets. I hope that helps.

Mapping “Facebook Page Likes vs Country” using Power View in Office 2013

Just a quick note that you can quickly create maps with Power View in Office 2013. I just created one in 2 minutes:

Facebook Page likes VS Country

This seems like a great way to visualize where your fans are from. In my case, most of them are from India, so one actionable insight would be to schedule posts based on the time zone in India. And I can imagine that such reports could be very helpful to brands that have a sizable fan following on Facebook.
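As a rough illustration of the scheduling idea, here is a minimal Python sketch that converts a posting time in India to UTC. India Standard Time is a fixed UTC+5:30 offset with no daylight saving, so a fixed-offset time zone is enough here. The 9:00 AM slot and the date are arbitrary examples of mine, not a recommendation, and the function name `ist_to_utc` is hypothetical.

```python
from datetime import datetime, timedelta, timezone

# India Standard Time: fixed offset of UTC+5:30, no daylight saving.
IST = timezone(timedelta(hours=5, minutes=30), "IST")

def ist_to_utc(year, month, day, hour, minute=0):
    """Convert a wall-clock time in India to the equivalent UTC time."""
    local = datetime(year, month, day, hour, minute, tzinfo=IST)
    return local.astimezone(timezone.utc)

# Example: a post meant to go out at 9:00 AM in India.
post_time = ist_to_utc(2012, 11, 12, 9)
print(post_time.isoformat())
```

If your scheduling tool works in your own local time zone, you could call `astimezone()` with that zone instead of UTC.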

Here’s the screenshot:

Maps Power View Excel 2013 social media analytics

Thanks to the following blog posts for inspiration:

1. Google Fusion Table & Data Visualization (He used Google Fusion Tables, I used Power View!)

2. Creating Maps in Excel 2013 using Power View

Five examples of Recommendation Systems on the web:

Recommendation systems are an application of data mining technologies. As part of my research into how to implement a recommendation system, I studied recommendation systems that are already out there on the Internet. Here are five examples of recommendation systems on the web:

1. Amazon

Customers Who Bought This Item Also Bought:

recommendation systems amazon customers who bought this also bought

Frequently Bought Together (an example of Market Basket Analysis, a.k.a. Association Rules):

recommendation systems amazon frequently bought together
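To give a feel for the idea behind “frequently bought together”, here is a toy Python sketch that simply counts how often pairs of items appear in the same basket. This is only the co-occurrence counting step that market basket analysis starts from; it is not Amazon’s actual algorithm, and the baskets and item names are made up.

```python
from collections import Counter
from itertools import combinations

def pair_counts(baskets):
    """Count how many baskets contain each unordered pair of items."""
    counts = Counter()
    for basket in baskets:
        # sorted(set(...)) makes each pair order-independent and
        # counts it at most once per basket.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts

def frequently_bought_with(item, baskets, top_n=3):
    """Return the items most often bought together with `item`."""
    related = Counter()
    for (a, b), n in pair_counts(baskets).items():
        if a == item:
            related[b] = n
        elif b == item:
            related[a] = n
    return related.most_common(top_n)

# Made-up purchase history, one list per order.
baskets = [
    ["camera", "memory card", "tripod"],
    ["camera", "memory card"],
    ["camera", "case"],
    ["tripod", "case"],
]
print(frequently_bought_with("camera", baskets))
```

A real association-rules implementation would also look at measures such as support and confidence instead of raw counts, but the counting above is where it begins.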

2. LinkedIn

You should read this: How does LinkedIn’s recommendation system work? It will open your mind to the “recommendation” opportunities around you!

Jobs you may like + Groups you may like + Companies you may follow:

recommendation systems Linkedin Groups Jobs Companies

3. Netflix

Did you know about the Netflix Prize for improving their recommendation engine? If not, you should read about it!

Here’s their Movies you’ll love recommendation system:

netflix prize recommendation system

4. Twitter

People you may want to follow:

twitter who to follow recommendations data mining

5. Google

I do not have a screenshot, but I just wanted to point out that Google “personalizes” search results (a.k.a. recommends based on past behavior) using your search history. And you can switch that off, if you want: Turn off search history personalization

Conclusion

In this blog-post, we saw examples of recommendation systems. The key takeaway is that there is more than one approach to building a recommendation system. The approaches can be based on:

1. Past behavior

2. Past behavior of “friends”

3. The item that is being searched

And you can definitely mix and match!

I hope this post helped you understand an application of data mining that’s all around us! A question for you: where else do you see recommendation systems in action? Leave a comment!

Things I shared on Social Media Networks during Oct 19 – Nov 11

The goal of this series is to recap the conversations I’m having on social networks so that my blog readers do not miss them. So here is the recap of the last three weeks:

1. I was at SQL PASS 2012!

SQL PASS 2012 Paras Doshi

2. A nice Dashboard!

Metro-fied Business Intelligence Dashboard Windows 8

3. Learn to build an Enterprise Information management system using SSIS, DQS and MDS:

http://parasdoshi.com/2012/11/07/resource-learn-to-build-a-enterprise-information-management-system-using-data-quality-services-master-data-services-and-sql-server-integration-services/

 Enterprise Information management system using SSIS, DQS and MDS

4. Fake Data!

5. I reached 2000 points on MSDN!

Paras Doshi reached 2000 points on MSDN!

6. A nice video by Jeremy Howard on Predictive Analytics:

7. A nice data visualization via the Data Mining add-in for Excel

nice data visualization via the Data Mining add-in excel

8. Get started with Hadoop on Windows 7/Server!

Download here: http://parasdoshi.com/2012/10/27/getting-started-with-hdinsight-a-k-a-microsofts-big-data-hadoop-platform-on-local-windows-machine/

Demo Here: http://parasdoshi.com/2012/11/02/end-to-end-demo-hadoop-hdinsight-hive-excel-power-view-azure-data-market/

Hadoop on windows 7/server!

9. I was at Give Camp 2012! If you do not know about “Give Camp”, then you should check it out!

Here’s last year’s (2011) post: http://parasdoshi.com/2011/10/24/i-gave-back-at-dallas-givecamp-and-why-i-think-every-software-professional-should-consider-doing-so-too/

Give Camp 2012

Things I shared on Social Media Networks during Oct 11 – Oct 18

The goal of this series is to recap the conversations I’m having on social networks so that my blog readers do not miss them. So here is this week’s post:

1)

WOW! Global Internet Traffic (in Peta Bytes Per Month):
27483 PB/Month!
via http://www.evolutionoftheweb.com/

 growth of the internet big data

(Thanks Hardki pandya for sharing. Here’s his tweet: https://twitter.com/HPsay/status/258207371162181632 )

Also, for a related link:

https://www.facebook.com/suvendu.shekhar.giri said “This is really awesome and very informational ”

evolution of the web and surge in Data

2)

Patent wars: http://flowingdata.com/2012/10/08/patent-war/

patent wars google apple motorola htc samsung

3)

Quote: One reason Wikipedia is still a popular source for information is because most people don’t know they can edit it themselves.

Via https://www.facebook.com/thedolt and Gaurang Patel (https://www.facebook.com/Gaurang.patel7 ) commented that: “Yes, Absolutely true,most people don;t know they can edit post them selves,and i think they don’t have other alternative to get quick information.”

4)

https://www.facebook.com/janakiramm quote:

“Without Stackoverflow, 50% of the developers cannot continue to be developers and without Google, 80% of the developers cannot even claim to be developers!”

5)

Status update: “I just worked on a SQL Server 2012 Master Data Services Tutorial where I learned how to 1) create a new Model 2) Use MDS Excel Add – In to load entities to the model 3) use Master data manager web app 4) add a new member via MDM web app 5) And create a subscription view to retrieve entity members. if you want to do learn that: search for “SQL Server 2012 Developer Training kit“. Awesome resource!”

MDS SQL Server 2012 master data services tutorial

6)

Shared a Resource: Introducing SQL Server 2012 Master Data Services Improvements http://channel9.msdn.com/posts/SQL11UPD05-REC-03

7)

SolidQ mentioned me in a post about the Sep 2011 SolidQ Journal edition: https://plus.google.com/u/0/105279914944150120482/posts/cKh6hurWndj

8)

SolidQ India (https://twitter.com/SolidQIndia) Re-Tweeted my post about “Blog: Why am I not seeing “Month Names” in correct logical order?” http://parasdoshi.com/2012/07/22/powerpivot-model-why-am-i-not-seeing-month-names-in-correct-logical-order/

9) Milena Petrovic said “Lovely Graph” for this graphic: https://plus.google.com/u/0/105010538932095629627/posts/2uvE5B2HSk5

Image Courtesy: From the webinar “How to Walk The Path from BI to Data Science: An interview with Michael Driscoll, data scientist and CEO of Metamarkets” – A global surge in Data

That’s about it. You can read last week’s post here: Things I shared on Social Media Networks during Oct 3 – Oct 10

One Insight from Analyzing my Past Year’s Blog Traffic Data:

I analyzed my past year’s web traffic data, and here’s the insight: Traffic on Weekdays > Traffic on Weekends. In most cases, traffic on weekends drops by 40%!

In other words: fewer people read blogs on weekends than on weekdays.

What can you do with this insight?

> First, verify that it’s true for your domain/field of expertise. I blog about technology, so this insight is applicable to blogs like mine.

> If you plan to blog, say, twice a week, schedule those posts on weekdays (instead of weekends).

What data did I analyze?

– WordPress.com site stats

– Google Webmaster Tools search queries report. This gives you the number of searches that include your “blog post” as one of the results. Here’s what it looks like:

web analytics analyzing google webmaster's search data

It seems people are not searching for technical content during weekends. And fewer folks searching for “content” on search engines means less traffic (& fewer eyeballs).
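If you want to check the same thing for your own blog, here is a minimal Python sketch of the comparison. It assumes you have exported daily page views into a dictionary keyed by date; the one-week sample below is made up (I picked the numbers so that they land on a 40% drop) and is not my actual traffic data. The function name `weekend_drop` is my own.

```python
from datetime import date, timedelta

def weekend_drop(daily_views):
    """Percent drop from average weekday views to average weekend views.

    daily_views: dict mapping datetime.date -> page views for that day.
    """
    # date.weekday() returns 0-4 for Monday-Friday, 5-6 for Saturday-Sunday.
    weekday = [v for d, v in daily_views.items() if d.weekday() < 5]
    weekend = [v for d, v in daily_views.items() if d.weekday() >= 5]
    weekday_avg = sum(weekday) / len(weekday)
    weekend_avg = sum(weekend) / len(weekend)
    return (weekday_avg - weekend_avg) / weekday_avg * 100

# Made-up week of data starting on a Monday:
# 100 views on each weekday, 60 on Saturday and Sunday.
start = date(2012, 11, 5)
views = {start + timedelta(days=i): (100 if i < 5 else 60) for i in range(7)}

print(f"{weekend_drop(views):.0f}% drop on weekends")
```

With a full year of data in `daily_views`, the same function gives the average weekend drop for the whole period.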

Conclusion:

For my blog, the traffic drops by 40% on weekends. And that just means that fewer people are “listening” on weekends. Since I do not have a “schedule” for this blog yet, I will try to post things on weekdays instead of weekends. That’s what I would do. What would you do with this insight? Do you see the same traffic pattern for your blog?

How to add External Links to Facebook Page Tabs?

I figured out a way to add external links to my Facebook page, and in this blog-post I’ll share how I did that. It’s not a native feature, so I resorted to a third-party app to get this functionality.

This is how my Facebook page tabs look after I completed the process.

paras doshi faecbook page google plus twitter

You can see it is linked to my Blog, Twitter and Google+ from my Facebook page tab.

Before we start: you should have the proper admin permissions to install apps on a Facebook page, as we are going to use a third-party app for this blog-post. Here are the steps involved:

1) Go to https://www.facebook.com/iframehost > click on “Go to App”, right beside the Like button > click on “install page tab”

2) Select the Facebook Page and click on “add page tab”

3) Authorize Tab application (review the permission requests before agreeing)

4) Can you see the Tab Settings Page?

add external link facebook pages tab settings

*NGO XYZ is just a page that I created to show this. Ignore the phrase when you see that.

5) Say I want the tab to redirect to twitter.com/paras_doshi.

So these are the settings that I chose:

Tab Name: Twitter

Page Source: redirect

And I entered the URL that the tab will redirect to:

twitter paras doshi social media

6) I am going to leave the other choices at their defaults for now. You could explore them for advanced options like “this tab will only be visible to your fans and invisible to your non-fans”, among other things.

Click on Save Settings.

7) Can you see the new tab?

new tab twitter external link via facebook page

8) Test it!

9) Come back to your page. Now, don’t you want to change the image? Yes? Go to Edit settings for this tab:

how to edit settings of facebook page tab

10) Click on “change” for the custom tab image, and upload the custom image.

When done, come back to your page. Can you see it? Yes? Great!

external link to twitter on facebook page

11) To add more tabs, click on this tab as the “page administrator”.

When prompted with “Do you want to be redirected?”, click on Cancel.

Click on Settings from the admin tab options.

And add another tab:

add multiple external links on facebook page tabs

The limit seems to be 10 tabs.

And once you click on “HTML Page Tab #n”, then you’ll have to repeat the steps about configuring the settings for this tab.

That’s about it!

Conclusion:

If you want your Facebook page to point to your (or your organization’s) presence on other social media networks, now you know how to add external links and point your Facebook page fans to Google+, Twitter, your website or your blog, among other things.

Are there any other cool Facebook page techniques that you know of? I’ll be glad to learn about them. Please do post them in the comment section (along with your Facebook page URL!)

Related Articles:
1. Social Media Analytics. Facebook Page Smackdown: Status updates vs Images?
2. Five things I learned about “Art of report designing” from Facebook insights.