Author Archives for

Mining Associations with Apriori using R – Part 1

Prologue: I have been practicing various skills and algorithms as part of my road map to becoming a mature data scientist. As part of this expedition, I have decided to document everything I work through. So whatever you read under this column will be either a summary […]

Calculating Confidence Interval for Classification accuracy

Prologue: I have been practicing various skills and algorithms as part of my road map to becoming a mature data scientist. As part of this expedition, I have decided to document everything I work through. So whatever you read under this column will be either […]

Classification using decision tree

Prologue: I have been practicing various skills and algorithms as part of my road map to becoming a mature data scientist. As part of this expedition, I have decided to document everything I work through. So whatever you read under this column will be either a summary […]

Prediction using Simple linear regression in R – part 2

Prologue: I have been practicing various skills and algorithms as part of my road map to becoming a mature data scientist. As part of this expedition, I have decided to document everything I work through. So whatever you read under this column will be either a summary […]

Prediction using Simple linear regression in R – part 1

Prologue: I have been practicing various skills and algorithms as part of my road map to becoming a mature data scientist. As part of this expedition, I have decided to document everything I work through. So whatever you read under this column will be either a summary of […]

Automate your Torrent with Dropbox

In this post I share how I automated my torrent downloads with Dropbox. For this hack you will need a Transmission client, a Dropbox account, and a Linux or Mac machine. Transmission is an open-source BitTorrent client. The good part of Transmission is that it has a graphical interface (GUI), a web interface, a command-line […]

Entity Extraction – URL

For this entity extraction task, my goal is to write a simple regex rule to identify the most common URLs in text documents. Example: http://shakthydoss.com , https://support.company.com , http://172.16.7.41/home/ , http://172.6.7.41/home?name=shakthydoss&year=2013 As said earlier, I took time to understand the structure that a URL is composed of. Every URL consists of the following units: The scheme name (commonly called […]
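As a rough illustration of the kind of rule the excerpt above describes, here is a minimal Python sketch. The pattern and the names URL_PATTERN and extract_urls are assumptions made for this example, not the rule from the original post. It only covers http/https URLs whose host is a domain name or an IPv4 address, like the examples listed above.

```python
import re

# Assumed pattern for illustration only (not the author's rule):
# scheme, then host (IPv4 address or dotted domain name),
# then an optional port and an optional path/query string.
URL_PATTERN = re.compile(
    r"https?://"                             # scheme: http or https
    r"(?:\d{1,3}(?:\.\d{1,3}){3}"            # host as an IPv4 address
    r"|[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+)"   # or host as a dotted domain name
    r"(?::\d+)?"                             # optional port
    r"(?:/[^\s,]*)?"                         # optional path and query
)


def extract_urls(text):
    """Return every substring of text that matches URL_PATTERN."""
    return URL_PATTERN.findall(text)


if __name__ == "__main__":
    sample = ("http://shakthydoss.com , https://support.company.com , "
              "http://172.16.7.41/home/ , "
              "http://172.6.7.41/home?name=shakthydoss&year=2013")
    for url in extract_urls(sample):
        print(url)
```

The sketch does not validate IPv4 octet ranges and ignores other schemes such as ftp, so it is a starting point rather than a complete URL matcher.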

Entity Extraction – Email id

The goal is to find an accurate and easy way to identify email IDs in text documents. I am going to use a regular expression and define a rule for the strings (email IDs) I am looking for. Example: shakthydoss@gmail.com, student-244722@wilp.bits.pilani.edu.com, gns4f-3895494981@sale.craigslist.org. Before blindly writing some junk regex rule, I took time to understand the format that email IDs […]
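A minimal Python sketch of such a rule might look like the following. The pattern and the names EMAIL_PATTERN and extract_emails are assumptions made for this example, not the rule from the original post. It is deliberately simple: it matches a local part, an @ sign, and a dotted domain, and it does not try to cover every address the mail standards allow.

```python
import re

# Assumed pattern for illustration only (not the author's rule):
# local part, "@", then a domain made of two or more dot-separated labels.
EMAIL_PATTERN = re.compile(
    r"[A-Za-z0-9._%+-]+"                  # local part (before the @)
    r"@"
    r"[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+"  # domain with at least one dot
)


def extract_emails(text):
    """Return every substring of text that matches EMAIL_PATTERN."""
    return EMAIL_PATTERN.findall(text)


if __name__ == "__main__":
    sample = ("shakthydoss@gmail.com, "
              "student-244722@wilp.bits.pilani.edu.com, "
              "gns4f-3895494981@sale.craigslist.org.")
    for email in extract_emails(sample):
        print(email)
```

Because each domain label must be followed by further label characters to extend the match, a sentence-ending period after an address is left out of the result.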

Changing time zone using ncurses interface

You’re in a different country or continent, or you just want to change the time zone (time setting) of your Linux machine. This Linux tip shows the easiest of the several ways to configure the time zone. The time zone setting on Linux is determined by one file, which can be found at […]

Error: couldn’t connect to server 127.0.0.1:27017 src/mongo/shell/mongo.js: exception: connect failed

Error: couldn’t connect to server 127.0.0.1:27017 src/mongo/shell/mongo.js: exception: connect failed. As a MongoDB learner, I don’t understand why this problem occurs so often on Linux machines. Luckily, I figured out a way to resolve the issue with the help of a few MongoDB experts out there on the internet. Below is the way I followed to put an end to the starting trouble […]