February 17, 2012 at 9:14 am
I basically want this thread to be a good way for the newbs like me to get up to speed with some of the knowledge the veterans have.
For example did you guys find books or web posts that when you read them helped you understand data mining better than before? If so post them up. I am trying to get myself up to speed on Data mining because i think the business is going to want it in the future. Any help in that process from you guys would be awesome and i thank you ahead of time.
Chris
February 17, 2012 at 9:25 am
I think the key thing to know about any data mining is the differences between causation, correlation, and coincidence. It doesn't matter how good your technical skills are on the subject, if you can't spot those.
After that, learn how to judge data quality. There are ten or twelve major issues you'll find in data quality, regardless of the tools you use or the techniques you use them with, that will cause data mining to fail or produce false results if you don't know them thoroughly. You have to be able to spot the classical patterns like dropped out time, contrary facts, et al, without hesitation. Converse for the positive data quality metrics. You need to know those just as well.
After that, it's just all about the tools. Those will vary, and in a shop that's just moving into the field you'll probably be able to define what you want instead of having to learn legacy tools. That puts you in the driver's seat on that point.
But no tool available can make up for mistaking coincidence for cause or missing that a datum is from the wrong time period to be applicable, for example.
- Gus "GSquared", RSVP, OODA, MAP, NMVP, FAQ, SAT, SQL, DNA, RNA, UOI, IOU, AM, PM, AD, BC, BCE, USA, UN, CF, ROFL, LOL, ETC
Property of The Thread
"Nobody knows the age of the human race, but everyone agrees it's old enough to know better." - Anon
February 17, 2012 at 9:34 am
Awesome GS thank you for the post very good info.
Viewing 3 posts - 1 through 2 (of 2 total)
You must be logged in to reply to this topic. Login to reply