Ask HN: Is it possible to be a data science “script kiddy”?

7 points by dizzydiz 4 years ago · 8 comments · 1 min read


I have some research I'm looking to do on time-series in python in the coming weeks. In the longer term I do want to understand the math/stat underpinnings (I'm using Brilliant to catch up on maths), but, for now, could I make decent predictions/tests using python libraries? Perhaps by reading a layman's explanation of various outputs first (math is the limiting factor in this timeline)?

disgruntledphd2 4 years ago

Probably, but it's almost certainly not a good idea.

This is a pretty good (and free) textbook: https://otexts.com/fpp2/

To elaborate on my point above, you can definitely get some results quickly without understanding how it all works, but even a small bit of knowledge about whatever method you're using can help dramatically when you're trying to debug things.

I don't think the book above has that much maths, and it's definitely aimed at newbies to forecasting. The examples are in R though, which may not be particularly useful for you.

midjji 4 years ago

Tons of companies have hired, and are still hiring, people for little more than being able to download, install, and run the PyTorch examples, and who have possibly managed to retrain an existing network architecture for a similar task on a similar dataset. They are absolutely the script kiddie equivalent of a data scientist.

A simple trick question to see if you are one: Given a dataset with significant sample imbalance, how should the optimization be adjusted to account for this?
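One common answer to the imbalance question (a sketch, not necessarily the answer the commenter has in mind) is to reweight the loss so that rare classes count for more. The example below uses the inverse-frequency heuristic `n_samples / (n_classes * class_count)`; the function names and the toy 90/10 dataset are made up for illustration.

```python
import math
from collections import Counter


def balanced_class_weights(labels):
    """Inverse-frequency weights: n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples, n_classes = len(labels), len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def weighted_log_loss(labels, probs, weights):
    """Weighted mean negative log-likelihood for binary labels.

    probs[i] is the predicted probability that sample i belongs to class 1.
    """
    total, weight_sum = 0.0, 0.0
    for y, p in zip(labels, probs):
        p_true = p if y == 1 else 1.0 - p
        w = weights[y]
        total += -w * math.log(p_true)
        weight_sum += w
    return total / weight_sum


# Hypothetical toy data: 90 negatives, 10 positives.
labels = [0] * 90 + [1] * 10
# A lazy model that always says "10% chance of class 1".
probs = [0.1] * len(labels)

weights = balanced_class_weights(labels)
uniform = {0: 1.0, 1: 1.0}

unweighted = weighted_log_loss(labels, probs, uniform)
weighted = weighted_log_loss(labels, probs, weights)
print(weights)
print(unweighted, weighted)
```

With these weights each class contributes equally to the loss, so the lazy model is penalised much more heavily for ignoring the minority class. Resampling and threshold tuning are other options; which one fits depends on the problem.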

jstx1 4 years ago

Maybe. What's your question more concretely?

If you want to just figure out how to call some functions and get an output from them - yes, of course you can. Do you care about getting useful results? Do you want to do this as a full time job or is it just a one-off time series problem?

WastingMyTime89 4 years ago

Can you produce really insightful and correct analysis of time series without understanding what you are doing? No.

There are no shortcuts with statistics. It will all seem to work out fine until you involuntarily shoot yourself in the foot, and you most likely will, because doing forecasting properly is tricky. If you are lucky you will notice something doesn’t look right, but you might not.
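As one illustration of this kind of foot-gun (my example, not one the comment names): shuffling a time series before splitting it lets the model train on the future. A minimal sketch of a chronological split plus a naive "last value" baseline, using made-up numbers and hypothetical helper names, might look like this:

```python
def chronological_split(series, test_size):
    """Hold out the LAST test_size points; never shuffle a time series."""
    if not 0 < test_size < len(series):
        raise ValueError("test_size must be between 1 and len(series) - 1")
    return series[:-test_size], series[-test_size:]


def naive_forecast(train, horizon):
    """Baseline: repeat the last observed value for every future step."""
    return [train[-1]] * horizon


def mean_absolute_error(actual, predicted):
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical toy series, oldest observation first.
series = [10, 12, 13, 15, 16, 18, 19, 21]

train, test = chronological_split(series, test_size=2)
baseline = naive_forecast(train, horizon=len(test))
mae = mean_absolute_error(test, baseline)
print(train, test, baseline, mae)
```

If a library model cannot beat this baseline on the held-out tail, that is a hint that something is off, either in the model or in how it is being evaluated.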

sarusso 4 years ago

Are you looking to just do some forecasting? Univariate or multivariate time series?

pcunite 4 years ago

What kinds of source data and from where?

throwawaynay 4 years ago

So there's https://fast.ai MOOCs+library which don't require a big theoretical background

I heard about a program that looked great to do data science from just data a while ago, but I can't find it :/ It was on Show HN I think

I found these though, don't know if they're going to be helpful: https://towardsdatascience.com/top-8-no-code-machine-learnin... https://www.obviously.ai/
