Learning AI if You Suck at Math — Part Two — Practical Projects
If you read the first article in this series, you’re already on your way to upping your math game. Maybe some of those funny little symbols…
If you read the first article in this series, you’re already on your way to upping your math game. Maybe some of those funny little symbols are starting to make sense.
Also be sure to check out parts 3, 4, 5, 6 and 7.
But here’s another dirty little secret nobody tells you about AI:
You don’t actually need that much math to get started.
If you’re a developer or sys-admin you probably already use a lot of libraries and frameworks that you know little about. You don’t have to understand the inner workings of web-scraping to use curl. The same is true with AI. There are number of frameworks and projects that make it easy to get going fast without needing a data science PhD.
Don’t get me wrong. The math helps you feel confident about what’s going on behind the scenes. It allows you to read research papers and advanced books like Ian Goodfellow’s Deep Learning without your eyes glazing over. So keep studying the books I gave you in the last article. But if you want to start using AI, you can do that today.
Let’s get started with some practical projects.
My approach to learning is very similar to the excellent approach outlined in The First Twenty Hours. We all know the 10,000 hour rule. To truly master a skill you need to put in a lot of time. But we’re not there yet. We’re just getting started. Right now we’re trying to get from “this sucks” to “this is so much fun!”
The basics of the approach are simple:
Pick a project.
Get past self-crippling beliefs.
Try lots of stuff and fail fast.
Practice.
Easy, right? So let’s go!
Pick a Project
First off, you need a project that will really motivate you to get out of your comfort zone.
Kaggle is the place for machine learning. Right now they’re hosting a contest with a $1 million purse to improve classification of lung cancer lesions. Anyone can enter, including you.
Now I know what you’re thinking. There’s no chance that I win this. This is a contest for heavy hitters. Glad you brought that up, because that brings us to step two:
Get Over Self-Crippling Beliefs
The most important step in learning anything new is to shut down that little voice of self-doubt in your head as fast as possible. The First Twenty Hours advocates removing distractions, practicing on the clock and a number of other techniques. Throw in meditation, affirmations or heavy drinking. Whatever works. Just do whatever it takes to get that voice to go away so you can focus. If you need a self-help book to get over the hump, try You Are a Badass, a fun, funny, sarcastic masterpiece!
Here’s the deal: You do suck right now. But that’s OK! You won’t for long.
Feeling confused and frustrated is always the first stage of learning. So rather than beating yourself up, see it as a sign that you’re on the right track. You’re learning something awesome!
You probably won’t win the competition, but so what? Focus on getting a competent entry submitted before the deadline. Not everyone can win a marathon, but finishing one is a hell of an accomplishment in and of itself, right?
And you know what? You just might win. Seriously.
As an amateur you’re not burdened by years of theory and ideas that weigh down the professionals. Just remember the story about the student who solved two unsolvable math problems after finding them on the blackboard and mistaking them for homework assignments. The truth is data science is more art than science. It’s a field that attracts polymaths with all kinds of eclectic backgrounds. So get in there and try stuff.
Who knows what will happen?
Maybe you’ll see something the experts missed, make a real impact on cancer detection and take home some serious money to boot!
Try Lots of Stuff and Fail Fast
If you’re a dev-ops guy, you know this mantra. It applies to learning too. What I do is grab a bunch of samples of books and start skimming them quickly to see which ones make the most sense to me. Each person has a different style, so some books will work for one person and not another. Pick the one that works best for you.
There are a few books on machine learning out there, like Real World Machine Learning. Unfortunately, because the field is so new, most of the books are just starting to come out this year. You can pre-order Deep Learning: A Practitioners Approach or Hands-on Machine Learning with Scikit-Learn and Tensorflow.
But you don’t have to wait. Let me introduce you to my friend Safari Books Online. For forty bucks a month you can read as many books as you want and you get access to books in progress before they’re released, including the two listed above.
I’m going to save you some time though. Right now it’s totally unnecessary to learn how to code deep learning systems from scratch in Python, R, or Java. You need tools to get you rolling with Deep Learning fast so you can start working on your contest entry.
You want Keras with either TensorFlow or Theano.
You don’t even need to set it up yourself. Grab this sweet all-in-one deep learning Docker image.
Frankly, it doesn’t matter whether you use TensorFlow or Theano. They’re basically engines for running machine learning. At this point in your education, both are equal, so pick one.
Keras is a library of machine learning frameworks created by a top notch Google AI researcher. I had the good fortune of meeting the creator of Keras this weekend, Francois Chollet. He described Keras as the key to “democratizing AI.” He said that “deep learning is mature but it’s not yet widely disseminated…You don’t have to be an AI researcher to use Keras.” Instead you can just start playing around with all kinds of state of the art algorithms right away.
If you’ve already got a Mac or Linux rig with a good Nvidia graphics card you’re good to go. If you don’t, considering picking up an Alienware. I recommend the mid-range Aurora series. You don’t need a kick-ass processor. You need an SSD, a secondary spinning disk to dump data to, 16–64GB of memory and the best Nvidia card(s) you can afford. Focus all your cash on the cards, as they really accelerate deep learning. You’ll need to reformat it with Linux and get the latest binary drivers. Unfortunately, the open source ones won’t cut it for the latest chipsets. They’ll likely boot to a black screen. Fix that like this.
There are also some tutorials out there for building your own rig if you’re a do-it-yourselfer. Also, I just added my own tutorial in part three!
Lastly, you can use the AWS, Google or Azure cloud, but GPU compute in the cloud can get expensive quick. Better to buy than lease until you know what you’re doing.
Practice
Now you’re ready to get started. Here’s a super-simple example for getting started with Keras.
You are going to need an approach to the competition. Once again, I’m going to save you some time.
The most effective method of tagging and studying images at the moment is known as a convolutional neural net (CNN). Google, Facebook, Pinterest, and Amazon all use them for image processing and tagging. You might as well start with the best-practice, right?
In fact, if you head over the competition itself, get the data set, and check out the tutorial, you’ll see that it walks you through slicing and dicing the images and using a CNN with Keras and TensorFlow backend. Voila!
Frankly, you could do a lot worse than just implementing the tutorial and messing around with the parameters for a few weeks to see what you get.
After that get crazy. Throw different parameters and algorithms at it. Experiment and have fun. Maybe you’ll stumble across something the experts missed!
If you’re ready to try something more advanced after that, there are some great posts on the Kaggle Data Science Bowl 2017 board. Turns out data scientists are not above sharing some of their secret sauce. Check out this one, which helps you start exploring the data, which is a series of anonymized CT scans.
This one is more advanced and currently the most popular post on the board for good reason. It helps you do “pre-processing,” which is basically scrubbing and massaging the data to make it easier for neural nets to deal with more fluidly. It actually turns the 2D images into 3D images! Super cool!
Frankly, if you type out all this code yourself and get it running, you’re already kicking serious ass. This approach to programming is called “the hard way,” i.e., just type in the code without thinking about it until you understand it. There is even a series of books on Python and other languages that take this approach to learning, and it may work for you.
One warning: Someone posted a perfect score in the competition already. He did it in a clever way, by studying the leader boards and effectively doubling his training set size. It’s perfectly legal, but it won’t really help your goal, which is to learn about how to run neural nets against a training set for a good cause. I’d skip this approach for now, and focus on running Keras against the CT scans.
That’s it! With any luck you’ll help redefine cancer research and take home some cash too. Not a bad day’s work.
But even if you don’t win, you’ll be well on your way to learning how to use AI in the real world.
Whatever happens, remember to have fun!
Again, be sure to check out the other articles in this series if you missed them:
Learning AI if You Suck at Math — Part 1 — This article guides you through the essential books to read if you were never a math fan but you’re learning it as an adult.
Learning AI if You Suck at Math — Part 2 — Practical Projects — This article guides you through getting started with your first projects.
Learning AI if You Suck at Math — Part 3 — Building an AI Dream Machine — This article guides you through getting a powerful deep learning machine setup and installed with all the latest and greatest frameworks.
Learning AI if You Suck at Math — Part 4 — Tensors Illustrated (with Cats!) — This one answers the ancient mystery: What the hell is a tensor?
Learning AI if You Suck at Math — Part 5 — Deep Learning and Convolutional Neural Nets in Plain English — Here we create our first Python program and explore the inner workings of neural networks!
Learning AI if You Suck at Math — Part 6 — Math Notation Made Easy — Still struggling to understand those funny little symbols? Let’s change that now!
Learning AI if You Suck at Math — Part 7 — The Magic of Natural Language Processing — Understand how Google and Siri understand what you’re mumbling.
############################################
If you enjoyed this tutorial, I’d love it if you could clap it up to recommend it to others. After that please feel free email the article off to a friend! Thanks much.
############################################
If you love my work please do me the honor of visiting my Patreon page because that’s how we change the future together. Help me disconnect from the Matrix and I’ll repay your generosity a hundred fold by focusing all my time and energy on writing, research and delivering amazing content for you and world.
###########################################
A bit about me: I’m an author, engineer and serial entrepreneur. During the last two decades, I’ve covered a broad range of tech from Linux to virtualization and containers.
You can check out my latest novel, an epic Chinese sci-fi civil war saga where China throws off the chains of communism and becomes the world’s first direct democracy, running a highly advanced, artificially intelligent decentralized app platform with no leaders.
You can get a FREE copy of my first novel, The Scorpion Game, when you join my Readers Group. Readers have called it “the first serious competition to Neuromancer” and “Detective noir meets Johnny Mnemonic.”
Lastly, you can join my private Facebook group, the Nanopunk Posthuman Assassins, where we discuss all things tech, sci-fi, fantasy and more.
############################################
I occasionally make coin from the links in my articles but I only recommend things that I OWN, USE and LOVE. Check my full policy here.
############################################
Thanks for reading!
Hacker Noon is how hackers start their afternoons. We’re a part of the @AMI family. We are now accepting submissions and happy to discuss advertising & sponsorship opportunities.
If you enjoyed this story, we recommend reading our latest tech stories and trending tech stories. Until next time, don’t take the realities of the world for granted!