We have these emails hanging up all over our office, sent in from Khan Academy users with incredible, personal stories to tell. Every time I read a new one I’m emotionally affected, which means my robot emotion chip is faulty.
So when some curious soul (like a reporter) wanders in and asks me, “How will you know if Khan Academy is really successful?” I always answer their (totally valid) question with an explanation of our data, analytics, and fancy metrics — but what I’m really thinking is, “You haven’t read these letters.”
Let’s change that. These students’ and parents’ and teachers’ stories are now available for anyone to be inspired by. It is impossible to read them…go ahead, I challenge you…and not come away with the conclusion that a free educational resource like Khan Academy simply must exist.
Call me a softie. It’s not like I don’t believe in data as the final arbiter of any learning tool’s effectiveness. I do. But if our key data metric happened to be “# of page-long, authentic stories sent in from users who have turned their lives around in the face of drug addiction, unleashed their 2nd-grade son on the advanced math he’s fully qualified to handle, or earned acceptance into a university despite being stuck in a country that does not value education,” I don’t think I’d second guess seeing that number skyrocket.
Hopefully these stories inspire others as much as everybody on our team. We spent time designing the page to celebrate the authors, their letters, and the fact that these are real lives, not product testimonials.
Does the page accomplish this? Feedback welcome, good and bad.
I already can’t wait to drop some major challenges in the laps of our two incoming Fall interns to see what they can build.
- Khan Academy Internship, Summer ‘11
Check!
David Hu and Julian Pulgarin stepped up to the plate this Fall during their coopsinternships from University of Waterloo. We call ‘em internships because we’re from Amurrrrica, you crazy Canadians.
When hanging with friends and family recently, I found myself shocked by how willing my brain is to completely forget stories that were surely once described as unforgettable. I don’t have any desire to forget what we’re doing at Khan Academy, and that’s kinda sorta why being as open as possible is one of our core dev principles.
The Summer ‘11 story is already a nice piece of shared history that helps me answer every intern candidate who asks, “What kind of project do interns work on at Khan Academy?”. Here’s my version of the Fall story.

If tales about battling a bison for the right to cross a road can get up and walk out of my head,
then…well…better keep blogging.
It doesn’t get much more open than David’s post about how we use machine learning to assess student mastery. If I tried to summarize it for you, you’d see my managerial hair start spiking up and up and up toward the ceiling, Pinocchio-style. I won’t insult the work by talking about the statistics. Instead, I’ll just say that we now have a much better understanding of how competent each Khan student is in each math subject. Thanks, David.
That’s not all he did. From a dashboard to emphasize the importance of our exercises to a much smoother way of asking students to review work they’ve done in the past, he covers it all in this Vi Hart-inspired internship post-mortem.

Keeping an eye on our students’ activity via David’s dashboard

Forgetful Ben from the future is grateful for this video. And he’s also super-jacked and smart.
If anybody reading this has ever beaten Julian Pulgarin in chess, please rub his nose in it. Julian wiped the floor with me so many times that he would beg me to play him while he was blindfolded. My ego can’t handle that sort of hit, so I usually just deleted some data from production and pretended to be disappointed at each newfound emergency’s particularly poor timing.
When he wasn’t humoring me, Julian made major contributions across the board.
He started by building a number of new exercises for students to learn from, including an experimental crack at a new way of teaching fraction intuition. While working on this, he realized that it was painful to test our open source contributor’s GitHub pull requests, so he disappeared for a few days and came back with Sandcastle. Sandcastle automatically tags every pull request with a link that lets our developers test out the requests’ new exercise content in one click.

It’s a shining example of the reason we hire smart people and set them free to get things done. The first time Julian told me about his Sandcastle idea, I didn’t even really understand the direction. Now it’s indispensable.
Julian also gave our first KA Friday Tech Talk about how to do gradual feature rollouts for various segments of a large userbase. He ended up bringing this full-circle at the end of his internship by building Gandalf, our tool for doing the following:

Gandalf lets us selectively roll out features to all sorts of different subsections of our userbase.

“YOU SHA — ok, you guys, Hey!, you guys over there, you can pass — BUT OTHER THAN THEM YOU SHALL NOT PASS.”
Julian even “accidentally” left his chess set in Mountain View for us to mail back to him, which I’m pretty sure was his way of dropping the mic and walking off stage. “You clean up.”
In conclusion, University of Waterloo is legit. I skipped over plenty of work done by both David and Julian, and it still makes for an impressive Fall Winter there are seasons in Mountain View? As argued in the previous internship’s summary, any team that’s not dedicating tons of resources to both recruiting and mentoring interns is plain old missing out. We’re loving every minute of working with our interns.
Much like the phoenix or a tyrannosaurus flying a fighter jet, Summer ‘11 intern Joel Burget has risen from out of nowhere and dropped entertaining stories about his summer’s work. If you read them, are impressed, and want to hire Joel…you can forget it. He’s now full-time Joel.
If you wanna tag in next, the Summer ‘12 internship class still has openings.
This is my new favorite jQuery trick. I just learned it this year and have mentioned it in enough code reviews to decide it’s worth sharing.
When manipulating the DOM with jQuery, you often see code that looks something like:
$("#container").show();
$("#container .error").hide();
$("#container .zoo").css("background-color", "white");
$("#container .zoo .monkeys").empty();
$("#container .zoo .title").text("The zoo is empty");
$("#container .zoo input").val("");
$("#container .zoo").animate({height: 250});
…or, if somebody gets concerned about performance, they might try to reduce DOM lookups:
var container = $("#container");
container.show();
container.find(".error").hide();
container.find(".zoo").css("background-color", "white");
...
…and so on. Odds are, when you’re manipulating a single element like container, you’ll probably be doing something to nearby elements in the next few lines of code.

Readers of this blog, meet Emma. Emma, meet the three readers of this blog.
Enter .end(), newlines, indentation, and chaining. When you have one jQuery chain going and modify it with, say, .find(), you’re actually pushing the new chained set of elements onto a stack. .end() pops the current jQuery chain off the stack, which lets you do stuff like this:
$("#container")
.show()
.find(".error")
.hide()
.end()
.find(".zoo")
.css("background-color", "white")
.find(".monkeys")
.empty()
.end()
.find(".title")
.text("The zoo is empty")
.end()
.find("input")
.val("")
.end()
.animate({height: 250});
It’s easy to read, because the indentation is significant and matches up with DOM nesting levels. It gets rid of unnecessary DOM lookups. But most importantly for me, it feels natural to indent in and out as I write the code, using small additional selectors to step deeper into the DOM and .end() to find my way back out.
I’ve been destroying my old, ugly var’s left and right with this trick. I hope it helps somebody else.
Those who have worked with me will know that I’m an expert on this topic because my code gets laughed at all the time.
I’ve seen Good laughing and Bad laughing. Good is what I imagine happens when Robert De Niro sits next to Al Pacino as they’re watching Al’s cameo in some trainwreck of an Adam Sandler movie, and Bob turns to Al smiling and says, “This is awful.” It’s when you stare at some code and think, “Good grief. I can just imagine whatever took priority over making this code more reasonable.” There’s a knowing wink and a friendly jab between the laughed-at ghost-coder and the laugher:
Hah! I almost feel bad that you wound up solving it this way. I’ve been there before. You must be exhausted. Why don’t you sit down, I’ll take it from here.
Good laughter contains respect for the fact that this code exists at all and is being worked on by more than one programmer, which is more than you can say for what I’d bet is a whole boatload of perfectly refactored, unused files littering hard drives around the world right now.
That’s the type of laugher that’s erupting at Khan Academy these days, and it’s no coinkydink that now is right about when a team of top-notch coders are getting their first gazes into some of my previous creationsabysses. I can say with certainty that in these cases the knowing winks and playful jabs are well-deserved.

The rare (and best) third type of laughter.
A quick story before we get to the Bad laughter. Did you know that when Sal was a one-man show, the entire Khan Academy application was one big main.py file? At least the server-side stuff. All the request handlers, URL mappings, datastore models, data migrations, and even some HTML generation in one big 1000+ line file.
How funny is that! And it stayed that way for months! I mean, please. Who the heck wants to work on a codebase that’s one big file? If you haven’t detected the dripping sarcasm yet, recalibrate your sarcasm detector and start this paragraph again.
It’s easy to see how judging Sal by that one big file while he was busy making 2400+ free educational videos would be like judging a geek for wearing a t-shirt while she was just trying to look presentable enough to get to her computer to start writing code. Guess what? Sal’s code is still around, and it’s responsible for helping teach literally millions of students. Yet, as time goes on, it gets more and more likely that one day somebody will laugh at an out-of-place line with the type of judgment that can only come from being out-of-touch with what really mattered back then…and the fact that that line helped change many students’ lives.

You’ve gotta bend this brilliant tweet a bit to apply it to non-profit education,
but you’ll notice “good code” and “bad code” aren’t on the list.
Bad laughter doesn’t need much explanation, it just lacks respect. It happens too much in our industry, and I’m not sure why. I’m proud to say that it’s not a problem at Khan at all, but we haven’t always been 100% immune. We weren’t always 100% immune at Fog Creek. I’ve been guilty of this laughter myself, and I’d bet money it exists elsewhere. It’s most common among coders who feel like they need to prove themselves, and it can be combatted by a team that emphasizes shipping and the healthy laughter that comes from reminiscing about the last crappy hack you were responsible for when you decided to Just Ship It.
Here’s (just one of) ours:
PLAYLIST_STRUCTURE = [
{
"name": "Math",
"items":
[
{
"name": "Arithmetic",
"playlist": "Arithmetic"
},
{
"name": "Developmental Math",
"items": [
...
That’s our current Khan playlist structure, defined in code. When Sal adds or renames a playlist, we have to change the code. Laugh it up.
This will actually go away soon in favor of something much nicer, and I’m not arguing for crappy code. I’m putting this example here because it got us this far, without many problems, and who knows where we’d be if Sal had spent the first iteration of khanacademy.org trying to decide on the perfect playlist data structure.
…and a more rambling attempt to get a similar point across to a group of UIUC students by playing this powerful Thank You, Khan Academy video before revealing main.py.
We had some heated debates a while ago about what would happen if we opened up all of Khan Academy’s content for logged out users. Sal’s videos have always been open in this way, of course, but all the interactive exercises and statistical tracking and badges and stuff required an account.
It felt like the right move when we reconfirmed our belief that educational content should be as open and available as possible. We were also persuaded by Fred Wilson’s belief that giving logged out users more power is an effective way to empower your community. But we worried that registrations would drop, because by handing our 250+ exercises to logged out users, we’d be drastically shrinking the carrot on the other side of the “please login!” boundary.
We decided to go for it this summer. Figured I’d share some results.

A dog
4+ months after the change, we know a little more. Jace looked at the data and found:
So more visitors are trying exercises, but they’re not converting into more registered users…yet. We could have fun making up all sorts of explanations. Maybe we need to show off the badges and points unregistered users are accumulating a little bit more, maybe we’re not asking visitors to login forcefully enough, maybe doing math exercises makes people tired, maybe we don’t have enough pictures of golden retrievers on the login page.

Percentage of all visitors who use our exercises
Far more important than any random explanation is the fact that we’re now getting X% more data about how users learn (or don’t learn) thanks to these new exercise visitors. That data is powering some of the best work coming out of Khan Academy so far, so feeding more logged out users into our exercises is, as Jace puts it, a big win on data collection.
Plus, we’re confident that we could iterate and A/B test our way to higher registration rates for our new exercise users. It’s not our priority (for this week, at least). We’re busy cooking up important changes to the core learning experience.
*They don’t have access to everything. The ability to coach and communicate with other users is still restricted to users who have logged in.



After releasing GAE/Bingo, we received a number of worried correspondences from various very worried correspondents. It seems that GAE/Bingo, along with practically every other A/B testing framework out there, violates some purist principles of how to do significance testing.
The crux of the argument, reworded so simply that I’m pretty sure all statisticians (I admittedly know nothing about stats) would string me up:
If you repeatedly check the results of an experiment, sometimes you’ll see statistically significant results that aren’t actually significant.
So if you’re constantly checking your A/B dashboard and making decisions based on what it tells you, you’re often screwing up.
It’s a mathematically sound argument, as explained to me by my much smarter teammates. And it must be absolutely devastating for all the programmers who went out and bought the Razer Mamba Elite Wireless Gaming Mouse just to increase their click speed so they could mash the refresh button on their A/B dashboards as fast as possible.
Here’s the thing. I know of absolutely nobody who runs an A/B test like a crazed puppy who keeps sprinting loops around your legs hoping that…..ohboyohboyohboyohboy…..after the next 360° you’ll have lowered the puppy treat to the floor.
That doesn’t mean the argument isn’t valid. If you do check your dashboard every 5 seconds like a crazed puppy and immediately end experiments at the first sign of stat sig, then you probably should read the article and…..ummmmm…..find better uses for your time.
Luckily for us, one of my much smarter teammates with much more experience analyzing numbers’n’stuff landed an early modification to GAE/Bingo that should pacify all worried correspondents:

A historical graph of one of our A/B tests and each alternative’s performance. On our dashboard it’s interactive, weeeeeeeee.
By showing this graph everywhere our dashboard shows A/B results and waiting for the results to stabilize, we can be confident that we’re not making a snap judgment in the zone of idiotic decisions.
Ok, good. We’re safe. But what about everybody else? Did Fog Creek and 37signals and everybody including Google immediately start hemorrhaging money due to their reliance on faulty A/B tests which this truth came to light???
My guess is no, because A) they aren’t making snap judgments at the first sign of stat sig and B) with significant traffic, many of our A/B test experiments don’t even have a zone of idiotic decisions. Lots of ‘em look something like this:


…and it’s pretty clear in which cases a difference has been made.
Building GAE/Bingo required reaching into the bag of performance tricks a couple times. We needed long-term persistence of the data behind many A/B experiments, with stats accumulating at 500 reqs/sec, without slowing down pageloads.
Some of the wabbits we pulled out of the hat are pretty cool. Some are probably really stupid but get the job done. I’ll throw a couple your way and let you choose which is which.
GAE/Bingo makes it easy to run A/B experiments in one line of code: ab_test("monkeys"). This means during any given request, an individual user might interact with a number of different experiments depending on various code paths. Without knowing ahead of time which ab_test("monkeys") or ab_test("gorillas") or ab_test("chimpanzees")’s are going to run, we need to minimize the number of roundtrips spent talking to memcache or the datastore.
At first it makes sense to put a collection of all Experiment models in a single memcache slot. This is kinda helpful because at the beginning of the request you can:
experiments = memcache.get("experiments")
…and then each time you need to work with an experiment you can:
experiment = experiments[experiment_name]
You’ve only got one memcache call regardless of how many experiments you’ll be interacting with on each request.
There’s a big problem. Getting a bunch of objects out of memcache also involves deserializing all of those objects, not just transferring them over the wire. And at least until the 2.7 release ships, deserializing objects innnn ppuuurreeee Pyyyyttthhhooonnnnn iiiiiiiiiisssssssssssssss rrrreaaaaaaaaaaaaalllllllllllllyyyyyyyyyyyyy ssssssssssssssssssslllllllllllllllllllllloooooooooooooooooooowwwwwwwwwwwwwwwwwwwwww. If you’re running, like, 50 live A/B experiments and your user needs to interact with, like, three of them when requesting /profile, you’ve got, like, 47 experiment-deserializations worth of performance waste.
Here’s where we pulled out the same trick we first used when building a fast real-time badging system for Khan Academy. Since you’ll almost never be interacting with all of the experiments in an individual request, you really really Really don’t want to spend time deserializing them. So what happens if you serialize the objects once (in this case using protocol buffers) before putting them in the memcache collection?
Things are a little bit slower when creating the collection:
memcache.set(
"experiments",
{
"monkey": db.model_to_protobuf(experiment_monkey).Encode(),
"gorilla": db.model_to_protobuf(experiment_gorilla).Encode(),
...
}
)
Getting the collection from memcache is MUCH faster:
experiments = memcache.get("experiments")
…because instead of deserializing 50 experiment models, you’re unpacking 50 pre-serialized protocol buffers. Fast. Many much fast.
Now, when you need an actual experiment model, you just:
experiment = db.model_from_protobuf(entity_pb.EntityProto(experiments[experiment_name]))
…and only pay the deserialization penalty when you actually need to use the object.

This screenshot from my recent Khan Academy Friday tech talk should make everything totally clear. Especially without any context, I’m pretty sure seeing a picture of a gnome riding a platypus while carrying a monkey explains everything.
Bottom line: we make one single memcache call coupled with the minimum amount of experiment deserialization necessary, regardless of how many A/B experiments are running or which experiments are used by each request. There are lots of arguments over the fastest way to (de)serialize objects in App Engine — the fastest is to avoid the issue as much as possible.
Where we use this:
Split testing experiments and our real-time badge framework.
I’ll get in trouble for this or maybe even ruin the fun for everyone. This involves starting an asynchronous datastore put via db.put_async and then walking away with your hands in your pockets acting like nothing happened.
There are very clear instructions by the talented App Engine team that you should find some time to wait for the get_result() of any call to db.put_async(monkey). It’s also very clear that if you don’t wait for put_async to finish, App Engine is going to wait for you. In other words, you can’t magically send off a bunch of put_async’s and then send your response to the user without waiting for the put to complete.
db.put_async(experiment_monkey)
return render_template("mwuuahaha_im_not_doing_anything_else.html")
You can, however, send off a put_async and then do everything else your request could possibly think of doing (including rendering templates and such) without waiting for the response. App Engine will make sure the response finishes, but if you just kick off the put_async and then walk away and handle the rest of your request, you can maximize concurrency of your request’s work w/ the asynchronous put.
There are *lots* of other ways to get a very similar effect. All of them are probably more kosher. I won’t list them here. This just happens to be a neat little trick that you can trigger with one line of code without worrying about any other boilerplate.
Where we use this:
Find the spots yourself and make fun of me. I have a strong feeling this’ll be replaced in the future.
This one doesn’t belong in a “you’ll probably never need” post. It’s extremely common and handy: throw data in memcache (fast) and then run a background task or cron job that persists the data from memcache to the datastore (slow).
We actually get a little trickier because we need to persist lots of data that’s coming in quite often: each and every user’s participation and conversions in each and every A/B test. These events could be triggered multiple times per request for each user. It’s not quite clear how we’d put this data in memcache and what scheme would be running in the background to send it all to the datastore.
We opted for a bucketing system. Every time a user participates or converts in an A/B test, we randomly choose one of 50 memcache buckets and throw their user id in the bucket. When any of those buckets begins to overflow, we fire off a deferred task queue task to poke through the overflowing bucket, pull each user’s data out of memcache, and whisk it into the datastore.
bucket = random.randint(0, 50)
key = "_gae_bingo_identity_bucket:%s" % bucket
list_identities = memcache.get(key) or []
list_identities.append(ident)
if len(list_identities) > 50:
deferred.defer(persist_gae_bingo_identity_records, list_identities)
Where we use this:
GAE/Bingo and, in simpler fashions, pretty much all over the place.
If any of these hacks help someone else out, please let me know ASAP so I can win the bet against myself.
Regardless, expect more of these posts in the future. Tricks like these have been critical to keeping Khan Academy fast while adding new tools.
Continuing my trend of straight-up copying the work of the smartest people I know, I recently decided to tackle Khan Academy’s A/B testing problem (we didn’t have any A/B testing) by bringing Patrick McKenzie’s A/Bingo into App Engine land.
So here you go: GAE/Bingo is released and should get anyone on App Engine up and A/B testing in minutes. It’s currently in production on Khan Academy and performing well with hundreds of requests per second.
A/Bingo is an A/B testing framework for Ruby on Rails. It’s specifically designed to make the creation of split test experiments as quick and painless as possible.
GAE/Bingo is a re-imagining of this framework’s core design principles inside of Google App Engine. GAE/Bingo was specifically built for use at Khan Academy, which means:
We’ve got a million and one ideas to try out at Khan Academy. What tweak to our game mechanics will best motivate students to challenge themselves? What message makes it most likely for a student to sit back and watch a video when they really need to take time, slow down, and re-learn a core concept?
An A/B testing framework gives us the tools necessary to start answering these questions with experiments and hard(er) data. With ~1.5MM practice exercises answered per school day by Khan Academy students, we have a treasure trove of student activity from which to learn.
We also wanted to spread the love. Patrick helped out the Rails community by open sourcing A/Bingo, and we wanted to do the same for App Engine. I also couldn’t find any Python split testing framework that satisfied our needs and stayed true to the design principles of A/Bingo.
Plus, why not take advantage of the fact that App Engine’s vertical stack empowers framework creators to go pretty far when it comes to creating a drop-in, It Just Works experience? We hope GAE/Bingo accomplishes this and helps out some others in the community.
Start an A/B test in one line:
# Returns "chimpanzee" to half your users and "zorilla" to the other half
animal = ab_test("cute_logo_animal", ["chimpanzee", "zorilla"])…and when something good happens, score a conversion in one line:
bingo("cute_logo_animal")These two lines will automatically take care of experiment creation, user tracking, consistent A/B results for each individual user, and statistical analysis. You can do a lot more, of course, when it comes to specifying alternatives and tracking conversions — check out the docs. There are some pretty simple (optional) hooks that make it very easy to get consistent A/B results even when your users transition from anonymous to logged-in.

Trivial example: an A/B test proving that messaging a student with “You’re ready to move on!” is statistically more likely to encourage a student to move into to new content than “Nice work!”
Once at least one user causes the above lines of code to execute, you’ll get statistical analysis and be able to control your experiment from the dashboard.
Please do. Let me know how it goes via Twitter or email. Patch up all the inevitable bugs and fill in all the major holes left by our desire to ship v1. We’ll be continuing to improve the framework, and all help is welcome.
Enormous thanks goes out to Patrick McKenzie for his framework’s inspiration and the encouragement to follow his lead. I’ll be blogging more in the future about how we’re using GAE/Bingo and how we keep track of hundreds of requests per second w/ persistent storage and minimal impact on page load times.
Want to be handed a major portion of Khan Academy ownership, ridiculously high expectations, and a bunch of mentorship from our full-time devs? Sign up now. We believe anybody can help the world get a great education, and we accept interns year-round.
I can’t remember a time in my history of small company software development that hasn’t felt like sitting in a rickety donkey kong cart with jet afterburners attached and blazing, and everybody inside is just trying to keep the staples (why did they use staples?) to hold long enough for us to make it around the next bend in the tracks.
That being said, at least in my limited donkey kong cart experience, summer internship seasons always stand out in my mind as new high watermarks for shipping speed and dev intensity.

Just like Sal’s videos, our practice exercises are at the very heart of Khan Academy.
This summer’s class of Khan Academy interns has been no different. Our interns come in with promises of being handed ownership and control over major, user-facing features, and in return we demand excellence; it’s pretty similar, actually, to Khan Academy’s educational belief of encouraging experimentation but expecting mastery. They were shipping features on day one.
Any dev team out there not acknowledging the fact that high school and college students are capable of showing up on your doorstep and almost immediately redefining major portions of your product for the better is either failing to recruit well or is plain old missing out. Big time.
If that’s you, I hope to change your mind with this post.
Each of the following improvements to Khan Academy was contributed either largely or entirely by our interns this summer. Four of ‘em are in college, one just graduated high school, and one hasn’t even started applying to colleges yet.
This is a major body of work.
We learned about all types of weaknesses in our old exercises after last year’s pilots, and we’ve tackled them head on by improving our hints, removing multiple choice answers, focusing on the user’s exercise experience, and building new ways of asking old questions.

Almost all multiple choice questions are gone.
We’ve focused on helping our developers and our community create new exercises quickly. We’ve written better documentation, shipped simpler dev tools, and built solid bug reporting workflows to maintain a healthy stream of new, quality exercises. Our interns are responsible for not only porting all existing exercises to our new tools but also developing brand new frameworks to help exercises draw graphs, randomize questions, generate procedural hints, and more.

The Summer 11 interns’ recently launched new exercises have already served up over 5,000,000 math problems to Khan Academy students.
We know that at the end of the day, the only thing that matters is whether or not Khan Academy students are really learning, and a large quantity of quality interactive exercise content is core to that mission. KA wouldn’t be what it is today without the large quantity of quality videos Sal has created. As a development team and community, we should consider ourselves challenged to match his videos with quality exercises. The team’s efforts this summer have given us the tools necessary to take a crack at this considerable challenge.
In fact, exercises are so important to us that we’re now hiring full-time exercise developers to come push these tools to their limits and redefine what it means to learn online. If you want to join some very passionate devs, either apply now or join our open source exercise community.
Followers of this blog (also: leprechauns, unicorns) may remember a post in which I struggled to decide if Khan Academy should follow Stack Overflow’s registration model by allowing non-logged-in users to participate in all of our content and automatically transferring their work to a permanent account after they log in.
The more we thought about it and read posts about unregistered users like Fred Wilson’s, the more we realized that making access to our educational content as easy as possible was the right thing to do.

We track users’ progress and encourage them to log in, but we never get in anyone’s way when they’re trying to learn.
Two interns completely owned the design and implementation of this feature. If you go to Khan Academy now, you can start earning points and badges for watching videos and working on exercises without ever logging in. We encourage users to login at various milestones in their progress, but we never stop them from continuing to use the site or force them to close a popup.
If you log in, you keep all your progress. This has significantly reduced our bounce rate by getting rid of painful login walls, and we’re continuing to watch other statistics to see the effects of this change.
I can’t count of the number of times I’ve heard users ask us to display individual video progress next to each one of the 2,500+ video links on our homepage. I’m not gonna get into the various technical challenges here (maybe a different blog post), but this is nontrivial for a brain like mine.

Luckily, we hire interns that are way smarter than me and are able to solve such problems. These days, whenever you watch a piece of a Khan Academy video, skip around it, pause it, play it, or whatever, we keep track of extremely precise video progress statistics and display useful progress indicators next to every video link.
One of the earliest features launched during Summer ‘11 was the ability to share videos, exercises, and badges on Facebook and Twitter.

Facebook sharing has slowly gone up (even during the natural academic lull of summer), and we hope this trend will continue.
…and those are just the bigger changes. Our interns continue to claw for inch after inch after inch of improvements like nicer internal statistics, faster deploy scripts, performance tweaks, and better user account management.
Summer’s not over yet. Omar Rizwan, Jeff Ruberg, Joel Burget, Igor Terzic, Parker Kuivila, and Ben Alpert continue to set the current bar for Khan Academy internship classes. As an organization, we aim to beat this mark in the future. But those six are pushing hard to make it a tough task, and khanacademy.org is improving for the better, quickly.
Don’t doubt the inexperienced. Get your team’s recruiting, mentorship, and code reviews right (easier said than done), and a summer internship can be one of the best things that’s ever happened to your product. I already can’t wait to drop some major challenges in the laps of our two incoming Fall interns to see what they can build.
Want to be handed a major portion of Khan Academy ownership, ridiculously high expectations, and a bunch of mentorship from our full-time devs? Sign up now. We believe anybody can help the world get a great education, and we accept interns year-round.

Big shoes to fill.