Dragon NaturallySpeaking 15

Screenshot anime Overlord, season 3, episode 8, last scene

Why is there a Dragon here? For speaking, naturally! Dragon NaturallySpeaking is the world’s premier speech recognition software, now with Deep Learning Artificial Intelligence that adjusts to your accent and the common cold. Fire breathing not included.

Today I upgraded (in a manner of speaking) from Dragon NaturallySpeaking version 13 Professional Individual to Dragon NaturallySpeaking version 15 Home. I virtually never used the more advanced features of the earlier version.

The most important part for me is accuracy of recognition, and I have to say that version 15 is almost indistinguishable from magic in that regard. And I mean right out of the box: There is no longer even an option to train the program by reading a text for it. Version 13 was pretty good after training and a few days of practice. Version 15 is that good right out of the box. (At least I believe it doesn’t have access to my previous training, as it required me to uninstall the previous version and reboot the computer before I was allowed to install the newest version.)

I have used and reviewed many different versions of Dragon NaturallySpeaking over the years, both before and after it was acquired by Nuance. There has definitely been progress! I believe the first version I reviewed was either six or seven, and I generously compared it to a homesick Asian high school exchange student. I could probably have added seasick as well, as its performance was unimpressive, to say the least. If you had functioning hands, you were better off using those, even if you typed with one finger.

Those days are definitely gone! Dragon NaturallySpeaking 15 takes dictation like a highly trained secretary, only faster. Actually, Dragon has outpaced secretaries for at least a couple of versions now, but that required you to speak clearly and train the program first. And the results were less impressive for me, with my strong Scandinavian accent. Actually, “accent” might be too weak a word. If you are familiar with the computer game “Skyrim”, the pronunciation of the Nord bandits in that game is pretty close to how I speak in real life. I am not sure how a highly trained secretary would handle that, but Dragon NaturallySpeaking 15 has well over 99% accuracy, right out of the box, with that kind of foreign accent.

***

There are still some challenges. In my experience they are not too bad, but I see a lot of one-star reviews on Amazon. Most notably, Dragon is squeamish about working with applications it doesn’t know. Supposedly this includes earlier versions of Microsoft Office. When I started writing in LibreOffice, Dragon NaturallySpeaking automatically popped up the “Dictation Box”, where you can dictate and edit your text before transferring it to the target application. It’s an okay solution in my opinion, but it can be distracting, and you cannot interact directly with the target program using your voice, for instance “click file save”, the way you can in supported programs. Removing the checkmark for automatically opening the Dictation Box lets me dictate directly in LibreOffice, but Dragon still struggles with commands there, and you cannot edit the text with Dragon after you dictate it.

I have the same problem with my favorite browser, Vivaldi. Admittedly that is not very common browser, So I installed the Dragon Web extension For chrome.As you can see from the previous sentence, that didn’t work too well, and it doesn’t work too well in Google Chrome either. Luckily I have fingers, and so Dictation Box it is. But Google Chrome is by far the most popular browser for Windows, and not having native support for that makes the program seem rushed, at best. Especially when you consider that Dragon NaturallySpeaking is a very expensive program. It is not so bad by Norwegian standards, since both salaries and living expenses here are already very high. Even so, I only buy Dragon NaturallySpeaking when it is discounted, as it was in this case. In the USA, a single person could eat for a month for this much money, and in the actual developing world even more. So in that perspective, you would expect a more polished product than this.

But what it does well is take dictation. And at that, it is the best in the world. No software and no human can match it for the combination of speed, accuracy, and fast learning.

Writing with Grammarly

Screenshot anime Amanchu

If you struggle to express yourself and put your thoughts into words, Grammarly might be a prized companion. For someone like me, who has at times struggled to stop putting my thoughts into words, it is just a curiosity.

I love living in the future, and I particularly enjoy all the new tools and toys and combinations thereof. In the latter category is Grammarly, an app/service that promises to watch over all your online writing and then some. (There is also a Windows app that can be used to write or proofread texts that are not meant to be shared online.)

One potential problem comes to mind immediately: What if your writing falls into the wrong hands? We are not just talking about your love letters finding the wrong audience, or the manuscript for your new book suddenly appearing under a competitor’s name. Any app that reads your writing could, in theory, also harvest passwords, credit card numbers and such. It was, therefore, an easy decision for me not to be among the early adopters of this software. But years have gone by, and there has only been one scandal, which turned out to be overblown and had nothing to do with passwords and such. So as of today, I have Grammarly on my writing machine.

Grammarly promises to discover both spelling and grammar errors. The built-in text editor in Vivaldi (and Chrome) also catches spelling mistakes, but not grammar mistakes. (In the previous sentence, Grammarly wants to change “catches” to “catch”, presumably because Vivaldi and Chrome are two browsers. Unlike you and me, it cannot see past the “and” to realize that the subject of the sentence is the text editor. Artificial intelligence is still no match for natural stupidity, as the saying goes.) Luckily you can tell Grammarly to ignore such a find, much like in Microsoft Office. Actually, in my experience, Microsoft Office is even worse at parsing grammar. But if you do all your writing in Office, you may not feel motivated to convince two grammar checkers that they are wrong and you are right.

Back in the good old days when I lovingly crafted my journal by hand in Notepad or some other pure text editor, it was common for me to find spelling errors when I read through my entry one year later. (Back then I linked to the year-ago entry because I wrote virtually every day.) When I read through them two or even three years later, it was not uncommon for me to find more errors. This is a human tendency: We read what we meant to write, not what we typed.

At this point in my entry, Grammarly has found one spelling error (I misplaced an “i” in Artificial) and two grammar “errors” that were not actually errors. It also disagreed with my comma usage in three cases, which I gracefully conceded, albeit with some doubt. So I am probably not in the target group for paying customers. If you want to try it for yourself, you can go to grammarly.com or just wait for one of the innumerable ads with which they flood the Internet.

Dragon Professional Individual 15

Dragon from video game Skyrim

No need to shout, the Dragon understands my Nordic dialect right away!

Over the years, I have made a habit of reviewing the various versions of Dragon NaturallySpeaking. Lately, Nuance has stopped using the phrase NaturallySpeaking in most contexts, but it is still the same product, and it is now up to version 15.

As the software has become more expensive again, and as it is already good enough for my limited use, I have started skipping some versions. Dragon version 13 was already good enough that I did not really expect it to get any better. Impressively, Dragon version 15 is actually noticeably better right out of the box.

Dragon version 15 uses a new “deep learning” technology similar to what is used in the most successful artificial intelligence projects. Dragon has always (or at least for as long as I have used it) had the ability to improve based on feedback from the user, as well as adapt its vocabulary and writing style by reading through documents. While these options still exist, there is less focus on them now as Dragon quietly adjusts in the background during everyday use.

Dragon has also clearly had some opportunity to acquaint itself with human speech in general before shipping to the customer: The product is amazingly accurate right out of the box. Longtime readers (if any) may remember that I compared some of the early versions to homesick exchange students from other continents. That time is long gone. Dragon version 15 understands even my “Skyrim” pronunciation of English. (I grew up in Norway in the 1960s, where even the English teachers had rarely if ever been to England, let alone America or Australia.)

There is one problem that has dogged this software from the start, and it still remains, even if just barely. When we speak, we don’t actually pronounce periods at the end of the sentence; rather, we slightly change the tone of our pronunciation toward the end, typically speaking less forcefully. Conversely, we don’t actually pronounce a capital character at the beginning of a sentence; instead, we pronounce the first sound slightly differently from the rest. Ideally, speech recognition software might be able to use this to take dictation without requiring us to specify punctuation. Dragon NaturallySpeaking used to have this functionality, but I gave up on it pretty quickly. What actually happens is that even when I dictate punctuation, there is a slight increase in mistakes at the very beginning and end of the sentences. This is especially true if I don’t pronounce some form of punctuation at the end of my string of words, for instance because I run out of breath during a long sentence. I have to say, however, that this problem has been almost eradicated in the latest version of Dragon.

To me, recognition accuracy is by far the most important part of any speech recognition engine. But Dragon 15 also has some other features in addition to the improved accuracy. It has better support for various modern software, and it allows voice-activated macros. (I believe this feature was also in version 13, but I did not use it then and I don’t use it now. In any case, functions like “insert signature” should be part of your email software, rather than your speech recognition software.) Also, the big, unnecessarily helpful sidebar with examples no longer starts up by default. It used to, and it also used to permanently displace any windows that happened to be in its way.

As usual, I am including a paragraph where I don’t in any way correct this transcription. This is that paragraph. (It may not be obvious to the reader, but that should be “the transcription” in the first line above.) Dragon used to be available in a few languages besides English; I am pretty sure I saw touch at some point, and Japanese? I can’t find any trace of that now, but I will admit that I have not looked very carefully.

Not too bad, huh? That should of course not be “touch” in the previous paragraph, but rather Dutch, the language in the Netherlands. (It actually got it right this time without correction. Go figure.)

MS Windows troubles

Screenshot anime Kanojo ga Flag o Oraretara

This morning was absolutely crawling with chaos. It started as I turned on my home office computer, which had installed updates at 3AM and restarted itself, as it frequently does. It seems like a good idea, to install updates while you sleep. After all, you would not want to miss the latest security patches and improved functionality.

Unfortunately, the new functionality was that I could not log in. Whether I picked my usual account or the betatester account I use for testing games, there was just a brief pause and then Windows returned me to the login screen. No error message. I restarted the computer and tried again. I did various things and tried again and again. No change. I restarted in Safe Mode. Same problem. I restored Windows to the last good configuration. Still the same.

I installed Ubuntu Linux, which is a pretty good alternative to Windows for most people, and free. After a little while I switched to Xubuntu (it is really just a different setup; the core is the same as Ubuntu, but Xubuntu is more similar to old Windows versions). Ubuntu is free, like most Linux versions. I used to install it on old laptops when they became too slow under Windows. This is less of a problem these days, but it was a big deal back in the days of Windows Vista.

Xubuntu is nice enough, but there were a couple of problems. I had used this machine to provide Internet access to my cabled home network, which includes a Windows 10 machine for playing games, a NAS (home server) for backup and sharing files, and a small old notebook computer for uploading and downloading to and from the NAS without taking up resources on the main machines. But now I could not get Linux to share the Internet. It should be easy, really; there is a choice for it, “Shared with other computers”. But that setting only lasted for a minute or so before I got the message “Disconnected from Ethernet”. (Ethernet is the cabled network, to put it simply.) I did various things and restarted numerous times to no avail.

Eventually I found a USB wireless receiver and connected it to the Windows 10 machine, then told it to share its Internet. This worked well enough, except the NAS (Network-Attached Storage) server did not show up. After changing the workgroup name by editing a configuration file, I got it to show up. But as soon as I tried to copy a file to it, it hung and showed up as empty until I logged off and logged on again. This repeated itself for as long as I bothered trying.

I was kind of in a hurry to continue working on my National Novel Writing Month story. Luckily it was saved on a disk I could access from Xubuntu. I copied it to a USB drive, in case I wanted to continue writing on it on the other Windows computer (the gaming computer). I installed WINE, a program that lets you run Windows programs in Linux. I had already read a few years ago that you could run yWriter in Linux this way. (yWriter is the program I use for writing novels. It is written by a programmer and novelist and fits my working style exactly.) It did work when started with WINE, and it found my novel in progress, but the spell check did not work and it did not recognize the names and locations. I downloaded the dictionary and manually copied it to the place where it should be. Now it worked, except it did not recognize capitalized words, such as at the start of every sentence.

Somewhere around this time I decided to reinstall Windows on one of the disks. (I am keeping Xubuntu on the other.) This took the rest of the evening and will continue into the next day or two or more.

Needless to say, there was no progress on the novel this day. But then again, contrary to the slogan of National Novel Writing Month, the world does not really need my novel. Probably.

The Dragon Upgraded

Screenshot anime YowaPeda

I feel like I can go anywhere… With Dragon NaturallySpeaking Premium, you can dictate anywhere using a USB microphone, wireless microphone, smartphone or dictation device. (Even so, I don’t recommend dictating while biking!)

In the past, Dragon NaturallySpeaking has been available in several different versions, and I have always used the cheapest one, Dragon NaturallySpeaking Home. It usually cost around $100, with the occasional big sale where you might buy it at half price. As an existing customer, I could also upgrade to the next version at half price from the start. Last time, two years ago, I did just that; I even preordered it.

This time, there was no question of preordering. Either they didn’t ask me, or I missed it somehow. My first hint that there was a new version available came from a mail offering to let me upgrade to Dragon NaturallySpeaking 13 Premium for €99. A bit more, but then the Premium version has some unnecessary but nifty features. So instead of being my usual cheapskate self, I went for the Premium version this time. It was already available for download; there was a link in the mail to the website where I could buy it. I checked the requirements and looked for any traps, but there didn’t seem to be anything suspicious. So I bought it with my credit card and could immediately start downloading.

The installation was easy and trouble-free, although it took some time. I first downloaded a small installer program, which then downloaded the big installer program, which then unpacked to a separate folder, which then installed the program in the default location. It may sound a bit complicated, but it was mostly just pressing the “next” button, although I had to choose a directory for the temporary files. I saved them to the network drive in case I have to reinstall on this machine or another. I would also recommend using an external disk for the temporary folder if you have limited disk space, since at some point there will be three big files and folders taking up space simultaneously: The big installer, the folder with the unpacked files, and the actual installation in your Program Files folder. If you have a reasonably new computer, this would probably not be a problem.

Speaking of new and old computers, the two latest versions have each reduced the computing requirements, so version 13 actually runs faster on a weak computer than version 11 did. Good work!

After installation, Dragon offered to upgrade my existing user profiles. This took surprisingly long, even for the profile that was almost empty. Several times I wondered if it had crashed, but I didn’t need to use it immediately, so I just checked in on it from time to time, and eventually it completed. If you don’t have an earlier version of Dragon, you will be asked to create a new user profile instead. I believe that in this case, you will also be offered the chance to train the program to recognize your voice and improve its accuracy. At least that is what happened in the earlier versions. It may be that it is now so good right out of the box that they don’t bother with that anymore.

As far as I remember, version 12 looked very much like version 11. Version 13 has a whole new visual profile, so it is obvious at a glance that you are running the new version. The DragonBar, usually placed at the top of the screen, is now just a small button when not in use. If you move your mouse to it, it expands to become larger than it was in the previous version, and the microphone on/off button also becomes much larger. The “Learning Center” (formerly Dragon Sidebar) still takes up the margin of the screen, but it now has a black and white color scheme and also seems to have larger letters. As always, you can minimize or remove this Learning Center if you don’t want (context dependent) hints about what you can do next. Even the DragonBar itself can be minimized to the system tray, and you can access the most common functions by shortcut keys or by voice commands. But that has always been the case, I just wanted to mention it.

As I mentioned in the previous entry, the first thing I noticed when trying Dragon NaturallySpeaking 13 was the leap in accuracy. I realize that I have praised its leaps in accuracy since at least version 10, but this time the difference seemed to me bigger than the official count of 15% improvement. 15% improvement does not seem a lot when the accuracy is already claimed to be about 99%. To me, it seems more like it has increased from 99% to 99.5%, which would actually be a doubling of the accuracy in the sense that there would be half as many errors. But I admit that in my case this could be because of an improvement in the handling of USB headsets.

(It is unfortunate that I cannot maintain this level of accuracy for longer texts, because my voice becomes hoarse after a few minutes. But this is an affliction that I share with very few humans. One hypothesis is that it comes from my years of almost complete silence, when I only asked a few questions at work and did not speak at all in my free time. If I take breaks and drink a little water between paragraphs, I can continue for longer.)

The premium version contains some features not found in the home version. For instance, you can now make the program read back your own voice, not just a synthetic text to speech rendition of the text. You can also use a smartphone as a microphone, or even use recording devices and have the program transcribe them later. There is supposedly also an option to create your own voice commands, basically macros, but I haven’t tested that yet.

In conclusion, Dragon NaturallySpeaking 13 is awesome. You can actually speak naturally to it, and with very little training it will put your words on the screen and let you control Windows. Upgrading from version 12 seems to make a big difference for me, but your mileage may vary. Upgrading from Home to Premium is probably not a priority unless you have a USB microphone or some other unorthodox input device, but it adds some fun new features.

(As usual when writing about dictation software, I have dictated this entry in its entirety, except for a few minor corrections.)

Dragon NaturallySpeaking 13

Squeeing girls from anime Gekkan Shoujo Nozaki-kun

This is how I think my readers should react when I write about Dragon NaturallySpeaking speech recognition from Nuance. Somehow that never seems to happen. Let me try again; it has been two years since last time.

I love living in the future. And one of the more futuristic things that I have is the speech recognition software for Windows, Dragon NaturallySpeaking. (Windows also has its own built-in speech recognition, but for those who can afford it, Dragon is definitely the one hardest to distinguish from magic.)

Today I got a mail from Nuance, offering to upgrade my Dragon NaturallySpeaking 12 Home to Dragon NaturallySpeaking 13 Premium for €99. I immediately grabbed the chance, just as I have done every time there was an upgrade for the last five years at least. Was it worth it? Well, to paraphrase a friend of mine, €99 is a lot of money if you don’t have it. This is obviously not a product for the working classes of the developing world, but for a Norwegian office worker the amount is trivial, barely noticeable against the high salaries and the high prices up here. And for me at least the effect of the upgrade was dramatic.

According to their website, version 13 is 15% more accurate than version 12 right out of the box. Evidently this has either crossed some kind of threshold in my case, or there was some bug in the version 12 Home in relation to my Plantronics USB headset. The USB headset worked very poorly with the previous version on my laptop (although it had worked reasonably well on the desktop with version 11). So when I wanted to dictate, I had to take off my USB headset and put on an analog headset for the duration, and even then the accuracy was at most marginally better than in version 11. Today after the upgrade, I can use my USB headset again, and what’s more: The accuracy is more than 99%. It still makes mistakes, but less so than my fingers. (And I have been typing for almost 50 years now.)

Back when I wrote about an early version of Dragon NaturallySpeaking here in the Chaos Node, it had only entertainment value for me, although I realized it could be useful for people who could no longer use their arms at all. Some years later a newer version helped save me from disability when my job caused a serious case of repetitive strain injury. At that time it still made quite a few mistakes, but at least I could correct them with my voice. Since then it has improved even more, and I have given it pretty good reviews each time. But let me tell you something: For me, version 13 is a giant leap.

It still makes mistakes, but so few mistakes that I risk overlooking them in the middle of all the perfect text. We are talking about perhaps one error for each paragraph on the first day. The software gets used to the sound of your voice and your writing style and also learns from all the errors you correct, so it gets better the more you use it. So to pull off this level of accuracy with almost no training is impressive indeed.

For those of you who are still here instead of being busy buying it, my next entry will get into some more detail about the installation and differences from the previous version.

(As usual when writing about dictation software, I have used the program to dictate this entry, except for a couple of minor corrections.)

Self-spaced repetition software

Screenshot Sims 3

Games are usually the enemy of studying, but some elements of games can be used to improve learning.

Spaced repetition is an amazing technique for learning without understanding. Understanding is certainly superior and in a league of its own, but it is hard to command or even predict; and sometimes you need to memorize for a while before you can understand. One may compare this to gathering ingredients before you can cook. Before you can read sentences, you must know the words, and so on.

Spaced repetition takes advantage of a particular memory effect: The best time to repeat something is just before you forget it. Repeating many times in a row adds little after the first couple of repetitions. Repeating at fixed intervals helps, but the effect is stronger when the intervals increase. You may for instance double the intervals, which works better than keeping them fixed. The best results come when one actively recalls a fact just as it is about to be forgotten: It should take a little effort to remember it, but one should still be able to do it.

Since our brain is not under constant surveillance, we are the only ones who can know when that moment has come. Conventional Spaced Repetition Software (SRS) starts with a standard model of human memory, then adjusts the intervals down if people keep forgetting, or up if they remember everything. Good SRS lets the user mark how hard each item was to remember. But there is an even better solution, at least in some ways, and the Duolingo online language site has found it.
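
To make that concrete, here is a minimal sketch in Python of the general idea (my own invented names and numbers, not the algorithm of any particular program): the interval grows after each successful recall and resets when you forget.

    from datetime import date, timedelta

    def next_review(interval_days, remembered, growth=2.0):
        """Toy spaced-repetition step: lengthen the interval when the item
        was recalled, reset it when it was forgotten."""
        if remembered:
            interval_days = max(1, round(interval_days * growth))  # 1, 2, 4, 8, ... days
        else:
            interval_days = 1  # forgotten: start almost from scratch
        return interval_days, date.today() + timedelta(days=interval_days)

    # An item recalled three times in a row, then forgotten once:
    interval = 1
    for outcome in (True, True, True, False):
        interval, due = next_review(interval, outcome)
        print(interval, due)

A real program would also tune the growth factor for each user and each item, which is roughly what the adjustment of the “standard model” above amounts to.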

When you start a study unit in Duolingo, the “learning meter” is empty. As you learn, it fills up until it is full. But as time passes afterwards, the learning meter begins to slide down toward empty again. You can see this for each topic, and for each “workbook” in a topic. (This is a session that typically takes 10-15 minutes to go through.) You can then go back and test yourself.
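
I do not know what formula Duolingo actually uses, but a crude sketch of what such a meter might be doing behind the scenes could look like this (the half-life number is pure guesswork on my part):

    import math

    def strength(days_since_review, half_life_days=7.0):
        """Memory 'strength' that is 1.0 right after a session and decays
        toward 0 as time passes; the bars in the interface could be drawn
        directly from a value like this."""
        return math.exp(-math.log(2) * days_since_review / half_life_days)

    for days in (0, 3, 7, 14):
        print(days, round(strength(days), 2))  # 1.0, 0.74, 0.5, 0.25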

If you go back and run the test while you remember everything, you will certainly get the learning meter back to full again, but the experience will be rather boring. Duolingo is a very game-like learning system, where you have to translate back and forth, listen to sentences in the target language, describe pictures or pick from multiple choices. You never know what the next question will be. But if you know everything by heart, it is not very exciting. It is like winning chess against a small child.

On the other hand, if you wait too long and have forgotten the words or phrases, you will be thrown out after three mistakes and have to start that workbook over until you get it right. That is not too much fun either, even though it only costs you a few minutes.

The result is that the users themselves find out how long to wait to get the most rewarding “game” experience. If it was too easy, you learn to wait longer. If it was too hard, you make sure to return earlier next time. There is no need for the software to know whether you learn fast or slowly. All it needs to do is reward you when you get the balance right, and it does so with an exciting learning experience and a feeling of winning against a worthy opponent: Your own forgetfulness. You pick your battles, so with a little experience you pick the best time yourself.

I am not sure how easy this is to translate into other forms of learning, but I think it may be easy with anything that requires memorization of facts. Geography and history come to mind. Make small, focused units and a table where one can see which ones begin to slip. Adjust the speed at which they slide depending on past performance. It can probably be done better than it is in Duolingo, but the principle works amazingly well. Humans are very good at learning things when having fun.

Memrise vs Anki: place yer bets

Instead of doubling the amount of time I spend studying, I am trying to double the precision. Although you can learn almost anything by repeating it 7×70 times, the best time is just as you are about to forget. Anything before or after is less effective. But how do you know when you are forgetting if you don’t remember it?

I have written quite a bit over the past month about Memrise, a free Web resource for memorizing facts, vocabularies etc. It combines two of the most powerful techniques for rote learning: Spaced repetition and mnemonics. Spaced repetition tries to make you recall the fact just before you forget it, as this causes maximum learning with minimum effort. Mnemonics try to associate random facts with something that is easier to remember. This is obviously most effective if you do it yourself, but that can be frustrating. Memrise uses associations volunteered by users, and you can add your own.

I rather like this approach, and the way you can study at your own pace. Unfortunately, most of the time I remember 66% at best rather than the 90% that is the goal of spaced repetition. This was also the problem with the two previous SRS programs I used, AnyMemo and Mnemosyne. (Spaced Repetition Software is SRS business!) So I am testing another free program, Anki, which has a good reputation among self-study amateur linguists. I am not too optimistic, though. Now that this is my fourth attempt, I may have to accept that it is I who am too old for the programs that fit most people. It is the same with physical exercise, after all, but there I can set my own pace. And that’s the thing.

What I really miss is a dial or lever I can set, so the software reminds me after e.g. 90% of the time it thinks is right. Clearly the programs all overestimate my memory for random words. Of course, it would probably have helped if the material was not so random, if it was at least somewhat related to my ordinary life. But that’s not what I need it for. I would really like something that was adjustable to me, rather than the other way around. It is kind of discouraging to have forgotten a third or more of the words when it is time to review them. It is also bad for learning – the “memory traces” in the brain weaken more quickly after the ideal recall time, or so I’ve read. So ironically, I would probably even spend less time reviewing if I had that “confidence dial”.

Anki does not have that, but it does have levels in the answers. Instead of just checking for itself whether you got it right, it asks whether it was hard, good or easy. The ideal is good, which is when you remember it with a little effort. If you had to think long and hard, it goes easier on you with that word or fact next time, in the form of asking you earlier. If you say it was too easy, it waits longer. And if you don’t get it right at all, it shows it again very quickly. So that sounds like an improvement.
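
Something along these lines captures the flavor of those graded answers (a simplified sketch with made-up numbers, not Anki’s actual algorithm; the confidence factor at the end is the kind of personal “dial” I wished for above, not an Anki feature):

    def reschedule(interval_days, grade, ease=2.5, confidence=1.0):
        """Toy graded rescheduling: 'again' resets the card, 'hard' barely
        lengthens the wait, 'good' gives normal growth, 'easy' stretches it
        further. `confidence` scales the result down (or up) globally."""
        if grade == "again":
            interval_days = 1
        elif grade == "hard":
            interval_days = round(interval_days * 1.2)
        elif grade == "good":
            interval_days = round(interval_days * ease)
        elif grade == "easy":
            interval_days = round(interval_days * ease * 1.3)
        return max(1, round(interval_days * confidence))

    print(reschedule(4, "good"))                  # 10 days
    print(reschedule(4, "hard"))                  # 5 days
    print(reschedule(4, "good", confidence=0.9))  # 9 days

The essential point is that the four grades nudge the next interval in different directions, instead of leaving everything to the program’s model of my memory.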

On the other hand, I liked the suggestions for memorizing words, and I liked the way Memrise used different forms of multiple-choice questions in the early phase of learning a new word, then gave more and more options and eventually required you to write the answer. It also requires writing when reviewing, which involves more of the brain and makes it harder to fool yourself (“well, I got it ALMOST right!”).

I have picked up Anki and installed it on my PC and my Galaxy Note 2. (Unlike Memrise, which runs as a website but requires some advanced browser features and so can’t be used on my mobile devices.) Anki is also easily synchronized between two (or even more) devices. There are a lot of premade vocabularies and other data sets, and it pleases me to see that a lot of them are for studying Japanese. I downloaded a fairly small one that is mostly tangential to what I have already learned, and am testing it now.

Unfortunately there are obvious errors in the dataset I am testing, although small ones. Occasionally a romaji (western character) is used in a word written with katakana. Beyond that, I have already seen one obvious misspelling in the Japanese and another in the English text. The Japanese is written in a font that is like an uglier Japanese version of Comic Sans. I hope this is a feature of that particular set and not of Anki! It is quite hard to read after the very legible font on my Windows machine, not to mention the downright beautiful hiragana font on the PC running Ubuntu Linux.

Apart from that, it seems nice enough. With the mobile app I can study on the bus, during breaks at work, even while a game is loading. OK, not for long, since I have a fast machine. But still, very handy. And I like its approach: If I don’t recognize a word, Anki shows it again after a minute. Once I recognize it, the interval increases to 10 minutes, then a day. I inserted 1 hour between those; the system lets you add steps like that. Then it goes up to 4 days and so on; I am not sure how far it goes. The most important part is of course whether I actually learn the words. I will have to come back to that. But if it turns out to wait too long, like all the rest, I will try to choose “hard” instead of “good” even when I remember, and see if that fixes it.

I really hope I won’t have to write my own. There are already quite a number of these. There’s Supermemo, the original and possibly best, if you can live with complicated. And there’s at least one other that I forgot the name of. I do that a lot, forget names. Although I don’t always remember doing it.

Talk to your toaster

I also used to be excited about the future, but now that I live here, I take it for granted.

NaNoWriMo – National Novel Writing Month – is approaching once again. (“The month formerly known as November”, as I like to call it.) The forums for 2012 are up and running, and in the technology section there is, as usual, a thread dedicated to speech recognition, or more specifically Dragon NaturallySpeaking. (I would not mind a more general thread, since Windows also comes with speech recognition built in. Hopefully we can have more threads later.)

One thing I wanted to say early on was that it is not enough to be able to use speech recognition in a technical sense. The next challenge is to be able to tell a story to the computer. This is a very different thing, especially for us who have been writers for many years and are used to thinking with our fingers. It also doesn’t help to have been a grown-up for many years, during which you have not been able to tell long, obviously made-up stories to people without them looking at you very strangely. I suppose there are some families in which this problem does not exist, but I am not sure whether it is a good thing or not… ^_^;

So I recommended that people start telling stories to their computers already now, all through October, so that they have gotten over that hurdle, that shyness or awkwardness of telling imaginary stories out loud to inanimate objects. In fact, I recommend practicing on the toaster as well, and with blatant nonsense. The purpose is not to deliver the Great American Novel to your amazed toaster, but to get yourself to accept the unreasonable fact that it is possible to tell stories to home equipment. Such are the times in which we live. I could not have made it up in a sci-fi novel. Magic fantasy, perhaps, just perhaps.

I ask you, gentle reader, to consider this: Not only do I occasionally talk to a machine without being insane (or more so than those who don’t). I also carry in my shirt pocket a telephone, my own library with dozens of books, a bookstore with millions more, thousands of newspapers from all over the world, millions of songs and an unknown number of movies, and enough cat pictures to last the craziest old cat lady for a lifetime.

You can probably add to this, but the point is: I do this almost every day without giving it a second thought. I don’t wake up each morning thinking: “Oh my God! I live in a miraculous, magical world filled with amazing wonders that I would not have believed were possible when I was a child – what should I do today to take advantage of this to the fullest?”

If I did, and if my conclusion was that I should start the day by talking to my kitchen equipment, that might not be the worst thing I have done in my life.

Chunks of memory

Your personality doesn’t matter. This is a skill you can learn.

Extreme feats of memory are possible when we remember large quantities of information as one unit, because we have spent so much time with it. We all do this.

I know I have written about this before, already back in the original Chaos Node, where I read about it in an issue of Scientific American. Recently I read about the same thing in the book Talent Is Overrated. For instance, chess grandmasters could glance at a chess board and later reconstruct it exactly, something no normal person could do. From another ancient board game, Go (or Igo), I know that high-level players not only remember every move of a match, but can often guess how a match has progressed even if they arrive late into it, possibly even at the end. For someone unfamiliar with the game, this seems like magic. And yet we all do the same thing.

Neurotypical humans store incredible amounts of information about the people around them. Not only can they recognize a friend at a glance after several years, they can also keep track of the relationships between everyone in a village: Who are second cousins with who, who are friends, who are enemies, who are in love with who and who were in love with their current enemies years ago. Nobody finds this remarkable in the least, but it is really amazing.

Likewise we are very good at categorizing things. Or dogegorizing, I guess: Even children can usually tell cats and dogs apart, although small children have trouble with really small dogs, which may be labeled cats. Even though there are so many different dogs and breeds of dogs, people have this internal concept “dog” which kind of sums up the essential elements of doghood and which they remember as a unit, even after many years.

In the same way, if you grew up with your mother, when you think “my mother” you actually remember thousands of things, from how she looked at various ages to details of her behavior and relationships. You don’t consciously think of all these details every time you see her name, but if someone were to say something untrue about her, you would recognize it immediately.

In other words, all of us have the ability to remember very complex things as 1 unit.

Our short-term memory is very limited; usually we are able to remember around 7 units of information at the same time. The actual number may vary from 5 to 9 and can be increased with rigorous training. It is the number of digits you can remember while walking from one room to another without repeating them in your mind. But if those digits are familiar, the number suddenly increases dramatically. For instance, to me the 6 digits 271258 count as 1 unit of information: It is my date of birth in the format used in this part of the world, ddmmyy. So I would be able to remember 6 more digits while leaving the room. Yes, strange as this may seem, I have an average short-term memory. I have tested this.

***

The computer language Forth caught my attention toward the end of high school. It was little more than a rumor back then, some new-fangled invention from the States. Personal computers were something hobbyists built themselves, and pitifully weak. A corporate mainframe at that time was perhaps a match for a smartphone today. OK, perhaps a little more. Let’s say a smartphone next year. But only a few years later, I had my own personal computer, weak though it was, and was programming in Forth.

This particular computer language had a peculiar structure. The basic language was very simple, consisting of a stack for data and a set of basic “words” that were coded in machine language, either directly or with an assembler. These were very simple commands which would be defined differently from computer to computer because of the hardware, but which (ideally) had the same names and functions on all computers. But this was not what fascinated me. Rather, you could define new words by combining the old ones. The new words could be used in the same way, combined into still more words. By keeping the definitions short and simple, you greatly diminished the risk of errors, and it was easy to test the new words right away. Yet there was no obvious limit to what you could do. There was very little overhead in having many levels of definitions.
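
To show what I mean, here is a toy sketch in Python (not real Forth, and the names are my own invention) of the core idea: a data stack plus a dictionary of words, where a new word is simply a list of existing words.

    stack = []

    def dup():
        stack.append(stack[-1])            # copy the top of the stack

    def mul():
        stack.append(stack.pop() * stack.pop())

    def add():
        stack.append(stack.pop() + stack.pop())

    def swap():
        a, b = stack.pop(), stack.pop()
        stack.extend([a, b])

    # The "primitive" words, standing in for Forth's machine-coded core.
    words = {"dup": dup, "*": mul, "+": add, "swap": swap}

    def define(name, definition):
        """Define a new word as a sequence of existing words (like Forth's ':')."""
        words[name] = definition.split()

    def execute(token):
        body = words.get(token)
        if body is None:
            stack.append(int(token))       # anything unknown is treated as a number
        elif callable(body):
            body()                         # a primitive word
        else:
            for part in body:              # a user-defined word: run its parts
                execute(part)

    def run(source):
        for token in source.split():
            execute(token)

    # Build higher-level words out of lower-level ones and test them at once.
    define("square", "dup *")
    define("sum-of-squares", "square swap square +")
    run("3 4 sum-of-squares")
    print(stack.pop())  # 25

The point of the exercise is in the last few lines: “sum-of-squares” is built from “square”, which is built from the primitives, and each layer can be tested the moment it is defined.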

The reason this appealed to me is that I am a verbal person. I think in a very similar way to this computer language, building new concepts from existing concepts. As long as I keep it simple, I can trust the knowledge I build from the basics, and I can test it.

***

When you spend a lot of time doing something, whether it is programming or chess or surgery, you acquire what is called “domain knowledge” within that area. And when this knowledge becomes a part of you, something as natural to you as cats and dogs and family and friends, you begin to be able to think of it in chunks. The chess player can remember every piece on the board because not only the pieces but also the possible configurations are familiar to him. He has seen them many times: When this particular group of pieces appears on this part of the board, it means certain risks and opportunities that are very real to him. He has no need to memorize this particular picture: He has seen it before, repeatedly, and it has meaning to him.

When I learned to read, I had to learn the alphabet like people did for generations before me. I hear that this is no longer considered very important; people start looking at words as pictures right away. But words still consist of letters, and sentences consist of words, paragraphs of sentences and so on. When you remember a poem or a particularly moving passage from a book, you don’t try to recall each individual letter in turn. Like the programming language, the “primitives” – the basic components – soon become buried in higher-level structures. Reading and writing are themselves everyday examples of structured knowledge. And as with the programming language, there is no obvious upper limit. Scholars will hold entire books conceptually in their minds – not word by word, probably, but still in a very real sense whole books – and compare them to arrive at a higher meaning from the way the books agree or disagree. If we were wiser and lived longer lives, who knows what we could achieve?

Humans, it seems to me, are not proportionate to the savanna or the shores from which the “naked ape” emerged, but rather proportionate to the infinite. As better men than I have noticed, the most incomprehensible thing about the universe may be that it is so comprehensible. At least now we know a little bit more about ourselves as well.