• Home
  • About
  • Archives
  • Colophon
  • Resume
  • Tags
  • Tumblr

zanshin.net

because not enough websites start with the letter “Z”

Feed on
Posts
Comments
Tumblr
« Sibelius Simplify Notation Plugin Solves Latency Issues
Backing up iTunes Purchases »

Learning Mode

May 25th, 2008 by mark

In the early 1990’s, when I was still using OS/2 as my primary operating system, I attended a day-long demonstration about the platform.  Included in the slate of presentations was one about voice recognition.  The presenter, an energetic woman, described how she was able to read and respond to hundreds of emails and Compuserve forum postings using dictation, or voice-to-text, software.  She gave us a quick demo of the software’s ability by dictating, “You were right to write to me right now, Mr. Wright.”  The software was able to correctly spell each variation of the phonetically identical right/write/Wright.

Of course, the software wasn’t able to do that for me when I loaded it on my computer at home.  First, I had to train the software to my speech patterns, inflections, and rhythms.  The training consisted of reading several passages of text, provided with the software, making corrections to the text on the screen as I went.  After perhaps an hour’s work, the voice recognition was nearly 100% accurate.  Through the training exercises the software was able to “learn” about my speaking habits, and accurately capture text I dictated.

In the years following that experience I have run across several other examples of “software that learns.”  The one that comes to mind most readily are the Bayesian junk mail filters common in email clients today.  By identifying mail you consider spam (or not spam) the filter learns how to sort your mail.  Most of these tools are very accurate, with only a few false positives or negatives.

Our experience with Sibelius this weekend has led me to think that music notation software could benefit from some kind of training mode, or learning mode.  Sibelius allows one to input music notation in three ways: computer keyboard, note-by-note (Steptime) using your MIDI input device, or dynamically (Flexitime) using your MIDI input device.  They explain in their literature that playing a piece of music accurately to a metronome is difficult.  That as humans we tend to vary the time of our playing ever so slightly.  Flexitime attempts to allow for this by adjusting the tempo to match your playing speed.  If you slow do, it slows down.  Unfortunately there are several points of failure between the musician and the notation algorithm, not the least of which is latency introduced by the MIDI interface, and perhaps more latency introduced by the sound system in the computer. (That experienced musicians can, and do, play accurately to a metronome is a discussion for another posting.)

If Sibelius had a “learning mode” that provided several short pieces of music for the musician to play using Flexitime, the notation algorithm could examine the captured results and compare them to the stored standard.  From this comparison the algorithm could “learn” about the latency characteristics of the computer, MIDI input device, and MIDI interface.  While it might not completely eliminate the kind of notation problems we have seen, I feel it could go a long ways towards reducing them.

Tags: learning, sibelius, software

Posted in nerdliness

One Response to “Learning Mode”

  1. on 27 May 2008 at 11:00 am1Michael Stauffer

    Hi,

    I came across your post here about Flexitime. Thought I’d plug my software, InTime, that follows realtime tempo changes more flexibly, and can record your performance and give you a time-corrected MIDI file for notation in Sibelius or other software (www.circular-logic.com). There’s no learning mode, but we’re also working on a version that will analyze recorded performances instead of working realtime, and that will implicitly work with some memory of your performance as it’s analyzed.

    Cheers,
    Michael



  • Welcome!

    Mark H. Nichols is an enterprise architect, martial artist, nerd, and all around good guy. Currently he works in Kansas City, and lives in the suburbs with his fiancée, three cats, a couple pianos, and nearly a dozen computers. You can read more about Mark, and this site, or explore the archives.
  • Pages

    • About
    • Archives
    • Colophon
    • Resume
    • Tags
    • Tumblr
  • Popular

    • Shotski's Ring (part I)
    • Shotski's Ring (part II)
    • 10 Mac Apps
    • What is Zanshin?
  • Categories

    • diversions
    • elsewhere
    • family
    • health
    • life
    • links
    • meme
    • nerdliness
    • photography
    • random
    • relationships
    • social issues
    • Uncategorized
  • Archives

    • Blogroll

      • Elfenbein Klaviermusik Notes
      • Shawn Blanc
      • Sibylle Kuder
  • Vote

    Obama 08
  • last.fm

    1. cd cover
    2. cd cover
    3. cd cover
    4. cd cover
    5. cd cover
    6. cd cover
  • Meta

    • Log in
    • Entries RSS
    • Comments RSS
    • WordPress.org

zanshin.net © 1996 - 2008 All Rights Reserved.

Policies | WordPress Themes | Web Hosting Blue Host