This might seem like a silly idea for lip syncing but it seems to work for me right now. I have a program like After Effects or Premiere open in the background. Really it could be any audio program.
Then I open Spine on top of it. I adjust my timeline windows to line up. Then I scale the timeline in both programs to be nearly the same size. So I can at least see the wave form while animating. Once you get going it's easy to toggle between apps to listen or start a new animation.
This has really made my work flow a lot faster. Sometimes I have to go back to the audio program to hear something to make sure I'm getting a motion right. It's nice having that visual there.
You can see at the bottom of the frame how my timelines are lined up.
Hope this helps.