Maybe I'm naive, but I personally find it much simpler just to see how you can construct an arbitrary waveform using the summation of a series of sinusoids - and then a Fourier transform is just the inverse operation...
I think the point of the explanation was to provide some intuition as to where the Exp[- 2 pi i k n / N] in the formula comes from. If it's easier to think of that as a sinusoid then your way of thinking about it is better. If it's easier for you to think of that as spinning your signal around like a phasor then the OP's explanation is useful.
I agree with you. I know the Fourier transfom well from a discrete signals class but that explanation didn't make any sense to me. What is the motivation to spin the signal at 3k?