The design and validation of an automatically-scored constructed-response item type for measuring graphical representation skill