Video is becoming increasingly accessed, captured, and published on the Web, from different platforms and devices. Users can easily georeference the information they capture and access, allowing to enrich their contextualization. But video search has been limited to keywords, or a set of parameters, providing limited support for temporal and spatial dimensions. We propose novel ways to search and access georeferenced videos, where these dimensions are of central importance, especially by video trajectories shape and speed, and by time, using a multimodal interactive mobile interface, involving gestures and movement, with the potential for more natural interactions, increased engagement, sense of presence and immersion. The evaluation based on high-fidelity prototypes had positive results. Users found most features useful, satisfactory, sometimes fun, and easy to use. Different options and modalities were found interesting and adequate for different use scenarios that could be identified and suggested, and some concerns and challenges were identified to be taken into account in future developments, towards more flexible and effective interactive content access, through more natural interaction with mobile devices on their own or as second screens to a larger screen on TV or public displays.