sparkSpark’s primary abstraction is a distributed collection of items called a Dataset. Datasets can be created from Hadoop InputFormats (such as HDFS files) or by transforming other Datasets.SPARK* is an outdoor community space, home to some of York’s most exciting start-ups working in food, retail, arts and social enterprise.