Microsoft announced a new query language called U-SQL to work with big data. The language was announced as a part of Microsoft’s new Azure Data Lake service. Azure Data Lake service is a new analytics service built on YARN that allows data scientists to analyze data. It works with Microsoft’s Azure HDInsight, a fully managed Hadoop, Spark, Storm and HBase service. U-SQL takes advantage of SQL query syntaxes.
Visual Studio blog team wrote a blog on the reason U-SQL was developed and what are the advantages of U-SQL.
“Taking the issues of both SQL-based and procedural languages into account, we designed U-SQL from the ground-up as an evolution of the declarative SQL language with native extensibility through user code written in C#. This unifies both paradigms, unifies structured, unstructured, and remote data processing, unifies the declarative and custom imperative coding experience, and unifies the experience around extending your language capabilities.
U-SQL is built on the learnings from Microsoft’s internal experience with SCOPE and existing languages such as T-SQL, ANSI SQL, and Hive. For example, we base our SQL and programming language integration and the execution and optimization framework for U-SQL on SCOPE, which currently runs hundred thousands of jobs each day internally. We also align the metadata system (databases, tables, etc.), the SQL syntax, and language semantics with T-SQL and ANSI SQL, the query languages most of our SQL Server customers are familiar with. And we use C# data types and the C# expression language so you can seamlessly write C# predicates and expressions inside SELECT statements and use C# to add your custom logic. Finally, we looked to Hive and other Big Data languages to identify patterns and data processing requirements and integrate them into our framework.”
U-SQL was developed to process any type of data. Not only U-SQL scales efficiently to any size of data but also allows data developers to use custom code easily to express your complex, often proprietary business algorithms.