Yandex Scale 2020: from data marketplace to hybrid clouds

Solution for creating hybrid data clouds

The Gazprom Neft company and the Yandex.Cloud platform presented

solution for creating hybrid data clouds.With its help, government organizations and private companies will be able to implement Yandex.Cloud technologies, even if they are subject to technical or regulatory restrictions on the use of public data storage and processing services.

  • How will it work?

The user of the IT solution receives fulla ready-to-use hardware and software complex created on the basis of Yandex.Cloud equipment and technologies. The complex is installed in the customer’s data center and becomes part of his own platform for working with data. Further services — prevention, configuration, updating and other routine maintenance— are carried out according to Yandex.Cloud standards.

  • How will the company's hardware work with the new product?

The new IT product integrates with localcorporate data storage and processing systems and public cloud services. The result is a hybrid cloud that expands the capabilities of the company's digital infrastructure, allows you to set higher security standards and take into account other specifics of a specific user.

During approbation, specialists from GazpromOil ”took part in the development of requirements for the hybrid solution and actually became a partner in the design of its architecture. Therefore, our clients will receive not only a completely ready-to-use hardware and software complex based on Yandex.Cloud technologies, but also a service for supporting and regularly updating the product, taking into account the development of our public platform.

Oleg Koverznev, Chief Operating Officer, Yandex.Cloud

  • What's next planned?

Further development of technology partnershipbetween Gazprom Neft and the Yandex.Cloud platform will allow testing a new digital solution in various industrial operation scenarios. Gazprom Neft is considering the possibility of using a hybrid cloud to develop its computing cluster and developments in the field of artificial intelligence, which are used to search for new oil reserves and remotely control technological operations for its production. Also, additional capacity can be used to improve the efficiency of balancing workloads on the production IT infrastructure.

Every year our factories and oil fields createabout 5 PB of new data. We not only store this information, but make decisions on a daily basis based on its analysis, use it in modeling technological processes and training neural networks. The peculiarity of work in remote regions of Siberia and the Arctic requires large distributed computing power, which would be regulated from one place. The creation of a hybrid cloud solves this problem and provides information security. The product will become a convenient service for industrial companies and other market representatives.

Andrey Belevtsev, director of digital transformation at Gazprom Neft

Four new services for storing and managing data

Category of services for storage and managementdata became one of the fastest growing areas of the Yandex.Cloud platform in 2020, outpacing the growth dynamics of the services of the traditional leader - virtual machine rental. With the help of cloud data management services, companies, regardless of size and field of activity, solve problems such as data storage and processing, analytics and visualization.

The services of manageddatabases (Managed Databases) - since the beginning of 2020, the consumption of these services on the Yandex.Cloud platform has tripled, and the number of databases created by users has exceeded 10,000.

  • What happened before?

In our platform Yandex.Cloud was already available by clicking on almost all of the most popular storage and data processing solutions on the market. We've made significant improvements to our data migration scenarios this year. To do this, we added two new services that simplify the transfer of data between any sources - Managed Service for Kafka, and a specialized solution for transferring data between databases - Data Transfer. In the segment of general-purpose databases, the cloud has become even more accessible for Enterprise users with the new Managed Service for SQL Server. Managed Service for Elastic Search, a popular solution that supports full-text search and ad hoc analytics scenarios, has appeared in the family of analytical solutions.

Alexey Bashkeev, head of Yandex.Cloud platform

  • What's coming up now?

From September 23 to Yandex users.Cloud now has access to the new Data Transfer service, which allows you to transfer data between DBMSs without stopping applications, regardless of where they are deployed. Data Transfer helps you quickly and safely migrate databases from other cloud platforms or local databases to Yandex.Cloud managed database services. You can also use Data Transfer to move data between different databases on the platform and set up backups.

Also, the Managed service was released.Kafka is a system for streaming data to analytical systems. The service of search and data analysis ElasticSearch and one of the most popular commercial database management systems in the world for work in the ecosystem of Microsoft SQL Server products have been added. In total, 9 managed database services are now available on the Yandex.Cloud platform, covering most scenarios for storing and processing data.

New marketplace section with data for business analytics

In the Yandex.Cloud now has a section called "Geolayers", which contains anonymous geoinformation data from service partners. They can post their content for free or for a fee. Yandex.Cloud clients will access the necessary data in the service interface, analyze it using Yandex DataLens and use it for business purposes. For example, users can assess potential demand for products or prospects for opening outlets, plan supply chain expansions or marketing campaigns.

Content from two partners is already available in the section -“Center for Spatial Research” on a commercial basis and Rosstat on a free basis, as well as two examples of geoanalytics from Yandex - “Audience: interests and social demographics” and “Organizations: supply and demand”. With DataLens, users can combine and analyze all data presets to make business decisions.

We strive to give partners not just technology,and tools for monetizing any intellectual product that can be made or improved using the cloud. Separately, I would like to note that all the content of the "Geolayers" section is exclusively aggregated and anonymized data, for which it is impossible to obtain details for a specific device or user.

Oleg Koverznev, Chief Operating Officer Yandex.Cloud

Who acted as a partner of Yandex.Cloud?

Analytics is created in collaboration withRosstat. The cooperation agreement involves joint work on the preparation of various open data packages. Vital statistics and other demographic information are already available.

Rosstat publishes largearrays of data available to any user. Thanks to cooperation with Yandex.Cloud, they will gain additional value for researchers, analysts and businesses. The platform's tools allow you to combine sources, make quick analysis, build visual visualizations and, as a result, make quick strategic decisions.

Pavel Smelov, Deputy Head of Rosstat

Center for Spatial Research providedgeographic information data on the population, households in new buildings, as well as business potential indices for various areas. In addition, a dashboard for monitoring network trade in the Russian Federation is available free of charge, which has been running since 2015 and contains an index of the region’s development from the point of view of federal retail players.

According to the general director of the Center for SpatialDenis Strukov's research, analytical tools, indices and expertise, combined with the capabilities of DataLens, is a timely response to the request of the B2B and B2C markets. This, he said, is a cloud version of location intelligence, which has recently been developing abroad, and now in Russia.

General access to the Yandex marketplace.Cloud opened in 2019. This is a platform where Cloud clients can directly receive access to business applications from developers and publishers. Today, 47 applications and services are available: from operating systems to information security tools for genetic analysis.

Yandex.Cloud opens public access to new service for development based on machine learning DataSphere

Yandex.Cloud provides general access to the service for machine learning developers Yandex DataSphere. The service helps companies and individual developers reduce the cost of creating and operating machine learning models, automatically manage the amount and type of computing resources, and reduce the time lost for creating and organizing a development environment. Yandex DataSphere will be openly available from October 1.

Why is this relevant?

Companies' global spending on artificialintelligence is predicted to double over the next four years from $ 50 billion in 2020 to $ 110 billion in 2024. At the end of 2019, Russian companies spent on AI amounted to $ 172 million with a forecast of 30% growth annually. Many Russian companies are already actively using machine learning solutions. For example, in medicine for creating solutions for image analysis, in retail for developing recommendation systems.

Machine learning techniques are becoming more and morepopular business tool around the world. But for many companies it is still not available due to the high entry threshold and the cost of the required computing resources. To solve these problems, we created DataSphere, where you can get a ready-made ML environment at the click of a button. Different types of computing resources are available in it - from classic capacities to GPUs and distributed computing, and billing occurs only for the server power actually consumed while performing your tasks.

Alexey Bashkeev, head Yandex.Cloud platforms

What's new with Yandex.Cloud?

  • Serverless computing technology for developing machine learning models.

The technology automates resource management andallows for significant savings. In DataSphere, when editing and viewing code, the computing resources of the CPU or GPU are not used; a virtual machine of the required type is connected only for the duration of the actual calculations (model training, launch, other calculations).

As a result, the user only pays foractually consumed computing resource. Time for editing and reviewing code, work of a virtual machine that is not accidentally turned off is not charged. According to DataSphere testing, which involved 200 users from various fields, the downtime of computing power when developing machine learning is 50-70%. When using the product, this time will not be charged.

  • Implemented seamless switching between different types of computing resources.

This means that within one training scenariomodel, the user can use different types of virtual machines - economical with conventional processors (CPUs) and faster with GPUs (graphics accelerators). The model's training progress will remain the same. In most cloud-based machine learning development environments, the training model can only be computed on one type of machine.

  • Save versions of model calculations, including data, code, and states.

This feature makes the process of developing machine learning more profitable for business: the progress achieved in training is not lost, it can be reproduced if necessary.

Yandex SpeechKit Pro will help make voice robots smarter and more human

Yandex platform.Cloud introduced a specialization of the SpeechKit service - Yandex SpeechKit Pro. This is a program for development companies, whose participants will have access to new tools for creating robots and voice assistants focused on working in a specific industry or company.

Such robots will be able to recognize words and commandson a specific topic with the highest level of accuracy. The new tools will help to dramatically improve service scenarios in the bank, healthcare and delivery sectors. SpeechKit Pro also allows you to create individual features of a voice robot: intonation and manner of communication.

Why is this relevant?

By 2020, speech synthesis and recognition have becomethe most popular ML service on the Yandex.Cloud platform. Since the beginning of the year, SpeechKit consumption has grown by 120%. The number of active projects has exceeded 500. In Russia, an ecosystem of developers and integrators of solutions has already formed, who, at the request of companies from various fields, create and implement voice robots to help process incoming and outgoing calls, voice control systems in applications and customer service terminals, and analysis solutions efficiency of business communications.

Today there are more than 20 companies, most of themof whom are regular partners of the Yandex.Cloud platform. According to partners, over the past two years, the main motives for introducing voice robots in Russian companies have been cost reduction and rapid scaling of solutions.

Together with our partners, we went through a greatway, having made speech technologies from an exotic service an applied business tool in two years. Now we are taking the next step and opening up a new level of Yandex speech technologies for partners. Development companies will have access to the advanced features of SpeechKit, and solution customers will be able to choose the supplier with the most appropriate expertise.

Alexey Bashkeev, head of Yandex.Cloud platform

How will the solution adapt to different business tasks?

Together with the business interest in the possibilities of speechtechnologies, requirements for recognition accuracy in specific scenarios of interaction between voice robots and humans have also grown, the ability to quickly adapt developments for new tasks.

For example, for a shipping companyIt is fundamentally important that the robot does not get confused in evaluating the meaning of the phrases "Transfer the order" or "Enter the order", and for telecommunications companies - that it distinguishes between the phrases "Enable service" and "Disable service" without errors. The priority of a business is accuracy in its area, the ability to develop application experience in a specific business scenario based on objective indicators.

To solve these problems, Yandex.Cloud provides partners with additional development tools within the SpeechKit Pro specialization. Now partner companies will be able to use audio data tagging, train individual speech recognition models using customer data, monitor speech recognition quality metrics, and adapt recognition models to a specific data stream.

Specialization SpeechKit Pro has already been received by Neuro.net, Just.ai, Aviation Communication Technologies, Naumen, Robovoice and Voximplant.

Yandex.Cloud platform expanded service ecosystem with serverless computing technologies

Yandex.Cloud has expanded its service ecosystem with proprietary serverless computing technologies. To the four services announced in 2019, two more were added - Yandex API Gateway and Yandex Database in Serverless mode.

What does it mean?

Serverless computing frees companies fromcosts for solving problems of allocating and configuring cloud infrastructure: virtual machines, cloud database servers and applications. These tasks are now performed automatically on the Yandex.Cloud platform side.

When using serverless technologiesThe computing platform automatically detects, for example, an increase in the number of user requests to the company's application and allocates the resources necessary for stable operation. As soon as the load on the application decreases, the amount of power used to operate it decreases. This allows users to switch to a new payment principle for Russian clouds - only based on actual consumption of services - and achieve significant savings.

Yandex.Cloud is the first cloud platform in Russia to offer a complete serverless computing ecosystem. We have collected the most necessary data storage and processing technologies for solving urgent business problems and made them available in a serverless mode. Serverless computing is the next phase in cloud computing around the world. This is an opportunity to reduce costs by up to 90%, speed up the time to create and implement new solutions, increase application resilience during periods of peak loads, free companies from the tasks of scaling the service.

Alexey Bashkeev, head of Yandex.Cloud platform

How will it work?

The Yandex.Cloud serverless technology ecosystem now includes six services: Yandex Object Storageuniversal scalable solution fordata storage; API Gateway - a service for creating and managing APIs; running code in the form of functions - Yandex Cloud Functions; fault-tolerant database management system - Yandex Database;a universal scalable solution for exchanging messages between applications - Yandex Message Queue and the Internet of Things service Yandex IoT Core.

For all serverless computing ecosystem servicesDuring the first year (until October 2021), Yandex.Cloud has special tariffs that allow users to create and host their services for free, not exceeding a certain load level.

Read also

In the era of ecosystems: how IT giants are turning into interfaces of our everyday life

The Doomsday glacier turned out to be more dangerous than scientists thought. We tell the main thing

GitHub has replaced the term "master" with a neutral equivalent