The company created a dataset it called Project CodeNet, which contains 14 million samples with a total volume of 500 million
The authors used code from two Japaneseprogramming competitions: Aizu and AtCoder. Participants were required to write the code needed to turn a given set of inputs into a set of desired outputs for 4,000 different problems. The result was 14 million code samples, half of which were working.
IBM plans to make the project a standard dataset for training AI models that can recognize the structure of programs.
CodeNet is planned to be used to createintelligent development tools that search for the necessary procedures in applications and libraries, translate from one programming language to another, choose the right implementations and filter out the wrong ones, classify the code, and so on.
Read more
Nuclear reactions intensified in the reactor of the Chernobyl nuclear power plant
Scientists have shown how a black hole tears apart a star
Physicists have created an analogue of a black hole and confirmed Hawking's theory. Where it leads?