News

By admin

March 12, 2024

Potential Polish version of GPT: successful AI collaboration between Gdansk University of Technology and OPI

[ad_1]

Gdańsk Tech and OPI developed a Polish generative model called Qra, trained on a data corpus containing only Polish text. Initially, the corpus used about 2TB of raw text data in total, but as a result of cleaning and deduplication processes, it was reduced by almost two times to maintain the best quality and unique content. This is the first generative model trained on such a large Polish text resource using significant computing power. In comparison, Llama, Mistral and GPT models are trained mainly on English data, with only a small part of their training corpus consisting of Polish data.

The most complex version of the model trained on STOS over a month

At the IT Competence Center STOS of the Gdansk University of Technology, one of the most modern IT centers in the region with the supercomputer Kraken, a computing environment dedicated to building artificial intelligence models was created. A cluster of 21 NVidia A100 80GB graphics cards was used in the process. The team spent about six months preparing the environment, creating tools and models, training them (based on content from fields such as law, technology, social sciences, biomedicine, religion, and sports), and testing them. Thanks to the extensive infrastructure available at STOS, the actual training process of the most complex models was shortened from several years to about one month.

Qra has a good command of Polish

As a result of the collaboration between Gdańsk University of Technology and OPI, the research team created three models of different complexity (Qra 1B, Qra 7B, and Qra 13B). Models Qra 7B and Qra 13B achieve significantly better incomprehensibility results than the original models Llama-2-7b-hf (Meta) and Mistral-7B-v0.1 (Mistral-AI), in terms of their ability to model the Polish language in terms of comprehension, lexical layers, or the grammar itself.

The perplexity measurement tests were performed on the first set of 10,000 sentences from the PolEval-2018 test set, as well as a further set of 5,000 long and demanding documents created in 2024.

Solutions that require better language understanding

The Qra model is the basis for IT solutions that handle issues and processes that require a deeper understanding of the Polish language.

At the moment, Qra is a basic language model that can generate grammatically and stylistically correct answers in Polish. The content produced is of very high quality, as can be seen especially by the confusion measure. The team is currently working on tuning the model and plans to validate its capabilities in text classification, summarization and question answering.

The developed model has been made publicly available in a dedicated OPI-Gdańsk Tech repository on the huggingface platform, where anyone can download it and adapt it to their own domain, problem or task, including providing answers.

[ad_2]

Source link

Share on:

Get a Quote

Fields marked with an asterisk (*) are required

Latest Blogs

Energy Audit Case Study: Premier Design Company in Grater Noida

November 14, 2024

Premier Design Company, located in the bustling industrial hub of Greater Noida, has established itself as a leader in innovative design solutions and sustainable practices. Founded with the vision of integrating cutting-edge technology with environmentally friendly methodologies, the company has garnered a reputation for excellence in various sectors, including architecture,…

Elion did successfully Harmonic testing of SPI machine for air…

June 12, 2024

Elion is a leading provider of power quality analysis and harmonic testing services for air conditioning manufacturers in Ahmedabad, Gujarat. The company has developed a unique testing method for SPI machines, which are commonly used in the manufacturing process of air conditioning units. The harmonic testing of SPI machines is…

Uncovering the Top 5 Challenges of Conducting an Energy Audit

December 22, 2024

Energy audits are a crucial tool for businesses and homeowners looking to reduce their energy consumption and save money on utility bills. An energy audit is a comprehensive assessment of a building’s energy use, identifying areas where energy is being wasted and providing recommendations for improvements. The goal of an…

Protecting Your Hearing: The Importance of OSHA Noise Exposure Monitoring

October 6, 2024

OSHA, or the Occupational Safety and Health Administration, is a federal agency that sets and enforces standards to ensure safe and healthy working conditions for employees. One of the areas that OSHA regulates is noise exposure in the workplace. OSHA has established regulations to protect workers from the harmful effects…

Elion Completed a HAZOP Review for a Petrol Pump Chain…

June 20, 2025

Elion, a prominent consultancy specializing in safety and risk management, has undertaken a comprehensive Hazard and Operability Study (HAZOP) for a petrol pump chain operating in Gurgaon, Haryana. This initiative is part of a broader effort to enhance safety protocols and operational efficiency within the fuel distribution sector. The petrol…

HAZOP Level 3 Demystified: Everything You Need to Know

May 7, 2024

HAZOP Level 3 is an advanced methodology used in the field of process safety management to identify and mitigate potential hazards in industrial processes. It involves a systematic and comprehensive analysis of the process, equipment, and operating procedures to ensure the safety of personnel, the environment, and the surrounding community.…

Case Study of Thermography at Mall in Lucknow Uttar Pradesh

April 15, 2024

In this case study, we will be exploring the use of thermography in building maintenance, specifically in the context of a shopping mall. Building maintenance is crucial for ensuring the safety and efficiency of any structure, and Thermography has emerged as a valuable tool in this field. By using thermal…

How Energy Audit Helps Universities Reduce Electricity Costs

February 19, 2026

Understanding how an energy audit can be a powerful tool for universities seeking to curb their electricity expenses is crucial in today’s financially stringent academic environment. Universities, with their vast campuses, diverse student populations, and round-the-clock operations, are significant consumers of electricity. From illuminating lecture halls and powering research laboratories…

Don’t Get Burned: How Arc Flash Safety Training Can Save…

September 26, 2024

Arc flash is a dangerous and potentially deadly electrical event that occurs when an electric current deviates from its intended path and travels through the air from one conductor to another. This can happen as a result of equipment failure, dust, corrosion, or other factors that create a conductive path.…

Dollar General Workers Rally at Tennessee Headquarters for Improved Safety…

May 30, 2024

[ad_1] GOODLETTSVILLE, Tenn. (WZTV) — More than 200 Dollar General employees and customers marched and protested at the company's annual shareholder meeting in Middle Tennessee on Wednesday, demanding safety in stores and higher wages. A delegation of workers from Step Up Louisiana, who have been organizing for safety in dollar…

By admin

March 12, 2024

Potential Polish version of GPT: successful AI collaboration between Gdansk University of Technology and OPI

The most complex version of the model trained on STOS over a month

Qra has a good command of Polish

Solutions that require better language understanding

Share on:

Get a Quote

Latest Blogs

Energy Audit Case Study: Premier Design Company in Grater Noida

Elion did successfully Harmonic testing of SPI machine for air…

Uncovering the Top 5 Challenges of Conducting an Energy Audit

Protecting Your Hearing: The Importance of OSHA Noise Exposure Monitoring

Elion Completed a HAZOP Review for a Petrol Pump Chain…

HAZOP Level 3 Demystified: Everything You Need to Know

Case Study of Thermography at Mall in Lucknow Uttar Pradesh

How Energy Audit Helps Universities Reduce Electricity Costs

Don’t Get Burned: How Arc Flash Safety Training Can Save…

Dollar General Workers Rally at Tennessee Headquarters for Improved Safety…

Related Links

Company

Services

Resources

Expertise

Legal

Connect

Menu

Filters