
Fine-tuning BERT model for arbitrarily long texts, Part 2
Author: Michał Brzozowski

This is part 2 of our series about fine-tuning BERT. If you want to read the first part, go to this link, and if you want to use the code, go to our GitHub.

Fine-tuning the pre-trained BERT on longer texts

Now it is time to address the elephant in the room of the previous approach. We were lucky to find a model already fine-tuned on our IMDB dataset. More often, however, we are in the less fortunate situation of having a labelled dataset and needing to fine-tune the classifier from