# Inferring trueness from linearity – 1. Approach: Normalization

Written by Dr. Janet Thode on . Posted in Method validation In today's blog, we’d like to address a quite common problem during method validations and by using a practical example showing possibilities to solve it.

A SE-HPLC method used as a purity test of a drug shall be validated. It's known that impurities (e.g. aggregates and / or fragments of the product) may arise in low concentrations. In the chromatogram, you’ll see the main peak corresponding to the intact product and a few minor peaks corresponding to impurities, which will be reported as sum.

Let's have a look at the validation parameter trueness ("accuracy"). Unfortunately, neither impurities are separately available for spiking experiments nor independent alternative methods are existing. In this case, the validation guideline ICH Q2(R1) offers the following possibility for the evaluation of trueness: "Accuracy may be inferred once precision, linearity and specificity have been established" (4.1.1 c / 4.1.2 c).

That sounds interesting. Assuming the proof of specificity being no problem, we should be able to infer trueness from the linearity data and check if the results are also precise. This might be quite feasible.

So, let’s have a look at this using a concrete example. For the determination of linearity 5 concentrations were used (examination levels: 80%, 90%, 100%, 110%, 120% of the test concentration; for the sake of simplicity the real concentrations are omitted). Since these experiments were planned to additionally cover trueness, we have prepared 3 independent dilutions for each of these concentrations. After performing the experiments, we obtained the following data set regarding absolute peak areas:

 Absolute peak area [µAU*s] Examination level (%) Sum of minor peaks Main peak 120 99104 15949990 96153 15988085 97723 15879248 110 89383 14567078 87474 14592610 88355 14558575 100 77587 13275901 76162 13264078 78209 13285496 90 65802 11513839 67811 11937356 68012 12037029 80 54208 10489899 55611 10594228 56819 10683872

For the determination of linearity, the peak areas were plotted against the real concentrations for both the sum of minor peaks and the main peak, and regression lines each having a coefficient of determination of R2 > 0.99 were obtained (not shown here). Thus, the predefined acceptance criterion for linearity is met and linearity is clearly proven. Hence, we are allowed to use these data for the evaluation of trueness.

Before we turn to the evaluation of this data set, we should recap what we want to show. Trueness is defined as follows: „The accuracy of an analytical procedure expresses the closeness of agreement between the value which is accepted either as a conventional true value or an accepted reference value and the value found.“. This means that we’d like to examine how well a measured value corresponds to the real theoretical value. We’ve got the measured values, but how can we determine the theoretical values? For this, different approaches exist - today we’ll perform it applying normalization and in a future blog article, we will show another way.

## Hm, so how to normalize?

Applying normalization, we will be able to compare all obtained values with the 100% examination level value. For this purpose, each measured value is multiplied by 100 and then divided by its examination level (for example, first minor peak value: 100 * 99104 / 120 = 82587). Thus, we convert them all to 100% and our data set looks like this:

 Absolute peak area [µAU*s] Examination level (%) Normalized level (%) Sum of minor peaks Main peak (120) 100 82587 13291658 80127 13323404 81436 13232706 (110) 81257 13242798 79521 13266009 80322 13235068 (100) 77587 13275901 76162 13264078 78209 13285496 (90) 73114 12793154 75345 13263729 75569 13374477 (80) 67759 13112373 69513 13242785 71024 13354840

When looking at the numbers, we observe that at least the numbers of the main peak are all quite similar and that the ones of the minor peaks do not differ so much. But what is “do not differ so much”? Mathematically, this can be expressed by the relative standard deviation (RSD). We first calculate the mean and standard deviation (SD) and then divide the standard deviation by the mean and multiply by 100, which gives us the result in percent.

In our example, we obtain the following data:

 Sum of minor peaks Main peak Meanabsolute peak area [µAU*s] 76635 13237232 SDabsolute peak area [µAU*s] 4581 136803 RSD [%] 6.0 1.0

As we already can see when examining the numbers, the RSD of the main peak with 1% and that of the minor peaks with 6% is either very good or in the ballpark. So what does this mean, when we look back at the definition of trueness? It shows that our measured values, when we convert them to 100%, correspond very well to the theoretical one, and thus our measurement is really very true. So, we can use the relative standard deviation of normalized values as an acceptance criterion for the determination of trueness based on linearity data. Telling general limits to be used as acceptance criteria is not possible. They are based on the permissible error and the maximum possible trueness of the method. In our example, however, we would be fine with an acceptance criterion of RSD ≤ 10% for the sum of minor peaks and of RSD ≤ 2% for the main peak ;-)

## There was something else... - what about precision?

Yes, right, the ICH Q2(R1) additionally stated that precision should also be considered: "Accuracy may be inferred once precision, linearity and specificity have been established".

So, using once more our data set, we now take a look at the relative peak areas (in area%) instead of the absolute ones. Usually, they are already evaluated by chromatography system’s software. Alternatively, of course, we could also calculate them on our own, by calculating for each peak its percentage in relation to the sum of all peaks. In our example it looks like this:

 Relative peak area [area%] Examination level (%) Sum of minor peaks Main peak 120 0.6175 99.3825 0.5978 99.4022 0.6116 99.3884 110 0.6099 99.3901 0.5959 99.4041 0.6032 99.3968 100 0.5810 99.4190 0.5709 99.4291 0.5852 99.4148 90 0.5683 99.4317 0.5648 99.4352 0.5618 99.4382 80 0.5141 99.4859 0.5222 99.4778 0.5290 99.4710

Theoretically, we expect that when setting peak areas in relation to each other, they should always be the same regardless of the dilution and are thus only influenced by precision. Therefore, we again determine the relative standard deviation:

 Sum of minor peaks Main peak Meanrelative peak area [area%] 0.5756 99.4244 SDrelative peak area [area%] 0.0330 0.0330 RSD [%] 5.73 0.03

We realize that the thesis above is quite true for the main peak, however, for the sum of minor peaks a dilution effect is seen (at 120%, the mean is about 0.61 area%, at 100% it’s 0.58 area% and at 80% it’s only 0.52 area%). Accordingly, the precision at these very low amounts is not as fantastic as the one of the predominant main peak. Depending on which acceptance criteria were chosen, however, this can still be completely maintainable.

In summary, we can note that we can infer trueness from linearity data (absolute peak areas) by normalizing to 100% and considering the relative deviation thereof and by evaluating precision using relative peak areas, when specificity is given (as we have just assumed here).