Theories#

The goal of all existing z-factor correlation models is to numerically represent the famous Standing-Katz (SK) chart, which correlates the pseudo-reduced properties, reduced pressure (\(P_{r}\)) and reduced temperature (\(T_{r}\)), to the real gas compressibility factor \(Z\). In other words, calculating the z-factor requires values of \(P_{r}\) and \(T_{r}\).

In real-life applications, the exact \(P_{r}\) and \(T_{r}\) values of a gas mixture are rarely known. This is where pseudo-critical property models, such as Sutton (1985) [1] and Piper et al. (1993) [2], come in handy: they approximate \(P_{r}\) and \(T_{r}\) from the gas specific gravity (\(\gamma_g\)), which is relatively easy to obtain from lab sample analysis.

This section explains the basic theories behind \(P_{r}\) and \(T_{r}\) correlation from specific gravity and the subsequent z-factor correlation from the computed \(P_{r}\) and \(T_{r}\).

Figure 1: Left is the original SK chart, and the right is the numerical representation of the SK chart using the Dranchuk and Abu-Kassem (DAK) model [3].

1. Pseudo-Critical Property Models#

The z-factor can be derived from \(P_{r}\) and \(T_{r}\) through visual inspection of the SK chart or through numerical computation using various z-factor correlation models. In other words, the z-factor is a function of the pseudo-reduced pressure and temperature:

\[Z = f(P_{r}, T_{r})\]

\(P_{r}\) and \(T_{r}\) are defined as the pressure and temperature divided by the mixture’s pseudo-critical pressure (\(P_{pc}\)) and temperature (\(T_{pc}\)):

\[P_{r} = \frac{P}{P_{pc}}, ~~~~~~~ T_{r} = \frac{T}{T_{pc}}\]
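
As a quick illustration of these definitions, the sketch below computes \(P_{r}\) and \(T_{r}\) for one set of conditions. The function name, the °F input with a 459.67 offset to °R, and the pseudo-critical values in the usage line are assumptions made for this example, not part of the library.

```python
def reduced_properties(P, T_degF, Ppc, Tpc):
    """Return (Pr, Tr) for pressure P [psia] and temperature T_degF [°F],
    given pseudo-critical pressure Ppc [psia] and temperature Tpc [°R]."""
    T_degR = T_degF + 459.67  # the temperature ratio needs an absolute scale
    return P / Ppc, T_degR / Tpc


# Placeholder pseudo-critical values, chosen only for illustration
Pr, Tr = reduced_properties(P=2000, T_degF=75, Ppc=660.0, Tpc=380.0)
print(round(Pr, 3), round(Tr, 3))
```

Both ratios are dimensionless, so the only requirement is that numerator and denominator use the same absolute units.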

Kay (1936) [4] stated that \(P_{pc}\) and \(T_{pc}\) of a gas mixture can be expressed as the mole fraction (\(x\)) weighted average of the critical pressure (\(P_c\)) and temperature (\(T_c\)) of the mixture’s individual components (\(i\)):

\[P_{pc}=\sum x_{i} P_{c_{i}}, ~~~~~~~T_{pc}=\sum x_{i} T_{c_{i}}\]
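
A minimal sketch of Kay’s mixing rule is shown below, assuming a three-component gas. The critical properties in the table are approximate handbook values included only for illustration, and the function and dictionary names are hypothetical.

```python
# Approximate critical properties as (Pc [psia], Tc [°R]); illustrative values only
CRITICAL_PROPS = {
    "methane": (666.4, 343.0),
    "ethane": (706.5, 549.6),
    "propane": (616.0, 665.7),
}


def kay_pseudo_critical(mole_fractions):
    """Mole-fraction-weighted average of the component critical properties."""
    total = sum(mole_fractions.values())
    if abs(total - 1.0) > 1e-6:
        raise ValueError("mole fractions must sum to 1")
    Ppc = sum(x * CRITICAL_PROPS[name][0] for name, x in mole_fractions.items())
    Tpc = sum(x * CRITICAL_PROPS[name][1] for name, x in mole_fractions.items())
    return Ppc, Tpc


Ppc, Tpc = kay_pseudo_critical({"methane": 0.85, "ethane": 0.10, "propane": 0.05})
print(round(Ppc, 2), round(Tpc, 2))  # about 667.89 psia and 379.80 °R
```

Every component needs its own table entry and mole fraction.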

However, this method is inconvenient because it requires manually entering each component’s \(P_c\), \(T_c\), and \(x\), which can be time-consuming. Furthermore, it is not practical for oil field applications, in which the “heavy ends” are often lumped together in the lab analysis and reported as \(C_{6}^{+}\) or \(C_{7}^{+}\). This makes it impossible to know the mole fractions of the components heavier than \(C_{6}\) or \(C_{7}\).

This section introduces various pseudo-critical property models that correlate a gas mixture’s specific gravity (\(\gamma_{g}\)) to its corresponding \(P_{pc}\) and \(T_{pc}\).

1.1. Sutton (1985)#

Sutton (1985) [1] fitted the following regression model, which takes \(\gamma_{g}\) as input, for gas mixtures of unknown composition:

\[P_{pc} = 756.8 - 131.07\gamma_{g} - 3.6\gamma^{2}_{g}\]

\[T_{pc} = 169.2 + 349.5\gamma_{g} - 74.0\gamma^{2}_{g}\]
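
The regression can be sketched in a few lines of Python. The function name is hypothetical and this is not the library’s implementation. The linear term of \(T_{pc}\) is positive here; with a negative term the pseudo-critical temperature would come out below zero for typical gravities.

```python
def sutton_pseudo_critical(sg):
    """Sutton (1985) pseudo-critical pressure [psia] and temperature [°R]
    from gas specific gravity, without any acid-gas correction."""
    Ppc = 756.8 - 131.07 * sg - 3.6 * sg**2
    Tpc = 169.2 + 349.5 * sg - 74.0 * sg**2
    return Ppc, Tpc


Ppc, Tpc = sutton_pseudo_critical(0.7)
print(round(Ppc, 2), round(Tpc, 2))  # about 663.29 psia and 377.59 °R
```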

The above correlations are valid over the range of specific gravities with which Sutton worked: \(0.57 < \gamma_{g} < 1.68\). He also recommends applying the Wichert-Aziz [5] correction for significant \(H_2S\) and \(CO_2\) fractions:

\[\epsilon = 120 (A^{0.9} - A^{1.6}) + 15(B^{0.5} - B^{4})\]

\[T_{pc}^{'} = T_{pc} - \epsilon\]

\[P_{pc}^{'} = \frac{P_{pc}T_{pc}^{'}}{T_{pc} + B(1 - B)\epsilon}\]

where:

\(\epsilon\) = temperature-correction factor for acid gases [°R]

\(A\) = sum of the mole fractions of \(CO_2\) and \(H_2S\) in the gas mixture [dimensionless]

\(B\) = mole fraction of \(H_2S\) in the gas mixture [dimensionless]

\(T^{'}_{pc}\) = corrected pseudo-critical temperature [°R]

\(P^{'}_{pc}\) = corrected pseudo-critical pressure [psia]

The correction correlation is applicable to concentration ranges of \(CO_2 < 54.4 \space mol\)% and \(H_2S < 73.8 \space mol\)%. Using the Dranchuk and Abu-Kassem (DAK) method [3] as a z-factor correlation model, Sutton’s correlation model reported an average absolute error of 1.418%. The regression coefficients were fitted with 289 points.
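
The temperature part of the correction can be checked by hand with the sketch below. The function name is hypothetical, and the uncorrected \(T_{pc}\) of 377.59 °R is Sutton’s value for \(\gamma_{g}\) = 0.7. The pressure correction is left out for brevity.

```python
def wichert_aziz_temperature(Tpc, CO2=0.0, H2S=0.0):
    """Return (epsilon, corrected Tpc), both in °R, for acid-gas mole fractions."""
    A = CO2 + H2S  # combined mole fraction of CO2 and H2S
    B = H2S        # mole fraction of H2S alone
    eps = 120 * (A**0.9 - A**1.6) + 15 * (B**0.5 - B**4)
    return eps, Tpc - eps


# Sutton's uncorrected Tpc for sg = 0.7: 169.2 + 349.5*0.7 - 74.0*0.7**2 = 377.59 °R
eps, Tpc_corrected = wichert_aziz_temperature(Tpc=377.59, CO2=0.1, H2S=0.07)
Tr = (75 + 459.67) / Tpc_corrected  # 75 °F converted to °R
print(round(eps, 2), round(Tr, 4))  # about 21.28 °R and 1.5006
```

The resulting \(T_r\) of roughly 1.5006 agrees with the `Sutton().calc_Tr(sg=0.7, T=75, CO2=0.1, H2S=0.07)` output shown in the Caveats section.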

1.2. Piper et al. (1993)#

Piper et al. (1993) [2] adapted the method of Stewart et al. (1959) [6] to calculate the pseudo-critical properties of gas mixtures with nitrogen (\(N_2\)), \(CO_2\), and \(H_2S\) fractions:

\[T_{pc} = \frac{K^{2}}{J}, ~~~~~~~P_{pc} = \frac{T_{pc}}{J}\]

and

\[\begin{split}\begin{align} J &= 0.11582 - 0.45820 x_{H_2S}\left(\frac{T_c}{P_c}\right)_{H_2S} - 0.90348 x_{CO_2}\left(\frac{T_c}{P_c}\right)_{CO_2} - 0.66026 x_{N_2}\left(\frac{T_c}{P_c}\right)_{N_2} \\ &\qquad + 0.70729\gamma_{g} - 0.099397 \gamma^{2}_{g}\\ \\ K &= 3.8216 - 0.06534 x_{H_2S}\left(\frac{T_c}{\sqrt{P_c}}\right)_{H_2S} - 0.42113 x_{CO_2}\left(\frac{T_c}{\sqrt{P_c}}\right)_{CO_2} - 0.91249 x_{N_2}\left(\frac{T_c}{\sqrt{P_c}}\right)_{N_2} \\ &\qquad + 17.438\gamma_{g} - 3.2191 \gamma^{2}_{g}\\ \end{align}\end{split}\]

where:

\(J\) = Stewart, Burkhardt, and Voo (SBV) parameter [°R/psia]

\(K\) = SBV parameter [°R/psia^0.5]

\(x_{H_2S}\) = mole fraction of \(H_2S\) [dimensionless]

\(x_{CO_2}\) = mole fraction of \(CO_2\) [dimensionless]

\(x_{N_2}\) = mole fraction of \(N_2\) [dimensionless]

Piper’s correction for non-hydrocarbon impurities has working ranges of \(H_2S < 51.37 \space mol\)%, \(CO_2 < 67.16 \space mol\)%, and \(N_2 < 15.68 \space mol\)%. Using the DAK method [3] as a z-factor correlation model, Piper’s correlation model reported an average absolute error of 1.304%. The regression coefficients were fitted with 896 points.
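
The \(J\) and \(K\) expressions translate directly into code, as in the sketch below. The function name is hypothetical, and the critical constants for \(H_2S\), \(CO_2\), and \(N_2\) are approximate handbook values assumed for this example; the library may use slightly different constants.

```python
import math

# Assumed critical properties of the impurities as (Tc [°R], Pc [psia])
CRITICAL = {
    "H2S": (672.35, 1306.0),
    "CO2": (547.58, 1071.0),
    "N2": (227.16, 493.1),
}
J_COEF = {"H2S": 0.45820, "CO2": 0.90348, "N2": 0.66026}
K_COEF = {"H2S": 0.06534, "CO2": 0.42113, "N2": 0.91249}


def piper_pseudo_critical(sg, H2S=0.0, CO2=0.0, N2=0.0):
    """Piper et al. (1993) pseudo-critical pressure [psia] and temperature [°R]."""
    x = {"H2S": H2S, "CO2": CO2, "N2": N2}
    J = 0.11582 + 0.70729 * sg - 0.099397 * sg**2
    K = 3.8216 + 17.438 * sg - 3.2191 * sg**2
    for name, (Tc, Pc) in CRITICAL.items():
        J -= J_COEF[name] * x[name] * Tc / Pc
        K -= K_COEF[name] * x[name] * Tc / math.sqrt(Pc)
    Tpc = K**2 / J
    Ppc = Tpc / J
    return Ppc, Tpc


Ppc, Tpc = piper_pseudo_critical(sg=0.7, H2S=0.07, CO2=0.1, N2=0.1)
Tr = (75 + 459.67) / Tpc  # 75 °F converted to °R
print(round(Tpc, 1), round(Tr, 3))  # about 345.3 °R and 1.548
```

With these assumed constants, \(T_r\) lands close to the `Piper().calc_Tr(sg=0.7, T=75, CO2=0.1, H2S=0.07, N2=0.1)` output shown in the Caveats section.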

1.3. Caveats#

1) The models work only for “naturally occurring” hydrocarbon gases

The models implemented in this library correlate \(\gamma_{g}\) to the corresponding \(P_{pc}\) and \(T_{pc}\) by using fitted regression coefficients. This means that the working range of each model is limited by the range of the data points used to fit the coefficients. All pseudo-critical models (that I know of) were developed using only naturally occurring gas samples. Therefore, it is not recommended to use these models for synthetic gases. If you are dealing with synthetic gases, I recommend using Kay’s (1936) [4] method.

2) Correction is necessary in the presence of significant impurity fractions

Sutton’s method (1985) [1] can apply correction for \(H_{2}S\) and \(CO_2\):

>>> from gascompressibility.pseudocritical import Sutton
>>>
>>> Sutton().calc_Tr(sg=0.7, T=75, CO2=0.1, H2S=0.07)
1.5005661019949397

Piper’s method (1993) [2] can apply correction for \(H_{2}S\), \(CO_2\), and \(N_2\):

>>> from gascompressibility.pseudocritical import Piper
>>>
>>> Piper().calc_Tr(sg=0.7, T=75, CO2=0.1, H2S=0.07, N2=0.1)
1.5483056093175225

3. What models should I use?#

Short Answer: For the z-factor correlation model, use zmodel='londono'. If computational cost is a big concern, use zmodel='kareem' for \(P_r < 15\). For the pseudo-critical property model, use pmodel='sutton'. If you have significant nitrogen fractions, use pmodel='piper'.

3.1. Working Ranges of Z-Models#

The table below summarizes the working \(P_r\) and \(T_r\) ranges of each model, according to its original paper.

Model	\(T_r\)	\(P_r\)
DAK	[1, 3]	[0.2, 30]
Hall-Yarborough	[1.15, 3]	(0, 20.5]
Londono	[1, 3]	[0.2, 30]
Kareem	[1.15, 3]	[0.2, 15]
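
The table can be encoded as a small lookup for checking whether a given \(P_r\) and \(T_r\) pair falls inside a model’s working range. This is only a sketch: the helper names are hypothetical, and the library itself may or may not perform such a check.

```python
# Working ranges from the table above as (low, high, low_inclusive)
WORKING_RANGES = {
    "DAK": {"Tr": (1.0, 3.0, True), "Pr": (0.2, 30.0, True)},
    "hall_yarborough": {"Tr": (1.15, 3.0, True), "Pr": (0.0, 20.5, False)},
    "londono": {"Tr": (1.0, 3.0, True), "Pr": (0.2, 30.0, True)},
    "kareem": {"Tr": (1.15, 3.0, True), "Pr": (0.2, 15.0, True)},
}


def _inside(value, bounds):
    """Check one value against (low, high, low_inclusive); high is inclusive."""
    low, high, low_inclusive = bounds
    above_low = value >= low if low_inclusive else value > low
    return above_low and value <= high


def in_working_range(zmodel, Pr, Tr):
    """True if both Pr and Tr lie inside the tabulated range of the model."""
    ranges = WORKING_RANGES[zmodel]
    return _inside(Tr, ranges["Tr"]) and _inside(Pr, ranges["Pr"])


print(in_working_range("kareem", Pr=18.0, Tr=1.5))   # False: Pr is above 15
print(in_working_range("londono", Pr=18.0, Tr=1.5))  # True
```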


However, we normally don’t know the \(P_r\) and \(T_r\) values of a given mixture. The figure below shows the \(P_r\) and \(T_r\) values (computed with Sutton’s method) that correspond to a range of specific gravities, temperatures, and pressures. For example, assuming \(\gamma_{g}\) = 0.9 (green lines), the z-factor correlations can’t be used for extreme conditions like \(P\) > 19,000 psia or \(T\) > 800 °F. If Kareem’s method (zmodel='kareem') is used for speed, it can’t be used for \(P\) > 9,500 psia, where \(P_r\) exceeds 15.
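
The pressure and temperature limits quoted for a given specific gravity follow from multiplying the \(P_r\) and \(T_r\) bounds by Sutton’s \(P_{pc}\) and \(T_{pc}\). The sketch below shows the arithmetic, assuming no acid-gas correction; the function name and default bounds are choices made for this example.

```python
def sutton_upper_limits(sg, pr_max=30.0, tr_max=3.0):
    """Convert upper Pr/Tr bounds into pressure [psia] and temperature [°F]
    limits, using Sutton's (1985) uncorrected pseudo-critical properties."""
    Ppc = 756.8 - 131.07 * sg - 3.6 * sg**2
    Tpc = 169.2 + 349.5 * sg - 74.0 * sg**2
    P_max = pr_max * Ppc
    T_max_degF = tr_max * Tpc - 459.67  # convert °R back to °F
    return P_max, T_max_degF


P_max, T_max = sutton_upper_limits(sg=0.9)
print(round(P_max), round(T_max))  # about 19078 psia and 812 °F
```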

Figure source code

import matplotlib.pyplot as plt
import numpy as np
from gascompressibility.pseudocritical import Sutton

pmin = 0
pmax = 25000
Ps = np.linspace(pmin, pmax, 100)
Ps = np.array([round(P, 1) for P in Ps])

tmin = -459
tmax = 1500
Ts = np.linspace(tmin, tmax, 100)
Ts = np.array([round(T, 1) for T in Ts])

sgs = np.arange(0.1, 2.6, 0.4)
sgs = np.array([round(sg, 1) for sg in sgs])

results = {sg: {
    'Pr': np.array([]),
    'P': np.array([]),
    'Tr': np.array([]),
    'T': np.array([]),
} for sg in sgs}

for sg in sgs:
    for P in Ps:
        Pr = Sutton().calc_Pr(sg=sg, P=P)
        results[sg]['P'] = np.append(results[sg]['P'], [P], axis=0)
        results[sg]['Pr'] = np.append(results[sg]['Pr'], [Pr], axis=0)
    for T in Ts:
        Tr = Sutton().calc_Tr(sg=sg, T=T)
        results[sg]['T'] = np.append(results[sg]['T'], [T], axis=0)
        results[sg]['Tr'] = np.append(results[sg]['Tr'], [Tr], axis=0)

fig, axes = plt.subplots(1, 2, figsize=(9, 4))
for i, ax in enumerate(axes):
    if i == 0:
        for sg in sgs:
            Prs = results[sg]['Pr']
            Ps = results[sg]['P']

            p = ax.plot(Ps, Prs, label=sg)

            t = ax.text(Ps[-10], max(Prs) - 3, 'sg = ' + str(sg), color=p[0].get_color())
            t.set_bbox(dict(facecolor='white', alpha=0.7, edgecolor='white', pad=1))

        ax.text(0.06, 0.9, '$P_{r}$  approximation', fontsize=9, transform=ax.transAxes,
            bbox=dict(facecolor='white'))
        ax.set_ylabel('$P_r$', fontsize=11)
        ax.set_xlabel('Pressure (psia)')
        ymax = 60
        ax.hlines(y=30, xmin=pmin, xmax=pmax, color='k', linestyle='--', linewidth=0.8, alpha=0.7)
        ax.text(100, 31.3, '$P_r$ = 30.0', alpha=0.7)
        ax.hlines(y=15, xmin=pmin, xmax=pmax, color='k', linestyle='--', linewidth=0.8, alpha=0.7)
        ax.text(100, 16, '$P_r$ = 15.0', alpha=0.7)
        # Lower bound of the P_r working range is 0.2, per the table above
        ax.hlines(y=0.2, xmin=pmin, xmax=pmax, color='k', linestyle='--', linewidth=0.8, alpha=0.7)
        ax.text(100, 1.2, '$P_r$ = 0.2', alpha=0.7)
        ax.fill_between(x=Ps, y1=0.2, y2=30, color='green', interpolate=True, alpha=0.1, zorder=-99)

    else:
        for sg in sgs:
            Trs = results[sg]['Tr']
            Ts = results[sg]['T']

            p = ax.plot(Ts, Trs, label=sg)

            t = ax.text(Ts[-1], max(Trs), 'sg = ' + str(sg), color=p[0].get_color())
            t.set_bbox(dict(facecolor='white', alpha=0.7, edgecolor='white', pad=1))

        ax.text(0.06, 0.9, '$T_{r}$  approximation', fontsize=9, transform=ax.transAxes,
            bbox=dict(facecolor='white'))
        ax.set_ylabel('$T_r$', fontsize=11)
        ax.set_xlabel('Temperature (°F)')
        ymax = 10
        ax.hlines(y=3, xmin=tmin, xmax=tmax, color='k', linestyle='--', linewidth=0.8, alpha=0.7)
        ax.text(tmin, 3.2, '$T_r$ = 3.0', alpha=0.7)
        # Lower bound of the T_r working range is 1.0, per the table above
        ax.hlines(y=1, xmin=tmin, xmax=tmax, color='k', linestyle='--', linewidth=0.8, alpha=0.7)
        ax.text(tmin, 1.2, '$T_r$ = 1.0', alpha=0.7)
        ax.fill_between(x=Ts, y1=1, y2=3, color='green', interpolate=True, alpha=0.1, zorder=-99)


    ymin = 0 - 0.05 * ymax
    ax.set_ylim(ymin, ymax)

    ax.minorticks_on()
    ax.grid(alpha=0.5)
    ax.grid(visible=True, which='minor', alpha=0.1)
    ax.spines.top.set_visible(False)
    ax.spines.right.set_visible(False)


    def setbold(txt):
        return ' '.join([r"$\bf{" + item + "}$" for item in txt.split(' ')])

    bold_txt = setbold('Working Ranges of Z Models')
    plain_txt = ',  for each of specific gravity, pressure, and  temperature ranges'

    fig.suptitle(bold_txt + plain_txt,
                 verticalalignment='top', x=0, horizontalalignment='left', fontsize=11)
    yloc = 0.9
    ax.annotate('', xy=(0.01, yloc), xycoords='figure fraction', xytext=(1.02, yloc),
                arrowprops=dict(arrowstyle="-", color='k', lw=0.7))
    ax.text(0.95, 0.1, 'GasCompressibility-Py', fontsize=9, ha='right', va='center',
            transform=ax.transAxes, color='grey', alpha=0.5)

fig.tight_layout()

3.2. Compatibilities#

GasCompressibility-py currently supports two pseudo-critical models ('sutton' | 'piper') and four z-factor correlation models ('DAK' | 'hall_yarborough' | 'londono' | 'kareem'). Which pseudo-critical model should you pair with which z-factor correlation model? The table below, presented in Elsharkawy and Elsharkawy (2020) [12], may help in deciding:

The table shows that Sutton’s pseudo-critical property model combined with Londono’s z-factor correlation model yields the highest coefficient of determination (\(R^2\)) of 0.974. However, as far as the models implemented in this package are concerned, you can use any combination you want. They all have \(R^2 \geq 0.957\), which is more than good enough for practical use in real-life applications.

Notes

Unfortunately, this performance evaluation table does not include any explicit (fast) z-factor correlation models. Therefore, my recommendation is to avoid Kareem’s method unless computation speed is very important, since there’s no third-party paper (that I know of) that evaluates Kareem’s method; the only evaluation comes from its own authors.

3.3. Brief Of Each Models#

Pseudo-critical models:

Sutton (1985): Makes corrections for acid fractions: \(H_2S\) and \(CO_2\)
Piper (1993): Improved version of Sutton. Additionally supports corrections for \(N_2\) along with \(H_2S\) and \(CO_2\)

Z-factor models:

DAK (1975): The most widely used z-factor model in the oil and gas industry for the past 40 years. You can’t go wrong with this model
Hall-Yarborough (1973): Not recommended.
Londono (2005): Improved version of DAK. Math is exactly the same, but regression coefficients are fitted with 4x more data points.
Kareem (2016): Fast, but has a shorter working range (\(P_r < 15\))
