tailestim

A Python package for estimating tail parameters of heavy-tailed distributions, which is useful for analyzing power-law behavior in complex networks. Currently in development (alpha version).

Note

The original estimation implementations are from ivanvoitalov/tail-estimation, which is based on the paper "Scale-free networks well done" (Voitalov et al. 2019). tailestim is a wrapper package that provides a more convenient/modern interface and logging, installable through pip.

Features

Multiple estimation methods including Hill, Moments, Kernel, Pickands, and Smooth Hill estimators
Double-bootstrap procedure for optimal threshold selection
Built-in example datasets

Installation

pip install tailestim

Quick Start

Using Built-in Datasets

from tailestim import TailData
from tailestim import HillEstimator, KernelTypeEstimator, MomentsEstimator

# Load a sample dataset
data = TailData(name='CAIDA_KONECT').data

# Initialize and fit the Hill estimator
estimator = HillEstimator()
estimator.fit(data)

# Get the estimated parameters
result = estimator.get_parameters()
gamma = result['gamma']

# Print full results
print(estimator)

Using degree sequence from networkx graphs

import networkx as nx
from tailestim import HillEstimator, KernelTypeEstimator, MomentsEstimator

# Create or load your network
G = nx.barabasi_albert_graph(10000, 2)
degree = list(dict(G.degree()).values()) # Degree sequence

# Initialize and fit the Hill estimator
estimator = HillEstimator()
estimator.fit(degree)

# Get the estimated parameters
result = estimator.get_parameters()
gamma = result['gamma']

# Print full results
print(estimator)

Available Estimators

The package provides several estimators for tail estimation. For details on parameters that can be specified to each estimator, please refer to the original repository ivanvoitalov/tail-estimation, original paper, or the actual code.

Hill Estimator (HillEstimator)
- Classical Hill estimator with double-bootstrap for optimal threshold selection
- Generally recommended for power law analysis
Moments Estimator (MomentsEstimator)
- Moments-based estimation with double-bootstrap
- More robust to certain types of deviations from pure power law
Kernel-type Estimator (KernelEstimator)
- Kernel-based estimation with double-bootstrap and bandwidth selection
Pickands Estimator (PickandsEstimator)
- Pickands-based estimation (no bootstrap)
- Provides arrays of estimates across different thresholds
Smooth Hill Estimator (SmoothHillEstimator)
- Smoothed version of the Hill estimator (no bootstrap)

Results

The full result can be obtained by estimator.get_parameters(), which returns a dictionary. This includes:

gamma: Power law exponent (γ = 1 + 1/ξ)
xi_star: Tail index (ξ)
k_star: Optimal order statistic
Bootstrap results (when applicable):
- First and second bootstrap AMSE values
- Optimal bandwidths or minimum AMSE fractions

Example Output

When you print(estimator) after fitting, you will get the following output.

==================================================
Tail Estimation Results (HillEstimator)
==================================================

Parameters:
--------------------
Optimal order statistic (k*): 26708
Tail index (ξ): 0.3974
Gamma (powerlaw exponent) (γ): 3.5167

Bootstrap Results:
--------------------
First bootstrap minimum AMSE fraction: 0.2744
Second bootstrap minimum AMSE fraction: 0.2745

Built-in Datasets

The package includes several example datasets:

CAIDA_KONECT
Libimseti_in_KONECT
Pareto (Follows power-law with $\gamma=2.5$)

Load any example dataset using:

from tailestim import TailData
data = TailData(name='dataset_name').data

Loaded data

References

I. Voitalov, P. van der Hoorn, R. van der Hofstad, and D. Krioukov. Scale-free networks well done. Phys. Rev. Res., Oct. 2019, doi: 10.1103/PhysRevResearch.1.033034.
I. Voitalov. ivanvoitalov/tail-estimation, GitHub. Mar. 2018. https://github.com/ivanvoitalov/tail-estimation.

Citations

If you use tailestim in your research or projects, I would greatly appreciate if you could cite this package, the original implementation, and the original paper (Voitalov et al. 2019).

@article{voitalov2019scalefree,
  title = {Scale-free networks well done},
  author = {Voitalov, Ivan and van der Hoorn, Pim and van der Hofstad, Remco and Krioukov, Dmitri},
  journal = {Phys. Rev. Res.},
  volume = {1},
  issue = {3},
  pages = {033034},
  numpages = {30},
  year = {2019},
  month = {Oct},
  publisher = {American Physical Society},
  doi = {10.1103/PhysRevResearch.1.033034},
  url = {https://link.aps.org/doi/10.1103/PhysRevResearch.1.033034}
}

@software{voitalov2018tailestimation,
  author       = {Voitalov, Ivan},
  title        = {tail-estimation},
  month        = mar,
  year         = 2018,
  publisher    = {GitHub},
  url          = {https://github.com/ivanvoitalov/tail-estimation}
}

@software{ueda2025tailestim,
  author       = {Ueda, Minami},
  title        = {tailestim: A Python package for estimating tail parameters of heavy-tailed distributions},
  month        = mar,
  year         = 2025,
  publisher    = {GitHub},
  url          = {https://github.com/mu373/tailestim}
}

License

tailestim is distributed under the terms of the MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
.github/workflows		.github/workflows
docs		docs
src/tailestim		src/tailestim
tests		tests
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
DEVELOP.md		DEVELOP.md
LICENSE.txt		LICENSE.txt
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tailestim

Features

Installation

Quick Start

Using Built-in Datasets

Using degree sequence from networkx graphs

Available Estimators

Results

Example Output

Built-in Datasets

References

Citations

License

About

Releases 5

Contributors 2

Languages

License

mu373/tailestim

Folders and files

Latest commit

History

Repository files navigation

tailestim

Features

Installation

Quick Start

Using Built-in Datasets

Using degree sequence from networkx graphs

Available Estimators

Results

Example Output

Built-in Datasets

References

Citations

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 5

Contributors 2

Languages