Ë t‰Kg<ãó—dZddlmZddlZddlZddlmZddlm Z m Z ddlmZdd gZ e d „d„dd„d d¬« d dd„«Zdd„Ze d„dd„e¬«dddddœ d d„«Zd„Zd„Zd„Zd„Zd„Zy)!z5 Created on Fri Apr 2 09:06:05 2021 @author: matth é)ÚannotationsN)Úspecialé)Ú_axis_nan_policy_factoryÚ_broadcast_arrays)Úarray_namespaceÚentropyÚdifferential_entropycó—|S©N©©Úxs úX/home/alanp/www/video.onchill/myenv/lib/python3.12/site-packages/scipy/stats/_entropy.pyÚró€‰aócó—d|vr|ddSdS)NÚqkérr )Úkwgss rrrs!€Ød‰l˜t D™zÐ5ˆð Ø ð rcó—|fSrr rs rrrs€¨A©4rTéÿÿÿÿ)Ú n_samplesÚ n_outputsÚresult_to_tupleÚpairedÚ too_smallcóR—||dkrtd«‚|€t|«nt||«}|j|«}tjd¬«5d|z|j||d¬«z}ddd«|€t j|«}n`|j|«}t||fd|¬ «\}}t|d¬«}d|z|j |fi|¤Žz}t j||«}|j||¬ «}||tj|«z}|S#1swYŒ°xYw)aÖ Calculate the Shannon entropy/relative entropy of given distribution(s). If only probabilities `pk` are given, the Shannon entropy is calculated as ``H = -sum(pk * log(pk))``. If `qk` is not None, then compute the relative entropy ``D = sum(pk * log(pk / qk))``. This quantity is also known as the Kullback-Leibler divergence. This routine will normalize `pk` and `qk` if they don't sum to 1. Parameters ---------- pk : array_like Defines the (discrete) distribution. Along each axis-slice of ``pk``, element ``i`` is the (possibly unnormalized) probability of event ``i``. qk : array_like, optional Sequence against which the relative entropy is computed. Should be in the same format as `pk`. base : float, optional The logarithmic base to use, defaults to ``e`` (natural logarithm). axis : int, optional The axis along which the entropy is calculated. Default is 0. Returns ------- S : {float, array_like} The calculated entropy. Notes ----- Informally, the Shannon entropy quantifies the expected uncertainty inherent in the possible outcomes of a discrete random variable. 
    For example, if messages consisting of sequences of symbols from a set are
    to be encoded and transmitted over a noiseless channel, then the Shannon
    entropy ``H(pk)`` gives a tight lower bound for the average number of units
    of information needed per symbol if the symbols occur with frequencies
    governed by the discrete distribution `pk` [1]_. The choice of base
    determines the choice of units; e.g., ``e`` for nats, ``2`` for bits, etc.

    The relative entropy, ``D(pk|qk)``, quantifies the increase in the average
    number of units of information needed per symbol if the encoding is
    optimized for the probability distribution `qk` instead of the true
    distribution `pk`. Informally, the relative entropy quantifies the expected
    excess in surprise experienced if one believes the true distribution is
    `qk` when it is actually `pk`.

    A related quantity, the cross entropy ``CE(pk, qk)``, satisfies the
    equation ``CE(pk, qk) = H(pk) + D(pk|qk)`` and can also be calculated with
    the formula ``CE = -sum(pk * log(qk))``. It gives the average
    number of units of information needed per symbol if an encoding is
    optimized for the probability distribution `qk` when the true distribution
    is `pk`. It is not computed directly by `entropy`, but it can be computed
    using two calls to the function (see Examples).

    See [2]_ for more information.

    References
    ----------
    .. [1] Shannon, C.E. (1948), A Mathematical Theory of Communication.
           Bell System Technical Journal, 27: 379-423.
           https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
    .. [2] Thomas M. Cover and Joy A. Thomas. 2006. Elements of Information
           Theory (Wiley Series in Telecommunications and Signal Processing).
           Wiley-Interscience, USA.

Examples -------- The outcome of a fair coin is the most uncertain: >>> import numpy as np >>> from scipy.stats import entropy >>> base = 2 # work in units of bits >>> pk = np.array([1/2, 1/2]) # fair coin >>> H = entropy(pk, base=base) >>> H 1.0 >>> H == -np.sum(pk * np.log(pk)) / np.log(base) True The outcome of a biased coin is less uncertain: >>> qk = np.array([9/10, 1/10]) # biased coin >>> entropy(qk, base=base) 0.46899559358928117 The relative entropy between the fair coin and biased coin is calculated as: >>> D = entropy(pk, qk, base=base) >>> D 0.7369655941662062 >>> D == np.sum(pk * np.log(pk/qk)) / np.log(base) True The cross entropy can be calculated as the sum of the entropy and relative entropy`: >>> CE = entropy(pk, base=base) + entropy(pk, qk, base=base) >>> CE 1.736965594166206 >>> CE == -np.sum(pk * np.log(qk)) / np.log(base) True Nrú+`base` must be a positive number or `None`.Úignore)Úinvalidgð?T©ÚaxisÚkeepdims)r$Úxp©r$) Ú ValueErrorrÚasarrayÚnpÚerrstateÚsumrÚentrrÚdictÚrel_entrÚmathÚlog)ÚpkrÚbaser$r&ÚvecÚ sum_kwargsÚSs rr r s€ðvÐ˜D AšIÜÐFÓGÐGà " Œ˜Ô ´ÀÀBÓ0G€Bà ‰B‹€BÜ ‰˜XÖ &Ø ‰Vb—f‘f˜R d°TfÓ:Ñ :ˆ÷ 'à €zÜl‰l˜2Ó‰à Z‰Z˜‹^ˆÜ" B¨ 8°$¸2Ô>‰ˆˆBÜ˜t¨dÔ3ˆ Ø ‰Vfb—f‘f˜RÑ. :Ñ.Ñ .ˆÜ×Ñ˜r 2Ó&ˆØ ‰ˆs˜ˆÓ€AØÐØ ŒTX‰Xd‹^ÑˆØ€H÷ 'Ð &úsÁDÄD&cóÄ—|d}|j|}|jdtjtj|«dz««}dd|zcxkr|ksyyy)NrÚ window_lengthçà?rTF)ÚshapeÚgetr0ÚfloorÚsqrt)ÚsamplesÚkwargsr$ÚvaluesÚnr8s rÚ"_differential_entropy_is_too_smallrB¢sb€Ø Q‰Z€FØ‰TÑ€AØ—J‘J˜Ü#Ÿz™z¬$¯)©)°A«,¸Ñ*<Ó=ó?€MàMÑ!Ô% AÒ%Øð&ØØrcó—|Srr rs rrrrrcó—|fSrr rs rrrs€¸¹r)rrrÚauto)r8r3r$Úmethodcóx—tj|«}tj||d«}|jd}|€+t j t j|«dz«}dd|zcxkr|ksntd|›d|›d«‚||dkrtd«‚tj|d¬ «}tttttd œ}|j«}||vrdt|«›}t|«‚|dk(r|d krd}n |dkrd}nd}||||«} || tj|«z} | S)aVGiven a sample of a distribution, estimate the differential entropy. Several estimation methods are available using the `method` parameter. By default, a method is selected based the size of the sample. Parameters ---------- values : sequence Sample from a continuous distribution. 
    window_length : int, optional
        Window length for computing the Vasicek estimate. Must be an integer
        between 1 and half of the sample size. If ``None`` (the default), it
        uses the heuristic value

        .. math::
            \left \lfloor \sqrt{n} + 0.5 \right \rfloor

        where :math:`n` is the sample size. This heuristic was originally
        proposed in [2]_ and has become common in the literature.
    base : float, optional
        The logarithmic base to use, defaults to ``e`` (natural logarithm).
    axis : int, optional
        The axis along which the differential entropy is calculated.
        Default is 0.
    method : {'vasicek', 'van es', 'ebrahimi', 'correa', 'auto'}, optional
        The method used to estimate the differential entropy from the sample.
        Default is ``'auto'``. See Notes for more information.

    Returns
    -------
    entropy : float
        The calculated differential entropy.

    Notes
    -----
    This function will converge to the true differential entropy in the limit

    .. math::
        n \to \infty, \quad m \to \infty, \quad \frac{m}{n} \to 0

    Here :math:`n` is the sample size and :math:`m` is ``window_length``.
    The optimal choice of ``window_length`` for a given sample size depends on
    the (unknown) distribution. Typically, the smoother the density of the
    distribution, the larger the optimal value of ``window_length`` [1]_.

    The following options are available for the `method` parameter.

    * ``'vasicek'`` uses the estimator presented in [1]_. This is
      one of the first and most influential estimators of differential entropy.
    * ``'van es'`` uses the bias-corrected estimator presented in [3]_, which
      is not only consistent but, under some conditions, asymptotically normal.
    * ``'ebrahimi'`` uses an estimator presented in [4]_, which was shown
      in simulation to have smaller bias and mean squared error than
      the Vasicek estimator.
    * ``'correa'`` uses the estimator presented in [5]_ based on local linear
      regression. In a simulation study, it had consistently smaller mean
      squared error than the Vasicek estimator, but it is more expensive to
      compute.
    * ``'auto'`` selects the method automatically (default).
      Currently, this selects ``'van es'`` for very small samples (10 or fewer
      observations), ``'ebrahimi'`` for moderate sample sizes (11-1000), and
      ``'vasicek'`` for larger samples, but this behavior is subject to change
      in future versions.

    All estimators are implemented as described in [6]_.

    References
    ----------
    .. [1] Vasicek, O. (1976). A test for normality based on sample entropy.
           Journal of the Royal Statistical Society:
           Series B (Methodological), 38(1), 54-59.
    .. [2] Grzegorzewski, P., & Wieczorkowski, R. (1999). Entropy-based
           goodness-of-fit test for exponentiality. Communications in
           Statistics-Theory and Methods, 28(5), 1183-1202.
    .. [3] Van Es, B. (1992). Estimating functionals related to a density by a
           class of statistics based on spacings. Scandinavian Journal of
           Statistics, 61-72.
    .. [4] Ebrahimi, N., Pflughoeft, K., & Soofi, E. S. (1994). Two measures
           of sample entropy. Statistics & Probability Letters, 20(3), 225-234.
    .. [5] Correa, J. C. (1995). A new estimator of entropy. Communications
           in Statistics-Theory and Methods, 24(10), 2439-2449.
    .. [6] Noughabi, H. A. (2015). Entropy Estimation Using Numerical Methods.
           Annals of Data Science, 2(2), 231-241.
           https://link.springer.com/article/10.1007/s40745-015-0045-9

    Examples
    --------
    >>> import numpy as np
    >>> from scipy.stats import differential_entropy, norm

    Entropy of a standard normal distribution:

    >>> rng = np.random.default_rng()
    >>> values = rng.standard_normal(100)
    >>> differential_entropy(values)
    1.3407817436640392

    Compare with the true entropy:

    >>> float(norm.entropy())
    1.4189385332046727

    For several sample sizes between 5 and 1000, compare the accuracy of
    the ``'vasicek'``, ``'van es'``, and ``'ebrahimi'`` methods. Specifically,
    compare the root mean squared error (over 1000 trials) between the estimate
    and the true differential entropy of the distribution.

    >>> from scipy import stats
    >>> import matplotlib.pyplot as plt
    >>>
    >>>
    >>> def rmse(res, expected):
    ...     '''Root mean squared error'''
    ...
return np.sqrt(np.mean((res - expected)**2)) >>> >>> >>> a, b = np.log10(5), np.log10(1000) >>> ns = np.round(np.logspace(a, b, 10)).astype(int) >>> reps = 1000 # number of repetitions for each sample size >>> expected = stats.expon.entropy() >>> >>> method_errors = {'vasicek': [], 'van es': [], 'ebrahimi': []} >>> for method in method_errors: ... for n in ns: ... rvs = stats.expon.rvs(size=(reps, n), random_state=rng) ... res = stats.differential_entropy(rvs, method=method, axis=-1) ... error = rmse(res, expected) ... method_errors[method].append(error) >>> >>> for method, errors in method_errors.items(): ... plt.loglog(ns, errors, label=method) >>> >>> plt.legend() >>> plt.xlabel('sample size') >>> plt.ylabel('RMSE (1000 trials)') >>> plt.title('Entropy Estimator Error (Exponential Distribution)') rr9rzWindow length (z7) must be positive and less than half the sample size (z).rr r')Úvasicekúvan esÚcorreaÚebrahimirEz`method` must be one of rEé rIièrKrH)r*r)Úmoveaxisr:r0r<r=r(ÚsortÚ_vasicek_entropyÚ_van_es_entropyÚ_correa_entropyÚ_ebrahimi_entropyÚlowerÚsetr1) r@r8r3r$rFrAÚsorted_dataÚmethodsÚmessageÚress rr r ¬sO€ôhZ‰Z˜Ó €FÜ [‰[˜ rÓ *€FØ‰RÑ€AàÐÜŸ ™ ¤4§9¡9¨Q£<°#Ñ#5Ó6ˆ àMÑ!Ô% AÔ%ÜØ˜m˜_ð-*Ø*+¨¨Bð 0ó ð ð Ð˜D AšIÜÐFÓGÐGä—'‘'˜& rÔ*€Kä*Ü(Ü(Ü,Ü'ñ )€Gð \‰\‹^€FØ WÑØ,¬S°«\¨NÐ;ˆÜ˜Ó!Ð!à ÒØŠ7Ø‰FØ $ŠYØ‰FàˆFà ˆ'&‰/˜+ }Ó 5€CàÐØŒrv‰vd‹|Ñˆà€Jrcóî—tj|j«}||d<tj|ddgf|«}tj|ddgf|«}tj|||fd¬«S)z9Pad the data for computing the rolling window difference.r.rr')r*Úarrayr:Úbroadcast_toÚconcatenate)ÚXÚmr:ÚXlÚXrs rÚ_pad_along_last_axisralsj€ô H‰HQ—W‘WÓ€EØ€Eˆ"IÜ ‰˜˜3 ˜8™ eÓ ,€BÜ ‰˜˜3 ˜9™ uÓ -€BÜ >‰>˜2˜q "˜+¨BÔ/Ð/rcóÔ—|jd}t||«}|dd|zd…f|ddd|z…fz }tj|d|zz|z«}tj|d¬«S)z:Compute the Vasicek estimator as described in [6] Eq. 1.3.r.rNéþÿÿÿr')r:rar*r1Úmean)r]r^rAÚdifferencesÚlogss rrOrOvsp€à ‰‰€AÜ˜Q Ó"€AØC˜˜Q™™K‘. 
1 S¨)¨B°©F¨) ^Ñ#4Ñ4€KÜ 6‰6!Qq‘S‘'˜KÑ'Ó(€DÜ 7‰74˜bÔ!Ð!rcó†—|jd}|d|d…f|dd|…fz }d||z ztjtj|dz|z|z«d¬«z}tj||dz«}|tjd|z«ztj|«ztj|dz«z S)z1Compute the van Es estimator as described in [6].r.Nrr')r:r*r,r1Úarange)r]r^rAÚ differenceÚterm1Úks rrPrPs¯€ð ‰‰€AØ3˜™7‘˜a S q b S ™kÑ)€JØ ˆq‰s‰G”b—f‘fœRŸV™V Q q¡S¨!¡G¨jÑ$8Ó9ÀÔCÑC€EÜ ‰ !Qq‘SÓ€AØ”2—6‘6˜!˜A™#“;Ñ¤§¡¨£Ñ*¬R¯V©V°A°a±C«[Ñ8Ð8rcóÐ—|jd}t||«}|dd|zd…f|ddd|z…fz }tjd|dz«j t «}tj|«dz}d|||kdz |zz|||k<d|||||z dzk\z |zz||||z dzk\<tj||z||zz«}tj|d¬«S)z3Compute the Ebrahimi estimator as described in [6].r.rNrcrr') r:rar*rhÚastypeÚfloatÚ ones_liker1rd)r]r^rAreÚiÚcirfs rrRrRŠsû€ð ‰‰€AÜ˜Q Ó"€AàC˜˜Q™™K‘. 1 S¨)¨B°©F¨) ^Ñ#4Ñ4€Kä ‰ !Qq‘SÓ× Ñ ¤Ó'€AÜ ‰a‹˜Ñ €BØa˜˜Q™‘i !‘m QÑ&Ñ&€B€qˆAvJØ˜a ! A¨¨1©¨Q©¡J¡-Ñ/°Ñ2Ñ2€B€qˆA‰EA‰I~Ñä 6‰6!k‘/ R¨!¡VÑ,Ó-€DÜ 7‰74˜bÔ!Ð!rcóÞ—|jd}t||«}tjd|dz«}tj||dz«dd…df}||z}||zdz }tj|d|fdd¬«}|d|f|z }tj ||zd¬«} |tj |d zd¬«z} tjtj| | z«d¬«S) z1Compute the Correa estimator as described in [6].rrN.rcTr#r'r)r:rar*rhrdr,r1)r]r^rArpÚdjÚjÚj0ÚXibarriÚnumÚdens rrQrQ›sß€ð ‰‰€AÜ˜Q Ó"€Aä ‰ !Qq‘SÓ€AÜ ‰A2q˜‘sÓ šA˜t˜GÑ $€BØ ˆB‰€AØ ˆQ‰‰€BäG‰GAc˜2g‘J R°$Ô7€EØ3˜7‘˜eÑ#€JÜ &‰&˜B‘ RÔ (€CØ ŒBF‰F:˜q‘= rÔ*Ñ *€CÜG‰G”B—F‘F˜3˜s™7“O¨"Ô-Ð-Ð-r)NNr) r2únp.typing.ArrayLikerznp.typing.ArrayLike | Noner3úfloat | Noner$ÚintÚreturnúnp.number | np.ndarray)r)r@ryr8z int | Noner3rzr$r{rFÚstrr|r})Ú__doc__Ú __future__rr0Únumpyr*ÚscipyrÚ_axis_nan_policyrrÚscipy._lib._array_apirÚ__all__r rBr rarOrPrRrQr rrÚr†sðñõ#ÛÛÝßIÝ1àÐ,Ð -€ñÙñð¡¸Øôð.2Ø!%ØðE Ø*ðE àðE ððE ð(ò E óðE óPñÙ˜1©nØ0ôð!%ØØØñ yØðyððyðð yð ðyð ð yðòyó ðyòx0ò"ò9ò"ó".r