Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place. Commercial Alternative to JupyterHub.
Real-time collaboration for Jupyter Notebooks, Linux Terminals, LaTeX, VS Code, R IDE, and more,
all in one place. Commercial Alternative to JupyterHub.
Jupyter notebook sphinx2ipynb/doc/build/ipynb/sympy/doc/src/modules/ntheory.ipynb
Number Theory
Ntheory Class Reference
class sympy.ntheory.generate.Sieve
An infinite list of prime numbers, implemented as a dynamically growing sieve of Eratosthenes. When a lookup is requested involving an odd number that has not been sieved, the sieve is automatically extended up to that number.
extend(n)
Grow the sieve to cover all primes <= n (a real number).
extend_to_no(i)
Extend to include the ith prime number.
i must be an integer.
The list is extended by 50% if it is too short, so it is likely that it will be longer than requested.
primerange(a, b)
Generate all prime numbers in the range [a, b).
search(n)
Return the indices i, j of the primes that bound n.
If n is prime then i == j.
Although n can be an expression, if ceiling cannot convert it to an integer then an n error will be raised.
Ntheory Functions Reference
sympy.ntheory.generate.prime(nth)
Return the nth prime, with the primes indexed as prime(1) = 2, prime(2) = 3, etc.... The nth prime is approximately n*log(n) and can never be larger than 2**n.
sympy.ntheory.primetest.isprime : Test if n is prime primerange : Generate all primes in a given range primepi : Return the number of primes less than or equal to n
sympy.ntheory.generate.primepi(n)
Return the value of the prime counting function pi(n) = the number of prime numbers less than or equal to n.
sympy.ntheory.primetest.isprime : Test if n is prime primerange : Generate all primes in a given range prime : Return the nth prime
sympy.ntheory.generate.nextprime(n, ith=1)
Return the ith prime greater than n.
i must be an integer.
Potential primes are located at 6*j +/- 1. This property is used during searching.
prevprime : Return the largest prime smaller than n primerange : Generate all primes in a given range
sympy.ntheory.generate.prevprime(n)
Return the largest prime smaller than n.
Potential primes are located at 6*j +/- 1. This property is used during searching.
nextprime : Return the ith prime greater than n primerange : Generates all primes in a given range
sympy.ntheory.generate.primerange(a, b)
Generate a list of all prime numbers in the range [a, b).
If the range exists in the default sieve, the values will be returned from there; otherwise values will be returned but will not modify the sieve.
Some famous conjectures about the occurence of primes in a given range are [1]:
Twin primes: though often not, the following will give 2 primes
an infinite number of times:
primerange(6n - 1, 6n + 2)
Legendre's: the following always yields at least one prime
primerange(n**2, (n+1)**2+1)
Bertrand's (proven): there is always a prime in the range
primerange(n, 2*n)
Brocard's: there are at least four primes in the range
primerange(prime(n)**2, prime(n+1)**2)
The average gap between primes is log(n) [2]; the gap between primes can be arbitrarily large since sequences of composite numbers are arbitrarily large, e.g. the numbers in the sequence n! + 2, n! + 3 ... n! + n are all composite.
The Sieve method, primerange, is generally faster but it will occupy more memory as the sieve stores values. The default instance of Sieve, named sieve, can be used:
nextprime : Return the ith prime greater than n prevprime : Return the largest prime smaller than n randprime : Returns a random prime in a given range primorial : Returns the product of primes based on condition Sieve.primerange : return range from already computed primes
or extend the sieve to contain the requested range.
sympy.ntheory.generate.randprime(a, b)
Return a random prime number in the range [a, b).
Bertrand's postulate assures that randprime(a, 2*a) will always succeed for a > 1.
primerange : Generate all primes in a given range
sympy.ntheory.generate.primorial(n, nth=True)
Returns the product of the first n primes (default) or the primes less than or equal to n (when "nth=False").
One can argue that the primes are infinite since if you take a set of primes and multiply them together (e.g. the primorial) and then add or subtract 1, the result cannot be divided by any of the original factors, hence either 1 or more new primes must divide this product of primes.
In this case, the number itself is a new prime:
In this case two new primes are the factors:
Here, some primes smaller and larger than the primes multiplied together are obtained:
primerange : Generate all primes in a given range
sympy.ntheory.generate.cycle_length(f, x0, nmax=None, values=False)
For a given iterated sequence, return a generator that gives the length of the iterated cycle (lambda) and the length of terms before the cycle begins (mu); if "values" is True then the terms of the sequence will be returned instead. The sequence is started with value "x0".
Note: more than the first lambda + mu terms may be returned and this is the cost of cycle detection with Brent's method; there are, however, generally less terms calculated than would have been calculated if the proper ending point were determined, e.g. by using Floyd's method.
This will yield successive values of i <-- func(i):
A function is defined:
and given a seed of 4 and the mu and lambda terms calculated:
We can see what is meant by looking at the output:
There are 6 repeating values after the first 2.
If a sequence is suspected of being longer than you might wish, "nmax" can be used to exit early (and mu will be returned as None):
Code modified from:
sympy.ntheory.factor_.smoothness(n)
Return the B-smooth and B-power smooth values of n.
The smoothness of n is the largest prime factor of n; the power- smoothness is the largest divisor raised to its multiplicity.
factorint, smoothness_p
sympy.ntheory.factor_.smoothness_p(n, m=-1, power=0, visual=None)
Return a list of [m, (p, (M, sm(p + m), psm(p + m)))...] where:
p**M is the base-p divisor of n
sm(p + m) is the smoothness of p + m (m = -1 by default)
psm(p + m) is the power smoothness of p + m
The list is sorted according to smoothness (default) or by power smoothness if power=1.
The smoothness of the numbers to the left (m = -1) or right (m = 1) of a factor govern the results that are obtained from the p +/- 1 type factoring methods.
If visual=True then an annotated string will be returned:
This string can also be generated directly from a factorization dictionary and vice versa:
The table of the output logic is:
Visual
Input
True
False
other
dict
str
tuple
str
str
str
tuple
dict
tuple
str
tuple
str
n
str
tuple
tuple
mul
str
tuple
tuple
factorint, smoothness
sympy.ntheory.factor_.trailing(n)
Count the number of trailing zero digits in the binary representation of n, i.e. determine the largest power of 2 that divides n.
sympy.ntheory.factor_.multiplicity(p, n)
Find the greatest integer m such that p**m divides n.
sympy.ntheory.factor_.perfect_power(n, candidates=None, big=True, factor=True)
Return "(b, e)" such that "n" == "b**e" if "n" is a perfect power; otherwise return "False".
By default, the base is recursively decomposed and the exponents collected so the largest possible "e" is sought. If "big=False" then the smallest possible "e" (thus prime) will be chosen.
If "candidates" for exponents are given, they are assumed to be sorted and the first one that is larger than the computed maximum will signal failure for the routine.
If "factor=True" then simultaneous factorization of n is attempted since finding a factor indicates the only possible root for n. This is True by default since only a few small factors will be tested in the course of searching for the perfect power.
sympy.ntheory.factor_.pollard_rho(n, s=2, a=1, retries=5, seed=1234, max_steps=None, F=None)
Use Pollard's rho method to try to extract a nontrivial factor of "n". The returned factor may be a composite number. If no factor is found, "None" is returned.
The algorithm generates pseudo-random values of x with a generator function, replacing x with F(x). If F is not supplied then the function x**2 + "a" is used. The first value supplied to F(x) is "s". Upon failure (if "retries" is > 0) a new "a" and "s" will be supplied; the "a" will be ignored if F was supplied.
The sequence of numbers generated by such functions generally have a a lead-up to some number and then loop around back to that number and begin to repeat the sequence, e.g. 1, 2, 3, 4, 5, 3, 4, 5 -- this leader and loop look a bit like the Greek letter rho, and thus the name, 'rho'.
For a given function, very different leader-loop values can be obtained so it is a good idea to allow for retries:
Here is an explicit example where there is a two element leadup to a sequence of 3 numbers (11, 14, 4) that then repeat:
Instead of checking the differences of all generated values for a gcd with n, only the kth and 2*kth numbers are checked, e.g. 1st and 2nd, 2nd and 4th, 3rd and 6th until it has been detected that the loop has been traversed. Loops may be many thousands of steps long before rho finds a factor or reports failure. If "max_steps" is specified, the iteration is cancelled with a failure after the specified number of steps.
Use the default setting with a bad value of "a" and no retries:
If retries is > 0 then perhaps the problem will correct itself when new values are generated for a:
Richard Crandall & Carl Pomerance (2005), "Prime Numbers: A Computational Perspective", Springer, 2nd edition, 229-231
sympy.ntheory.factor_.pollard_pm1(n, B=10, a=2, retries=0, seed=1234)
Use Pollard's p-1 method to try to extract a nontrivial factor of "n". Either a divisor (perhaps composite) or "None" is returned.
The value of "a" is the base that is used in the test gcd(a**M - 1, n). The default is 2. If "retries" > 0 then if no factor is found after the first attempt, a new "a" will be generated randomly (using the "seed") and the process repeated.
Note: the value of M is lcm(1..B) = reduce(ilcm, range(2, B + 1)).
A search is made for factors next to even numbers having a power smoothness less than "B". Choosing a larger B increases the likelihood of finding a larger factor but takes longer. Whether a factor of n is found or not depends on "a" and the power smoothness of the even mumber just less than the factor p (hence the name p - 1).
Although some discussion of what constitutes a good "a" some descriptions are hard to interpret. At the modular.math site referenced below it is stated that if gcd(aM - 1, n) = N then aM % q**r is 1 for every prime power divisor of N. But consider the following:
So we should (and can) find a root with B=16:
If we attempt to increase B to 256 we find that it doesn't work:
But if the value of "a" is changed we find that only multiples of 257 work, e.g.:
Checking different "a" values shows that all the ones that didn't work had a gcd value not equal to "n" but equal to one of the factors:
But does aM % d for every divisor of n give 1?
No, only one of them. So perhaps the principle is that a root will be found for a given value of B provided that:
the power smoothness of the p - 1 value next to the root does not exceed B
a**M % p != 1 for any of the divisors of n.
By trying more than one "a" it is possible that one of them will yield a factor.
With the default smoothness bound, this number can't be cracked:
Increasing the smoothness bound helps:
Looking at the smoothness of the factors of this number we find:
The B and B-pow are the same for the p - 1 factorizations of the divisors because those factorizations had a very large prime factor:
Note that until B reaches the B-pow value of 1787, the number is not cracked;
The B value has to do with the factors of the number next to the divisor, not the divisors themselves. A worst case scenario is that the number next to the factor p has a large prime divisisor or is a perfect power. If these conditions apply then the power-smoothness will be about p/2 or p. The more realistic is that there will be a large prime factor next to p requiring a B value on the order of p/2. Although primes may have been searched for up to this level, the p/2 is a factor of p - 1, something that we don't know. The modular.math reference below states that 15% of numbers in the range of 1015 to 1515 + 104 are 106 power smooth so a B of 106 will fail 85% of the time in that range. From 108 to 108 + 103 the percentages are nearly reversed...but in that range the simple trial division is quite fast.
Richard Crandall & Carl Pomerance (2005), "Prime Numbers: A Computational Perspective", Springer, 2nd edition, 236-238
sympy.ntheory.factor_.factorint(n, limit=None, use_trial=True, use_rho=True, use_pm1=True, verbose=False, visual=None)
Given a positive integer "n", "factorint(n)" returns a dict containing the prime factors of "n" as keys and their respective multiplicities as values. For example:
For input less than 2, factorint behaves as follows:
"factorint(1)" returns the empty factorization, "{}"
"factorint(0)" returns "{0:1}"
"factorint(-n)" adds "-1:1" to the factors and then factors "n"
Partial Factorization:
If "limit" (> 3) is specified, the search is stopped after performing trial division up to (and including) the limit (or taking a corresponding number of rho/p-1 steps). This is useful if one has a large number and only is interested in finding small factors (if any). Note that setting a limit does not prevent larger factors from being found early; it simply means that the largest factor may be composite. Since checking for perfect power is relatively cheap, it is done regardless of the limit setting.
This number, for example, has two small factors and a huge semi-prime factor that cannot be reduced easily:
This number has a small factor and a residual perfect power whose base is greater than the limit:
Visual Factorization:
If "visual" is set to "True", then it will return a visual factorization of the integer. For example:
Note that this is achieved by using the evaluate=False flag in Mul and Pow. If you do other manipulations with an expression where evaluate=False, it may evaluate. Therefore, you should use the visual option only for visualization, and use the normal dictionary returned by visual=False if you want to perform operations on the factors.
You can easily switch between the two forms by sending them back to factorint:
If you want to send a number to be factored in a partially factored form you can do so with a dictionary or unevaluated expression:
The table of the output logic is:
Input
True
False
other
dict
mul
dict
mul
n
mul
dict
dict
mul
mul
dict
dict
Algorithm:
The function switches between multiple algorithms. Trial division quickly finds small factors (of the order 1-5 digits), and finds all large factors if given enough time. The Pollard rho and p-1 algorithms are used to find large factors ahead of time; they will often find factors of the order of 10 digits within a few seconds:
Any of these methods can optionally be disabled with the following boolean parameters:
"use_trial": Toggle use of trial division
"use_rho": Toggle use of Pollard's rho method
"use_pm1": Toggle use of Pollard's p-1 method
"factorint" also periodically checks if the remaining part is a prime number or a perfect power, and in those cases stops.
If "verbose" is set to "True", detailed progress is printed.
smoothness, smoothness_p, divisors
sympy.ntheory.factor_.primefactors(n, limit=None, verbose=False)
Return a sorted list of n's prime factors, ignoring multiplicity and any composite factor that remains if the limit was set too low for complete factorization. Unlike factorint(), primefactors() does not return -1 or 0.
divisors
sympy.ntheory.factor_.divisors(n, generator=False)
Return all divisors of n sorted from 1..n by default. If generator is True an unordered generator is returned.
The number of divisors of n can be quite large if there are many prime factors (counting repeated factors). If only the number of factors is desired use divisor_count(n).
This is a slightly modified version of Tim Peters referenced at: http://stackoverflow.com/questions/1010381/python-factorization
primefactors, factorint, divisor_count
sympy.ntheory.factor_.divisor_count(n, modulus=1)
Return the number of divisors of "n". If "modulus" is not 1 then only those that are divisible by "modulus" are counted.
factorint, divisors, totient
sympy.ntheory.factor_.totient(*args, **kw_args)
Calculate the Euler totient function phi(n)
divisor_count
sympy.ntheory.modular.symmetric_residue(a, m)
Return the residual mod m such that it is within half of the modulus.
sympy.ntheory.modular.crt(m, v, symmetric=False, check=True)
Chinese Remainder Theorem.
The moduli in m are assumed to be pairwise coprime. The output is then an integer f, such that f = v_i mod m_i for each pair out of v and m. If "symmetric" is False a positive integer will be returned, else |f| will be less than or equal to the LCM of the moduli, and thus f may be negative.
If the moduli are not co-prime the correct result will be returned if/when the test of the result is found to be incorrect. This result will be None if there is no solution.
The keyword "check" can be set to False if it is known that the moduli are coprime.
As an example consider a set of residues "U = [49, 76, 65]" and a set of moduli "M = [99, 97, 95]". Then we have:
This is the correct result because:
If the moduli are not co-prime, you may receive an incorrect result if you use "check=False":
Note: the order of gf_crt's arguments is reversed relative to crt, and that solve_congruence takes residue, modulus pairs.
Programmer's note: rather than checking that all pairs of moduli share no GCD (an O(n**2) test) and rather than factoring all moduli and seeing that there is no factor in common, a check that the result gives the indicated residuals is performed -- an O(n) operation.
solve_congruence sympy.polys.galoistools.gf_crt : low level crt routine used by this routine
sympy.ntheory.modular.crt1(m)
First part of Chinese Remainder Theorem, for multiple application.
sympy.ntheory.modular.crt2(m, v, mm, e, s, symmetric=False)
Second part of Chinese Remainder Theorem, for multiple application.
sympy.ntheory.modular.solve_congruence(*remainder_modulus_pairs, **hint)
Compute the integer "n" that has the residual "ai" when it is divided by "mi" where the "ai" and "mi" are given as pairs to this function: ((a1, m1), (a2, m2), ...). If there is no solution, return None. Otherwise return "n" and its modulus.
The "mi" values need not be co-prime. If it is known that the moduli are not co-prime then the hint "check" can be set to False (default=True) and the check for a quicker solution via crt() (valid when the moduli are co-prime) will be skipped.
If the hint "symmetric" is True (default is False), the value of "n" will be within 1/2 of the modulus, possibly negative.
What number is 2 mod 3, 3 mod 5 and 2 mod 7?
If you prefer to work with all remainder in one list and all moduli in another, send the arguments like this:
The moduli need not be co-prime; in this case there may or may not be a solution:
The symmetric flag will make the result be within 1/2 of the modulus:
crt : high level routine implementing the Chinese Remainder Theorem
sympy.ntheory.multinomial.binomial_coefficients(n)
Return a dictionary containing pairs where are binomial coefficients and . Examples
binomial_coefficients_list, multinomial_coefficients
sympy.ntheory.multinomial.binomial_coefficients_list(n)
Return a list of binomial coefficients as rows of the Pascal's triangle.
binomial_coefficients, multinomial_coefficients
sympy.ntheory.multinomial.multinomial_coefficients(m, n)
Return a dictionary containing pairs "{(k1,k2,..,km) : C_kn}" where "C_kn" are multinomial coefficients such that "n=k1+k2+..+km".
For example:
The algorithm is based on the following result:
Code contributed to Sage by Yann Laigle-Chapuy, copied with permission of the author.
binomial_coefficients_list, binomial_coefficients
sympy.ntheory.multinomial.multinomial_coefficients_iterator(m, n, _tuple=<type 'tuple'>)
multinomial coefficient iterator
This routine has been optimized for m large with respect to n by taking advantage of the fact that when the monomial tuples t are stripped of zeros, their coefficient is the same as that of the monomial tuples from "multinomial_coefficients(n, n)". Therefore, the latter coefficients are precomputed to save memory and time.
sympy.ntheory.partitions_.npartitions(n, verbose=False)
Calculate the partition function P(n), i.e. the number of ways that n can be written as a sum of positive integers.
P(n) is computed using the Hardy-Ramanujan-Rademacher formula, described e.g. at http://mathworld.wolfram.com/PartitionFunctionP.html
The correctness of this implementation has been tested for 10**n up to n = 8.
sympy.ntheory.primetest.mr(n, bases)
Perform a Miller-Rabin strong pseudoprime test on n using a given list of bases/witnesses.
Richard Crandall & Carl Pomerance (2005), "Prime Numbers: A Computational Perspective", Springer, 2nd edition, 135-138
A list of thresholds and the bases they require are here: http://en.wikipedia.org/wiki/Miller–Rabin_primality_test#Deterministic_variants_of_the_test
sympy.ntheory.primetest.isprime(n)
Test if n is a prime number (True) or not (False). For n < 10**16 the answer is accurate; greater n values have a small probability of actually being pseudoprimes.
Negative primes (e.g. -2) are not considered prime.
The function first looks for trivial factors, and if none is found, performs a safe Miller-Rabin strong pseudoprime test with bases that are known to prove a number prime. Finally, a general Miller-Rabin test is done with the first k bases which will report a pseudoprime as a prime with an error of about 4**-k. The current value of k is 46 so the error is about 2 x 10**-28.
sympy.ntheory.generate.primerange : Generates all primes in a given range sympy.ntheory.generate.primepi : Return the number of primes less than or equal to n sympy.ntheory.generate.prime : Return the nth prime
sympy.ntheory.residue_ntheory.n_order(a, n)
Returns the order of "a" modulo "n".
The order of "a" modulo "n" is the smallest integer "k" such that "a**k" leaves a remainder of 1 with "n".
sympy.ntheory.residue_ntheory.is_primitive_root(a, p)
Returns True if "a" is a primitive root of "p"
"a" is said to be the primitive root of "p" if gcd(a, p) == 1 and totient(p) is the smallest positive number s.t.
a**totient(p) cong 1 mod(p)
sympy.ntheory.residue_ntheory.primitive_root(p)
Returns the smallest primitive root or None
[1] W. Stein "Elementary Number Theory" (2011), page 44 [2] P. Hackman "Elementary Number Theory" (2009), Chapter C
p : positive integer
sympy.ntheory.residue_ntheory.sqrt_mod(a, p, all_roots=False)
find a root of "x**2 = a mod p"
a : integer p : positive integer all_roots : if True the list of roots is returned or None
If there is no root it is returned None; else the returned root is less or equal to "p // 2"; in general is not the smallest one. It is returned "p // 2" only if it is the only root.
Use "all_roots" only when it is expected that all the roots fit in memory; otherwise use "sqrt_mod_iter".
sympy.ntheory.residue_ntheory.quadratic_residues(p)
Returns the list of quadratic residues.
sympy.ntheory.residue_ntheory.nthroot_mod(a, n, p, all_roots=False)
find the solutions to "x**n = a mod p"
a : integer n : positive integer p : positive integer all_roots : if False returns the smallest root, else the list of roots
sympy.ntheory.residue_ntheory.is_nthpow_residue(a, n, m)
Returns True if "x**n == a (mod m)" has solutions.
Hackman "Elementary Number Theory" (2009), page 76
sympy.ntheory.residue_ntheory.is_quad_residue(a, p)
Returns True if "a" (mod "p") is in the set of squares mod "p", i.e a % p in set([i**2 % p for i in range(p)]). If "p" is an odd prime, an iterative method is used to make the determination:
legendre_symbol, jacobi_symbol
sympy.ntheory.residue_ntheory.legendre_symbol(a, p)
0 if a is multiple of p
1 if a is a quadratic residue of p
-1 otherwise
p should be an odd prime by definition
is_quad_residue, jacobi_symbol
sympy.ntheory.residue_ntheory.jacobi_symbol(m, n)
Returns the product of the legendre_symbol(m, p) for all the prime factors, p, of n.
0 if m cong 0 mod(n)
1 if x**2 cong m mod(n) has a solution
-1 otherwise
The relationship between the jacobi_symbol and legendre_symbol can be demonstrated as follows:
is_quad_residue, legendre_symbol