Probabilistic Method: New Frontiers and Old Challenges

The probabilistic method:

New frontiers and old challenges

Shachar Lovett (UCSD)

ELC Tokyo Complexity Workshop,

March 2013

The probabilistic method

•

Non-constructive way to prove

existence

of

mathematical objects

•

Pioneered by Erdös in combinatorics

•

Numerous applications

 in theoretical

computer science, combinatorics, number

theory, analysis, geometry,…

The probabilistic method

•

Recipe: to prove

existence

 of an object with

certain

prescribed properties

1.

 Define a

distribution

 on the objects

2.

Sample

 an object

3.

 Prove that with

positive probability

 the object

sampled has the

required properties



The required objects must exist !

Example: bad movies

•

How to find a bad movie… pick one at random!

Photo courtesy of

elijahconsulting.com

Example: good codes

•

Binary code: subset C of {0,1}

(codewords)

•

Code is

good

 if there are constants





>0 s.t

–

|C|





(good

rate

–

Two codewords differ in at least



n coordinates

(good

distance

•

Claim: good codes exist (for suitable





•

In fact, they are abundant

Example: good codes

CODEWORD

MINIMAL DISTANCE

AROUND CODEWORD

Example: good codes

Example: good codes

•

Almost all codes are good

•

This, however, tells us very little on how to

find good codes

(of course, we can randomly

sample them and hope for the best)

•

Even more, it tells us nothing on how to build

efficient

algorithms

for encoding, decoding, …

Archetypical setup

•

In most cases, the probabilistic method shows

that not only the required objects

exist

, but in

fact they are

abundant

–

Most codes are good, most sparse graphs are

expanders, most functions are hard to compute,

most matrices are rigid, …

•

Challenge:

find explicit objects

–

Major line of research in TCS, number theory,

geometry, combinatorics, …

Rare objects

•

There are quite a few cases, however, when the standard

application of the probabilistic method fails miserably

•

This is simply because the required objects

may exist

, but

they are certainly

not abundant

•

Looking for few (if any) needles in very large haystacks…

Photo courtesy of

footprinthr.com.au

Rare objects

•

Solutions?

•

Explicit constructions:

 great, when we have them.

This is the ultimate goal. But, in many cases we

don’t have any.

•

More delicate probabilistic techniques:

the focus

of this talk.

–

Potential for new algorithmic tools

Probabilistic techniques for rare objects

1.

Limited independence:

 Lovász local lemma

2.

Discrepancy problems:

 Spencer’s “six

standard deviations suffice” theorem

3.

Regular structures:

 approximation of discrete

processes by continuous processes

Probabilistic techniques for rare objects

1.

Limited independence:

Lovász local lemma

2. Discrepancy problems: Spencer’s “six

standard deviations suffice” theorem

3. Regular structures: approximation of discrete

processes by continuous processes

Lovász local lemma

Lovász-Erdös ‘75, Spencer ’77, Shearer ’85]

•

Powerful tool to analyze events which are

mostly

independent

•

Typical application:

–

,…,E

 “bad” events

–

Each E

depends

on at most d other events

–

Prob[E



 1/ed

(e=2.718…)

Thm: It is possible that

none of the events occur

Lovász local lemma

•

Example: G is d-regular graph on n vertices

•

Color

 vertices with

 {-1,+1}

•

Goal: minimize

absolute sum of neighbors

for

the worst vertex

•

Think of n >> d, say n=exp(d)

•

What is the best we can hope for?

Lovász local lemma

•

d-regular graph, color vertices {-1,+1},

minimize absolute sum of neighbors

Lovász local lemma

•

d-regular graph, color vertices {-1,+1},

minimize absolute sum of neighbors

Lovász local lemma

•

d-regular graph, color vertices {-1,+1},

minimize absolute sum of neighbors

abs sum: 3

Lovász local lemma

•

d-regular graph, color vertices {-1,+1}, minimize

absolute sum of neighbors

•

Naïve bound:

•

Random assignment: w.h.p

also d

 if n>>exp(d)

•

Lovász local lemma: can get

–

That is, there are “rare” colorings which are

much

better

 than typical colorings

•

Beck-Fiala conjecture (special case): can get

Lovász local lemma

Lovász local lemma

[Moser ‘08, Moser-Tardos ’09, Kolipaka-Szegedy’ 11]

•

Constructive version:

find a feasible solution

 where

none of the bad events holds

•

Can be done!

•

Algorithm

 (for our example): Start with a

random

coloring

; If a vertex has too large abs sum of neighbors,

resample his neighbors

. Repeat.

•

Amazingly, converges to feasible solution in poly-time!!!

Probabilistic techniques for rare objects

1. Limited independence: Lovász local lemma

2.

Discrepancy problems:

Spencer’s “six

standard deviations suffice” theorem

3. Regular structures: approximation of discrete

processes by continuous processes

Discrepancy problems

•

Elements: 1,2,….,n

•

Sets: S

,…,S



 {1,…,n}

•

Goal:

Color

 each element {-1,+1} to

minimize

absolute sum (discrepancy)

 in worst case set

•

For example, the graph problem we saw before is

an instance of discrepancy problems

–

Sets = neighbors of vertices

•

A general framework

Discrepancy in geometry

Discrepancy in number theory

•

Example:

–

Color {1,…,n} so that each

arithmetic progression

 (e.g.

{a,a+d,a+2d,…,a+kd}

) is balanced

•

Random coloring:  discrepancy

•

True answer:

[Roth’64, Matoušek-Spencer ’96]

•

Conjecture of Erdös: for

homogeneous arithmetic

progressions

(e.g.

{0,d,2d,…,kd}

) the answer is

O(1)

–

Best known is

O(log n)

Discrepancy problems

•

What is the best bound if no prior knowledge

is assumed on the structure of the sets?

•

[Spencer ’85]: for

any family of n elements

and m sets

there exists a coloring with

discrepancy

–

Beats the                          for random colorings

–

For m=n gets

–

Previously mentioned results use the same proof

technique + domain specific knowledge

Algorithmic discrepancy theory

[Bansal ‘10, L-Meka’12]

•

Constructive version:

find a coloring

 with good

discrepancy

•

Can be done!

•

Bansal algorithm: SDP relaxation + clever rounding

•

L-Meka algorithm: Random walk on fractional colorings

Some applications of algorithmic

discrepancy theory

•

Phase transition for

random integer programs

[Chandrasekaran-Vempala’12]

•

Improved

approximation algorithm

 for bin-

packing [Rothvoss’13]

•

Codes

 with bounded peak-to-mean envelope

[Work in progress with Kotzer, Litsyn]

Open problems in discrepancy

Probabilistic techniques for rare objects

1. Limited independence: Lovász local lemma

2. Discrepancy problems: Spencer’s “six

standard deviations suffice” theorem

3.

Regular structures:

approximation of discrete

processes by continuous processes

Regular structures

•

Combinatorial structures with

regular

symmetric structure

•

Examples:

–

Regular graphs

–

Designs

–

K-wise permutations

Regular graphs

•

Graph is

d-regular

 if all vertices have degree d

•

Easy to construct, many examples

•

Not so easy to

count

•

E.g. what is the

probability that a random graph

is d-regular

Designs

•

Design = regular hypergraph

•

t-(n,k,



) design: k-uniform hypergraph on n

vertices, where each t elements are in exactly



 edges

•

d-regular graph: 1-(n,2,d) design

Designs

•

t-(n,k,



) design: k-uniform hypergraph, where each t

elements are in exactly



 edges

•

Not so easy to construct anymore

•

Algebraic constructions

 for small parameters

–

e.g. 5-(24,8,1) design

•

[Teirlinck’87]

Recursive combinatorial construction

for

all t; k=t+1; and n,



 satisfying some modular conditions

–

Few other asymptotic constructions for sparse parameters

Designs

•

t-(n,k,



) design: k-uniform hypergraph, where each t

elements are in exactly



 edges

•

Existence unknown for almost all parameters

•

Open problem:

Steiner systems

–

t-(n,k,1) designs

–

Known for t



•

Open problem:

Hadamard matrices

–

2-(4n-1,2n-1,n-1) designs

–

Known in special cases, e.g. n=2

K-wise permutations

K-wise permutations

Common scenarios

•

Very regular objects

•

Explicit constructions

known in

few cases

, if

any

•

When constructions are unknown, so is

existence…

Why not use the probabilistic method?

•

If we don’t know how to construct regular

objects explicitly, why not

prove existence by

the probabilistic method

•

Lets take as a case study the

problem of d-

regular graphs

 (and forget for a minute that

we know how to construct them explicitly)

Probabilistically constructing

d-regular graphs

Probabilistically constructing

d-regular graphs

Probabilistically constructing

d-regular graphs

Probabilistically constructing

regular structures

•

[Kuperberg-L-Peled’12]

General technique

for

approximation discrete distributions by

Gaussian distributions

•

Allows to

prove existence

(and count)

–

Designs

 for (nearly) all underlying parameters

–

K-wise permutations

•

Main tools: Fourier analysis, coding theory

Open problems is regular objects

•

More applications

–

q-analogs (vector field analog of designs) [Fazeli, Vardy]

–

Conjectures in group theory

•

Make it algorithmic

–

Even special cases will be very interesting, e.g. finding

4-wise permutations

–

Given the state of affairs for the Lovász local lemma

and Spencer’s theorem, we are hopeful…

Facing the future

•

Many potential applications for techniques which

prove existence of

rare combinatorial structures

•

In principal, might give

efficient algorithms

to

problems with

exponential solution space

•

Some examples:

–

Hadamard matrices and other “tight” structures

–

Graph problems with disallowed minors

–

Local codes

Tight structures

Graphs with disallowed minors

•

Probabilistic techniques are widely used in

graph theory, but they don’t always work

•

Example:

Zarankiewicz problem

–

How many edges can a bipartite (n,n) graph have

without having a complete K

k,k

 sub-graph?

–

Only constructions which match lower bounds are

algebraic, for k=1,2,3

–

Probabilistic techniques all fail (so far)

Local codes

•

Error correcting codes allow to detect and

correct errors

•

Typically require the receiver to

read the

entire codeword

•

A code is

local

 if some operations can be done

locally, while only reading a small part of the

received codeword

Local codes

•

Many variants

•

Locally testable codes

 are at the combinatorial

core of many PCP theorems

•

Locally decodable codes

 are related to Private

Information Retrievel (PIR) schemes

•

Locally correctable codes

 are related to matrix

rigidity and arithmetic circuits lower bounds

Local codes

•

Many constructions: combinatorial and

algebraic

•

Large gaps

 between lower and upper bounds

•

No probabilistic constructions

 (so far) – can

potentially dramatically improve upper bounds

Summary

•

The probabilistic method is a

powerful tool

with

applications in many fields

•

Mathematical extensions

 of the probabilistic

method can show existence of very interesting

objects

•

Algorithmic analogs

 can yield new types of

algorithms, which find “rare” solutions efficiently

THANK YOU!

Photo courtesy of

personal.vu.nl/h.c.tijms

Rare events…

Slide Note

Embed Share

Download

This content delves into the probabilistic method, highlighting its significance in tackling new frontiers and overcoming old challenges. Shachar Lovett, from UCSD, shares insights at the ELC Tokyo Complexity Workshop in March 2013, shedding light on advancements and persistent obstacles in this area of study. Exploring the intersection of theory and applications, this session is a valuable resource for researchers and enthusiasts looking to understand and navigate the intricate realm of probabilistic reasoning.

karl_eda Follow

Uploaded on Mar 05, 2025 | 0 Views

Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

Download Presentation

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript

The probabilistic method: New frontiers and old challenges Shachar Lovett (UCSD) ELC Tokyo Complexity Workshop, March 2013

The probabilistic method Non-constructive way to prove existence of mathematical objects Pioneered by Erd s in combinatorics Numerous applications in theoretical computer science, combinatorics, number theory, analysis, geometry,

The probabilistic method Recipe: to prove existence of an object with certain prescribed properties, 1. Define a distribution on the objects 2. Sample an object 3. Prove that with positive probability the object sampled has the required properties The required objects must exist !

Example: bad movies How to find a bad movie pick one at random! Photo courtesy of elijahconsulting.com

Example: good codes Binary code: subset C of {0,1}n (codewords) Code is good if there are constants , >0 s.t |C| 2 n (good rate) Two codewords differ in at least n coordinates (good distance) Claim: good codes exist (for suitable , ) In fact, they are abundant

Example: good codes CODEWORD MINIMAL DISTANCE AROUND CODEWORD

Example: good codes Good code: C {0,1}n, |C|=2 n, min dist n Distribution: uniform over subsets of size 2 n Probability that the code contains two elements of hamming distance n is approximately ? 2?? 2 ??2 ? Exponentially small if , are small enough

Example: good codes Almost all codes are good This, however, tells us very little on how to find good codes (of course, we can randomly sample them and hope for the best) Even more, it tells us nothing on how to build efficient algorithms for encoding, decoding,

Archetypical setup In most cases, the probabilistic method shows that not only the required objects exist, but in fact they are abundant Most codes are good, most sparse graphs are expanders, most functions are hard to compute, most matrices are rigid, Challenge: find explicit objects Major line of research in TCS, number theory, geometry, combinatorics,

Rare objects There are quite a few cases, however, when the standard application of the probabilistic method fails miserably This is simply because the required objects may exist, but they are certainly not abundant Looking for few (if any) needles in very large haystacks Photo courtesy of footprinthr.com.au

Rare objects Solutions? Explicit constructions: great, when we have them. This is the ultimate goal. But, in many cases we don t have any. More delicate probabilistic techniques: the focus of this talk. Potential for new algorithmic tools

Probabilistic techniques for rare objects 1. Limited independence: Lov sz local lemma 2. Discrepancy problems: Spencer s six standard deviations suffice theorem 3. Regular structures: approximation of discrete processes by continuous processes

Probabilistic techniques for rare objects 1. Limited independence: Lov sz local lemma 2. Discrepancy problems: Spencer s six standard deviations suffice theorem 3. Regular structures: approximation of discrete processes by continuous processes

Lovsz local lemma [Lov sz-Erd s 75, Spencer 77, Shearer 85] Powerful tool to analyze events which are mostly independent Typical application: E1, ,En bad events Each Ei depends on at most d other events Prob[Ei] 1/ed (e=2.718 ) Thm: It is possible that none of the events occur

Lovsz local lemma Example: G is d-regular graph on n vertices Color vertices with {-1,+1} Goal: minimize absolute sum of neighbors for the worst vertex Think of n >> d, say n=exp(d) What is the best we can hope for?

Lovsz local lemma d-regular graph, color vertices {-1,+1}, minimize absolute sum of neighbors

Lovsz local lemma d-regular graph, color vertices {-1,+1}, minimize absolute sum of neighbors - + + - - - + + + + - +

Lovsz local lemma d-regular graph, color vertices {-1,+1}, minimize absolute sum of neighbors - abs sum: 3 + + - - - + + + + - +

Lovsz local lemma d-regular graph, color vertices {-1,+1}, minimize absolute sum of neighbors Na ve bound: d Random assignment: w.h.p also d if n>>exp(d) ( log( )) O d d Lov sz local lemma: can get That is, there are rare colorings which are much better than typical colorings Beck-Fiala conjecture (special case): can get ( ) O d

Lovsz local lemma d-regular graph, color vertices {-1,+1}, minimize absolute sum of neighbors Theorem: exist coloring with abs sum ? ? ??? ? Proof: Consider a random coloring Bad event Ei: Vertex i has abs neighbors sum ? ? ??? ? Graph has degree d Pr[Ei] O(1/d2) Each event depends on d2 other events LLL: There exists a coloring where no Ei occurs

Lovsz local lemma [Moser 08, Moser-Tardos 09, Kolipaka-Szegedy 11] Constructive version: find a feasible solution where none of the bad events holds Can be done! Algorithm (for our example): Start with a random coloring; If a vertex has too large abs sum of neighbors, resample his neighbors. Repeat. Amazingly, converges to feasible solution in poly-time!!!

Probabilistic techniques for rare objects 1. Limited independence: Lov sz local lemma 2. Discrepancy problems: Spencer s six standard deviations suffice theorem 3. Regular structures: approximation of discrete processes by continuous processes

Discrepancy problems Elements: 1,2, .,n Sets: S1, ,Sm {1, ,n} Goal: Color each element {-1,+1} to minimize absolute sum (discrepancy) in worst case set For example, the graph problem we saw before is an instance of discrepancy problems Sets = neighbors of vertices A general framework

Discrepancy in geometry Example: n points in Rd (think of d as constant) Color them so that each half-space is balanced Random coloring: discrepancy O True answer: ??? ? log ? 12 1 2? [Alexander 90, Matou ek 95]

Discrepancy in number theory Example: Color {1, ,n} so that each arithmetic progression (e.g. {a,a+d,a+2d, ,a+kd}) is balanced Random coloring: discrepancy True answer: O n ( log( )) O n n 1/4 ( ) [Roth 64, Matou ek-Spencer 96] Conjecture of Erd s: for homogeneous arithmetic progressions (e.g. {0,d,2d, ,kd}) the answer is O(1) Best known is O(log n)

Discrepancy problems What is the best bound if no prior knowledge is assumed on the structure of the sets? [Spencer 85]: for any family of n elements and m sets there exists a coloring with discrepancy Beats the for random colorings For m=n gets Previously mentioned results use the same proof technique + domain specific knowledge ( log( log( )) 6 n / )) O ( n m n O n m

Algorithmic discrepancy theory [Bansal 10, L-Meka 12] Constructive version: find a coloring with good discrepancy Can be done! Bansal algorithm: SDP relaxation + clever rounding L-Meka algorithm: Random walk on fractional colorings

Some applications of algorithmic discrepancy theory Phase transition for random integer programs [Chandrasekaran-Vempala 12] Improved approximation algorithm for bin- packing [Rothvoss 13] Codes with bounded peak-to-mean envelope [Work in progress with Kotzer, Litsyn]

Open problems in discrepancy Beck-Fiala theorem: if any element belongs to d sets, then the discrepancy is < 2d Beck-Fiala conjecture: true answer is ? Best non-constructive bound: ? Best constructive bound: ? Special case of yet another conjecture (Komlos conjecture) ? ?log?[Banaszczyk 97] ? log? Constructive versions of the domain specific applications of Spencer s theorem

Probabilistic techniques for rare objects 1. Limited independence: Lov sz local lemma 2. Discrepancy problems: Spencer s six standard deviations suffice theorem 3. Regular structures: approximation of discrete processes by continuous processes

Regular structures Combinatorial structures with regular symmetric structure Examples: Regular graphs Designs K-wise permutations

Regular graphs Graph is d-regular if all vertices have degree d Easy to construct, many examples Not so easy to count E.g. what is the probability that a random graph is d-regular?

Designs Design = regular hypergraph t-(n,k, ) design: k-uniform hypergraph on n vertices, where each t elements are in exactly edges d-regular graph: 1-(n,2,d) design

Designs t-(n,k, ) design: k-uniform hypergraph, where each t elements are in exactly edges Not so easy to construct anymore Algebraic constructions for small parameters e.g. 5-(24,8,1) design [Teirlinck 87] Recursive combinatorial construction for all t; k=t+1; and n, satisfying some modular conditions Few other asymptotic constructions for sparse parameters

Designs t-(n,k, ) design: k-uniform hypergraph, where each t elements are in exactly edges Existence unknown for almost all parameters Open problem: Steiner systems t-(n,k,1) designs Known for t 5 Open problem: Hadamard matrices 2-(4n-1,2n-1,n-1) designs Known in special cases, e.g. n=2m

K-wise permutations Sn = group of permutations on n elements Family F Sn is k-wise independent if its action on k-tuples is uniform, that is if ? ? ! ?! ? ?:? ?1 = ?1, ,? ?? = ?? = |?| for all distinct i1, ,ik and j1, ,jk

K-wise permutations Trivial solution: all permutations (e.g. n!) Explicit algebraic constructions of polynomial size (e.g ?? ?) are known only for k=1,2,3 Based on group symmetries Provably fail for k>3 and n>24 Explicit combinatorial constructions of exponential size (e.g ?? ?) [Finucane-Peled-Yaari 12]

Common scenarios Very regular objects Explicit constructions known in few cases, if any When constructions are unknown, so is existence

Why not use the probabilistic method? If we don t know how to construct regular objects explicitly, why not prove existence by the probabilistic method? Lets take as a case study the problem of d- regular graphs (and forget for a minute that we know how to construct them explicitly)

Probabilistically constructing d-regular graphs G random graph on n vertices, each edge is independently chosen with probability ? = ? ? 1 Expected degree of a vertex = d What is the probability that all vertices have exactly degree d? Main problem: correlation

Probabilistically constructing d-regular graphs G random graph on n vertices, each edge is independently chosen with probability ? = ? ? 1 Random variables X1, ,Xn: Xi = degree of vertex i They are all dependent! Eliminates method which apply limited dependence, like the Lov sz local lemma

Probabilistically constructing d-regular graphs G random graph on n vertices, each edge is independently chosen with probability ? = ? ? 1 [Mckay-Wormald 90] Joint distribution of degrees (which is discrete) can be approximated by a Gaussian distribution (if d is large enough) Allows to approximately count d-regular graphs (up to 1+o(1) terms, for large enough d)

Probabilistically constructing regular structures [Kuperberg-L-Peled 12] General technique for approximation discrete distributions by Gaussian distributions Allows to prove existence (and count) Designs for (nearly) all underlying parameters K-wise permutations Main tools: Fourier analysis, coding theory

Open problems is regular objects More applications q-analogs (vector field analog of designs) [Fazeli, Vardy] Conjectures in group theory Make it algorithmic Even special cases will be very interesting, e.g. finding 4-wise permutations Given the state of affairs for the Lov sz local lemma and Spencer s theorem, we are hopeful

Facing the future Many potential applications for techniques which prove existence of rare combinatorial structures In principal, might give efficient algorithms to problems with exponential solution space Some examples: Hadamard matrices and other tight structures Graph problems with disallowed minors Local codes

Tight structures Some of the famous open problems in combinatorics are on tight structures Example: Hadamard matrices Orthogonal ? ? matrices with {-1,1} entries Can only exist if n=4m Equivalent to 2-(4m-1,2m-1,m-1) designs Several algebraic constructions Conjecture: exist for any n=4m Why not use probabilistic techniques to prove existence?

Graphs with disallowed minors Probabilistic techniques are widely used in graph theory, but they don t always work Example: Zarankiewicz problem How many edges can a bipartite (n,n) graph have without having a complete Kk,k sub-graph? Only constructions which match lower bounds are algebraic, for k=1,2,3 Probabilistic techniques all fail (so far)

Local codes Error correcting codes allow to detect and correct errors Typically require the receiver to read the entire codeword A code is local if some operations can be done locally, while only reading a small part of the received codeword

Local codes Many variants Locally testable codes are at the combinatorial core of many PCP theorems Locally decodable codes are related to Private Information Retrievel (PIR) schemes Locally correctable codes are related to matrix rigidity and arithmetic circuits lower bounds

Local codes Many constructions: combinatorial and algebraic Large gaps between lower and upper bounds No probabilistic constructions (so far) can potentially dramatically improve upper bounds

Probabilistic Method: New Frontiers and Old Challenges

Download Presentation

Presentation Transcript

Related

More Related Content