<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Some stuff &#187; sequences</title>
	<atom:link href="http://blog.yhuang.org/?feed=rss2&#038;tag=sequences" rel="self" type="application/rss+xml" />
	<link>https://blog.yhuang.org</link>
	<description>here.</description>
	<lastBuildDate>Wed, 27 Aug 2025 08:50:58 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.1</generator>
		<item>
		<title>bounding overlaps</title>
		<link>https://blog.yhuang.org/?p=983</link>
		<comments>https://blog.yhuang.org/?p=983#comments</comments>
		<pubDate>Mon, 21 Jan 2013 06:35:45 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[marginal probability]]></category>
		<category><![CDATA[masks]]></category>
		<category><![CDATA[math]]></category>
		<category><![CDATA[normalizing]]></category>
		<category><![CDATA[probabilities]]></category>
		<category><![CDATA[schematic view]]></category>
		<category><![CDATA[sequences]]></category>
		<category><![CDATA[singleton]]></category>
		<category><![CDATA[universal constraints]]></category>
		<category><![CDATA[vector]]></category>

		<guid isPermaLink="false">http://scripts.mit.edu/~zong/wpress/?p=983</guid>
		<description><![CDATA[A Venn diagram gives a schematic view of joint counts on a set of categories, e.g. where . Each &#8220;patch&#8221; of the diagram corresponds to one of possible values of . If we have the total count , then we can take the counts as probabilities by normalizing with . Suppose we are given only [...]]]></description>
			<content:encoded><![CDATA[<p>A Venn diagram gives a schematic view of joint counts on a set of \(n\) categories, e.g. \(c(S_1^n=s_1^n)\) where \(s_i\in\{0,1\}\). Each &#8220;patch&#8221; of the diagram corresponds to one of \(2^n\) possible values of \(s_1^n\).</p>
<p>If we have the total count \(C\triangleq \sum_{s_j: j\in \{1,&#8230;,n\}} c(S_1^n=s_1^n)\), then we can take the counts as probabilities by normalizing with \(p(S_1^n=s_1^n)=c(S_1^n=s_1^n)/C\).</p>
<p>Suppose we are given only singleton marginals \(p(S_i=1)\triangleq \sum_{s_j: j\in \{1,&#8230;,n\}\backslash i} p(S_1^n=s_1^n)\), then we can bound the other probabilities by imposing universal constraints on probabilities to be between 0 and 1.<br />
<span id="more-983"></span><br />
To get the bounds, we solve a linear programming problem. The standard form is \(\min_x f&#8217;x \ \ \mathrm{s.t.}\  Ax \leq b\). Here, \(f\) is a length-\(2^n\) vector that indicates the particular marginal probability we want to bound, by selecting among the \(s_1^n\)-sequences. \(A\) is \([I_{2^n\times 2^n}; B; \mathbf{1}_{1\times 2^n}]\) where the columns correspond to all possible \(s_1^n\)-sequences, and where the rows of \(B\) are the \(n\) half-1 masks indicating the \(n\) singleton marginals. Finally, let \(u=[\mathbf{1}_{2^n\times1}; p(S_1=s_1);...;p(S_n=s_n); 1]\) and \(l=[\mathbf{0}_{2^n\times1}; p(S_1=s_1);...;p(S_n=s_n); 1]\).</p>
<p>Hence to get the lower bound on the desired marginal, we solve \(\min_x f&#8217; x \ \ \mathrm{s.t.}\  A x \leq u\ \mathrm{and}\ -A x \leq -l\). Likewise, for the upper bound, \(-\min_x -f&#8217; x \ \ \mathrm{s.t.}\  A x \leq u\ \mathrm{and}\ -A x \leq -l\).</p>
<p>For example, \(n=2\), \(p(S_1=1)=0.6\), \(p(S_2=1)=0.5\), then we can bound \(p(S_1=1,S_2=1)\) by \(0.1\le p(S_1=1,S_2=1)\le 0.5\).</p>
<p>Of course, we can add additional constraints when we know them. In some cases with categories, we have that \(p(S_1^n=0)=0\), that is, everything must belong to at least one category. Then it would appear the doublet marginal \(p(S_i=1,S_j=1)\) satisfies<br />
\(\max \{0, p(S_i=1)+p(S_j=1)-1\}\le p(S_i=1,S_j=1)\)<br />
\(\le \min \{(\sum_{i=1}^n p(S_i=1))-1, p(S_i=1), p(S_j=1)\}\).</p>
]]></content:encoded>
			<wfw:commentRss>https://blog.yhuang.org/?feed=rss2&#038;p=983</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
