First, I'll answer your questions below, but let's get this out of the way first:
You still haven't even addressed the FACT that the Wells people doubted Anderson and then flat out stated that he probably used the gauge that he said, to his best recollection, he did not use.
THEY LIED.
The Wells people are lying. Don't you get it?
As for the rest, you are wrong about #2. Therefore you are wrong about #1 and #3. In the Appendix, the Exponent people discussed 3 things about the Colts balls that made the comparisons difficult if not impossible. One, the balls showed weird readings with the gauges that they couldn't figure out. Specifically, ball #3 showed a much higher reading on Blakeman's device than on Prioleau's device, even though Blakeman's device was .4 under Prioleau's on all 3 other balls. No one could explain this, so they decided to discard that ball from the statistical analysis. Effectively, they had 12 Patriots balls with readings and 3 Colts balls. But, as some Patriots balls barely went down in pressure (.2 PSI), the report assumes that some balls were used in the game in cold wet conditions while others remained in the bag. In other words, what would be the ratings of Colts balls if all balls were tested. That's two factors that makes this comparison difficult. Third factor: why did they only test 4 Colts balls? They ran out of time before kickoff. They tested the Colts balls last. The Wells report even has this timed. The were tested after being in the shower room for 10 minutes. A Physicist Martin Schmaltz from BU wrote that this would be enough time for them to achieve equilibrium. It may also be enough time for some of the Patriots balls to achieve equilibrium.
Here are the relevant pages, BURIED in the Appendix:
LOL, a torrential downpour!