Bernardo Subercaseaux | Mathematics, Philosophy, Not-poems, and learning How To Surf from a book

For a better formatted version of this blog-post, please go to https://chileantheoryguy.substack.com/p/mathematics-philosophy-not-poems?utm_source=post-email-title&publication_id=1265277&post_id=93081777&isFreemail=false&triedRedirect=true.

What can be as silly as learning how to surf from a book? (No offense to Kenneth Martin, whose book I haven’t read.)

Description of image — **Figure 1:** A book about learning how to surf.

In this post, I’ll try to convince the reader that a valid answer might be: “learning how to Math from a book!”.

Thanks for reading Bernardo’s Substack! Subscribe for free to receive new posts and support my work.

Intentionally, I’m using Math as a verb in the same way one uses Surf both as a verb and for the sport itself, and I think this distinction is pretty darn important.

I have never surfed, but I’m inclined to think that reading books about surfing can be instructive in a particular sense; you can read about Surf, about its different elements, about what kind of weather is ideal for surfing, the wood that makes the best boards, etc. But you’re not exactly learning how to Surf; at most you could get some ideas about things to have in mind once you go to the beach, which is when the “learning how to Surf” would happen. I think most people would agree with me on this, but they would argue back that Math or Philosophy are different, and those subjects you can indeed learn from a book. When asked “what is the difference?”, I’m inclined to think they would answer that it’s because Surfing is a physical activity, that you need to learn with your body, whereas Math or Philosophy are brainy stuff and thus a book going through your brain is enough to learn them. I think this is just wrong, and that learning how to Math from a book is as silly as learning how to Surf from a book!

To clarify right away, I think Math books are extremely important and helpful, but in a similar way that a surfing book can be important and helpful; I believe one could get a lot of out a surfing book if you’re going to the beach often, and contrasting your hands-on experience with the book, using it as a guide to making your hands-on in-water learning experience more effective, but never as a replacement for it. For Math, I believe one should not “read books”, but rather have them as companions for Mathing. In fact, I think the fact that one can just go to a coffee shop with a Math book and a notebook and Math away is an amazing gift of life; you can Math accompanied by the great giants of the present and the past, and there’s not even the need to stand on their shoulds as Newton famously did. Just Mathing side by side with them, behind them even, is an amazing opportunity.

The story however is far from over. In the remaining of this post I’ll ambitiously attempt the following:

Present more support to these previous claims, relating them with philosophy and poetry.
Dive into the pragmatics of how love for Math can get complicated, and what has worked for me as a way to recover the flame in our relationship, that has definitely been hurt multiple times across the years.
Frame things in a way that might be helpful for establishing connections with other aspects of behavioral pragmatics that transfer across disciplines.

Immanuel Kant, my favorite philosopher of times, is famously quoted to have said something along the lines of

“One should not learn Philosophy, but rather how to Philosophyze.” – Immanuel Kant (Freely quoted).

A concrete citation is the following, from his lecture note on educational aspects of Philosophy.

“One can thus learn philosophy, without being able to philosophize. Thus whoever properly wants to become a philosopher: he must make a free use of his reason, and not merely an imitative, so to speak, mechanical use. […] How can one learn philosophy? One either derives philosophical cognitions from the first sources of their production, i.e., from the principles of reason; or one learns them from those who have philosophized. The easiest way is the latter. But that is not properly philosophy. Suppose there were a true philosophy, [if] one learned it, then one would still have only a historical cognition. A philosopher must be able to philosophize, and for that one must not learn philosophy; otherwise one can judge nothing. […] One can make a distinction between the two expressions, to learn philosophy and to learn to philosophize. To learn is to imitate the judgments of others, hence is quite distinct from one’s own reflection.” (Lectures on Pedagogy, pulled from Manchester University)

This citation really captures most of the issue for me. But there’s still a lot left; in particular, what does the difference between learning Philosophy and learning to Philosophyze look like? What shall we do in practice the next time we sit at a Café?

I can’t resist pulling out another reference, this time from the great Mexican poet José Emilio Pacheco. Decades ago, he wrote this beautiful not-poem that captures part of the same idea, but this time in relation to poetry. I have now the pleasure of introducing you, dear reader, to my English translation of this brilliant not-poem, that as far as I am aware is not available in English anywhere on the internet.

In defense of Anonimity. A letter to George B. Moore to deny him an interview. José Emilio Pacheco (1939-2014). I don’t know why we write, dear George, and sometimes I wonder why later on we publish what we have written. In other words, we throw a bottle to the sea that is full of garbage and bottles with messages. We will never know to whom nor where will it be carried by the tides. Most likely, it will succumb in the storm and the abyss, in the bottom sand, that is death. And nonetheless, this act of a castaway is never useless. Because on a Sunday you call me from Estes Park, Colorado. You tell me that you have read what’s inside the bottle (across the seas: our two different languages) and you want to interview me. How to explain to you that I have never given an interview? That my ambition is to be read, not to be “well known”? That is the text that matters and not the author of the text? That I disapprove of the literary circus? I then receive your enormous telegram (how much must you have spent, dear friend, to send it) I cannot answer nor remain silent. And these verses come up to me. It is not a poem. It does not aspire to the privilege of poetry (it is not voluntary). And I will use, as the ancients did, the verse as an instrument for all of that (tale, letter, treatise, drama, story, agriculture manual) that today we say in prose. To begin not-answering you I’ll say: I have nothing to add to what is there in my poems, I am not interested in commenting on them, I am not worried about my place in “history” (if I have any). I write and that is all. I write: I give half of the poem. Poetry is not black signs on the white page. I call poetry to that place where of encounter with foreign experience. The reader will make (or not) the poem that I’ve only sketched. We don’t read others: we read ourselves in them. I find it a miracle that someone I don’t know can look at themself in my mirror. If there is a merit in this —said Pessoa— it belongs to the verses, not to the author of the verses. If by any chance one is a great poet, one will leave behind three or four valid poems, sorrounded by failures and drafts. One’s personal opinions are truly of little interest. Weird world we live in: the interest in poets is every day a bit bigger, and the interest in poems every day a bit smaller. The poet stopped being the voice of the tribe, the one who speaks out for those who don’t. The poet has become another entertainer. Their drunkenness, their sexual scandals, their clinical history, their alliances and beefs with the other clowns of the circus, or the trapeze artist or the elephant tamer. They have assured a wide audience who now does not need to read their poems. I keep thinking that poetry is something else: a form of love that exists only in silence, in a secret pact between two personhoods, between two that almost always are strangers to one another. By any chance did you read that Juan Ramón Jiménez, thought half a century ago about editing a poetry magazine that was going to be called Anonimity. Anonimity would publish poems, not signatures; it will be made out of text and not out of authors. And I wish, as the Spaniard poet wished, that poetry was anonymous, since it is collective (to that I aim my verses and my versions). There is a chance that you will agree with me. You, that have read me and do not know me. We will never meet, and nonetheless we are friends. If you have enjoyed my verses, What does it matter that they’re mine / from another / from noone? The truth is, the poems you’ve read are yours: You, their author, who invents them when reading them.

Wow. Just wow. Amazing isn’t it? If there is ever a contest for the coolest ways of denying an interview, I believe this should be enough to take the first three places alone.

This not-poem (out of respect to JEP’s will) transcends far beyond my point in this essay but meets it somewhere along the way: poetry beyond reading the work of another, but rather of working it out by the practice of reading it. Perhaps Pacheco’s point is partly that the important thing once again is not poetry, but Poeting, which is halfway done by the writer, halfway done by the reader who poets the verses out one by one whilst reading them. Once humanity is extinguished (if ever) there will be no poetry, merely black signs printed on white pages.

So how does one Poet? how does one Math? how does one Philosophyze? I’ll try to sketch a couple of ideas that have been personally important to me.

Also, this is probably a good time for you, dear reader, to take a short break before keep reading.

Part II: Pragmatics that you probably already know but are good to remember

Getting the game

The idea of “getting the game” has probably been the most important idea I’ve learned in my lifetime. It’s probably pretty obvious in hindsight, but some of us need the extra help.

First, in this framework, I think of many things as games. Math as a game. Poetry as a game. Philosophy as a game. Music as a game. Dentistry as a game. Dating as a game. Living life as a game. The semantics of this should become clearer as we go. These games are composed of sub-games, for example, Math can be a sub-game of the game of life, and your Calculus class a sub-game of Math. The idea now is that I want to do well at a game, and enjoy playing it. Some games I choose to play, and some games I’m forced to play by external forces. So how do you do well at a game, and enjoy playing it? Getting the game is always the first step.

Getting the game means developing an understanding of why other people have liked this game in the past, and why they have found it useful or interesting. It means developing an understanding of the mechanics of the game, the winning conditions, and of the prizes at stake. It means developing some honest respect for the game and the good players and the good moves.

Let’s take dentistry as an example. Here’s how I started really enjoying going to the dentist. I once talked to a dentistry student I met in Chile, and they were really passionate about dentistry. Weird, right? So it made me curious, what the #%@& do you like about dentistry? We talked a bit about it and it got me curious. After a bit of thought, I guess it’s actually pretty cool that our bodies have teeth, these marble-looking things in our mouths that serve as the first interface between food and our body. Isn’t it kinda crazy that our DNA encodes the fabrication of these pieces, and that it actually includes doing it twice; baby teeth and permanent teeth? And they’re so delicate and in constant interaction with external bodies that they require a lot of additional care, more so than other parts of our body. That they’re far from uniform, with different teeth serving different purposes and their shape and structure are accordingly different. Isn’t that kind of freaking cool? If you had sat me down in an abstract world to design the way large animals would get their nutrients, I don’t think I’d have ever come up with such an amazing solution. So how did dentistry originate? what were the cornerstones of its development into its modern form? What are the most important open problems in dentistry? What cool questions about teeth are there that we don’t have answers for? This is what I mean by start getting the game of dentistry. Once you get a bit more of it, it’s pretty cool, and then once a year you get to visit someone who works with teeth full-time and gets to look at yours, your particular set of teeth, what is wrong with them? what is good about them? am I gonna lose them? what’s the right way to take care of them? All of these start being cool questions once you’ve got more of the game. Sure, the procedure might still hurt, but now it’s a painful part of a game you get. It makes it much better for me. I really recommend this animated video (in French, but you can use auto-translate), probably meant for kids, about what the heck is happening inside your mouth.

Now let’s think about chess. It might seem like a boring nerdy game to a lot of people, but oh boy it’s beautiful once you get the game. Once you start getting the ideas behind gambits, behind openings, behind castling and king safety, once you start getting a good grasp of what the pieces are worth, and all the fascinating sub-games inside of chess. Once you get more of the game, not even being good at it, you can watch a video of Magnus Carlsen playing, and oh boy isn’t that beautiful? It’s a pure display of elegance and mastery in a way beyond the dreams of the beginner. And even though I assume a ton of the deep things going on are flying right over my head, the tiny superficial portion that a noob like me is able to appreciate is already mind-blowing; some moves are so freaking cool they make you want to stand up and clap.

I was 14 when I started getting the game of academics. Before that, I was a really bad student. I had bad grades and bad behavior at school. So much so that I was left in “conditional” state at the second school I transferred to, the Chilean term for when you’re given a last warning before getting kicked out (which would have really limited my options of going to a good college). But the next school year something changed.

We had a new math teacher, and he started the year with trigonometry. It’s hard to find something that sounds worse to a student that’s at risk of getting kicked out of school at 14 than trigonometry. But he started the class talking about this Greek guy, Hipparchus, much before Christ, trying to understand what was going on in the skies; where was the moon gonna go next, where was the sun gonna go next, how could you locate back your homeland if you’re lost in the sea and all you have is the stars as a guide. Álvaro Sanchez, my new math teacher, really made me think about this dude, perhaps sitting on a boat in the night, looking at the stars and thinking about whether he could figure things out, whether he could understand how the brilliant corpses in the distant sky work, and act upon it, to orientate oneself, to predict the tides, etc. I saw something in there, and I’m grateful to this day to Álvaro Sanchez for gifting me that moment. For the first time, I realized that there was, in the dreadful subject of Maths, a game that Hipparchus had engaged in, and for the first time, I had the feeling that I could also, perhaps, borrow the joystick for a second and play. Everything changed for me there. I started seeing the manipulation of trigonometric equations in a similar fashion I see a chess tactic now, or a punching combo in Street Fighter.

Since then, whenever I try to learn something new, I try to get the game first. Even if I am forced to do or learn something, even if it’s not appealing to me personally, I try to get why others have cared, and what others have been captivated by inside the subject. I’ve also learned that playing a game you don’t get is sometimes the only way to get the game, and there will be more on this particular point later on, in a chewing vs swallowing sub-section. But it’s always important for me to remember that excitement is not zero-sum; you can get more excited about more things all the time, as a conscious decision, and it seems to only make things better.

When there are no friends in sight, look for the enemy

This one has also been huge for me, it is about the importance of identifying roadblocks. Whenever there’s something I want to do, let’s say a theorem I want to prove, there are basically two cases:

I have a pretty clear idea to try.
I have no idea about what to try next.

When on case 1., there’s not too much to think through; try the idea, if it works, champagne, if it doesn’t, repeat until case 2.

So the main issue is what to do on case 2; when there are “no friends in sight” (i.e., clear ideas of what to do next). This perhaps obvious technique is that then the next thing is to identify what the “enemy” is. If the theorem is not-trivial, then there must be a reason why it is not trivial; some obstacle along the way. If the theorem is of the form “All X’s are Y”, then look for what the obstacle to Y is. If everything is a Y and there are no obstructions, your theorem is free. So there must be something preventing some things from being Y; what are those? Can some X fall under that obstacle?

The point here is that at all points there should be a concrete enemy preventing you from accomplishing the task. This doesn’t mean that defeating the enemy will be easy, but there should be an enemy. Okay, but once you know the enemy, how do you defeat it? Here’s the trick; with the same recursive procedure. Do you have a pretty clear idea of how to beat the enemy? If not, there must be something in the enemy that’s obstructing you from it, what is it? Oh is it the big scary Bazooka they’re carrying? Well, you have a pretty clear idea of how to avoid receiving damage from a Bazooka right? No? Then what’s between you and that?! and so on.

Note of course that this is only a methodological procedure with no guarantee of success; if at any point one thinks, not only I don’t have any idea of how to avoid receiving damage from a Bazooka, but rather I’m utterly convinced that this is impossible… That’s great too! Can’t you just prove that it is impossible?! Uhmmm, well not quite, it’s hard to prove the Bazooka is going to kill me regardless of what I do, I mean the enemy could miss the shot, or maybe the Bazooka is not loaded. Very good, is there a strategy that safely allows you to check whether the Bazooka is loaded? Perhaps throwing a decoy instead of yourself?

You see the point: it’s hard to get fully and completely stuck working like this because whatever you’re trying is either possible or impossible; so if you do methodologically sound steps, one way or another you should gain something. If there’s a path from your current state of knowledge A to your desired state of knowledge B, then that path must go through some intermediate state C that is very close to A, and your mission is to not focus on B and how far it seems, but rather on where C is. A different matter is whether things are achievable in a given timeframe, or whether they’re worth attempting at all given their probability of failure. But I think is crucial to be confident in that if you really really want to do something, it’s hard to be 100% mentally stuck, as decomposition techniques should get you closer to smaller and smaller tasks, that at some point should be atomically solvable, or atomically impossible. Note as well that “conceptually stuck” is different from cases like “I can’t advance on this paperwork until getting Claire’s signature on this other form”. Then you can be caught in a deadlock, but at least it is not a conceptual deadlock.

Let’s work through an extremely simple example.

Theorem. All trees (i.e., connected acyclic undirected graphs) are bicolorable.

If you can immediately think of a proof, pretend you don’t for the sake of the exercise. I’ll pretend so.

Okay, so are all graphs bicolorable? If so, then we have it! No? So there are non-bicolorable graphs?! Oh, By trying on paper I got the triangle!

Is the theorem false?! Not really, because the triangle is not a tree, as it has a cycle. But this is good, we found an enemy and his name is the triangle. Is there a way, even though the triangle itself is not a tree, that it still wins an enemy and prevents our theorem from being true, say because it’s a part of a larger tree? Well, not quite either, because a tree couldn’t really have a triangle as a part, that would break acyclicity. Okay, the triangle is defeated, so are we done? It seems that what things are pointing to is ‘‘All graphs that don’t contain triangles are bicolorable”. Is that the same as what we are trying to prove? Well not quite, a square doesn’t contain a triangle and yet it’s not a tree.

But the square can be colored with 2 colors! Not really an enemy of bicolorable graphs. Okay so ‘‘All graphs that don’t contain triangles are bicolorable” is an appealing idea; it even seems a nice converse to “All graphs that contain triangles are not bicolorable”, which we already know by now! Squares are no problemo, so if there’s any difference between the truth value of ‘‘All graphs that don’t contain triangles are bicolorable” and that of our theorem, it must be because of longer cycles, longer than 4 in particular.

Huh, also cycles of length 5 require 3 colors! What about 6? huh, 2 colors! What about 7? huh, 3 colors! What about 8? huh, 3 colors! Okay, the pattern is clear. It seems that cycles of odd length are not bicolorable; good thing trees don’t have them! Can we prove that if your graph doesn’t have cycles of odd length, then it is bicolorable? What would be an obstacle? A graph that is not bicolorable and yet doesn’t have cycles of odd length. Let’s look for one. After a bit of pen-and-paper time, one should get frustrated: I don’t seem to find any examples. Things start to point out to “bicolorable if, and only if, no odd-length cycles”.

Exercise to the reader: continue this extremely painful proof method until completion, or perhaps with a different (but easy!) problem. I really recommend doing it.

For a longer exposition on this idea, I really encourage the reader to go for Solving Math Problems Terribly. Solving problems terribly is an amazing skill to learn!

Once you’ve kept this idea in mind, the next step is to use it extensively in pedagogy. It’s really nice when proofs depict the enemy, and they show why it can’t hurt you, instead of just walking through the grass while leaving the reader wondering why no enemies attack. An example of a proof that I wouldn’t like to read about trees being bicolorable is the following:

Define trees inductively as either an empty graph or a vertex (which we call root) from which an arbitrary collection of trees hang. Now let’s prove by induction the stronger statement that all trees can be colored red and blue with the root receiving the color red. Trivial for a single vertex, and for the inductive case, color the root with blue, and each hanging tree via the inductive hypothesis. If feeling generous to the reader, argue that this is a fine coloring overall, because the only edges are those inside the trees (fine by inductive hypothesis), and those from the root to each tree root, which is fine because they are red-blue edges. Finally, invert all colors to preserve the inductive hypothesis.

This is correct, but where is the enemy?! I don’t think good proofs have to be extremely explicit in identifying the obstacles, but it sure helps, and if anything, the non-triviality of your theorem is justified precisely by the number and difficulty of overcoming the different obstacles, so you might as well show them properly.

It’s very important to find the smallest enemies one can tackle at the time. Usually, when you play a video game, difficulty can be bad in two ways; either the game is too easy, which makes it boring, or it is too hard, which makes it frustrating. I have never heard of a student quitting their Ph.D. because they found it to be boringly easy, so our case is always that of fighting against a game that gets frustratingly hard at times. And how can games be less frustratingly hard and thus more enjoyable? By having a well-calibrated progression of difficulty. This is not trivia; game designers (both video- and board games), need to spend a lot of time balancing the game so that it’s not too easy nor too hard. When you play a videogame, you don’t start fighting the boss right away, but rather you warm your way up to it by beating a bunch of weak little monsters. The weak little monsters in mathematics, at least for me, are concrete small examples of what I want to prove. Organizing the quest in a way that has a nice progression of difficulty is really important for me to not get frustrated and quit, and it’s not a trivial task; it requires conscious effort.

Street-Fighting Mathematics

This is a fantastic term I learned from Ryan O’Donnell, which pointed me to an older reference, an eponymous book by Sanjoy Mahajan.

So what the heck is street-fighting Mathematics?! First, street-fighting is a term that as opposed to other forms of fighting like Karate or Boxing, refers to a fight without rules, where everything is allowed: hair pulling, punching at the crotch, etc.

The idea of street-fighting mathematics for me is to rebel against a preconceived notion of math as being elegant and correct at all points, justified from the beginning in all of its steps; rule-respecting. There are no rules, whatever you think could work to help solve the problem you’re interested in, you should try. Use a computer, ask your friends, change the theorem statement so now it’s easier, assume all graphs will have only 6 vertices, assume π is equal to 3, use all the dirty tricks physicist use, etc. For the love of god, please refrain from mocking physicists for making their life easier by assuming stuff; ought to do the same first, and then start checking whether you could the same with one fewer assumption.

I try to street-fight my way around everything honestly, obeying only the minimum set of rules I actually think I must follow. I’ve realized some of the best mathematicians I know follow this principle too, either consciously or unconsciously, they take mental shortcuts and try to see if the gaps can be filled later. The justification for the effectiveness of this technique appears to me as being something like this:

Our brain, even when thinking about abstract concepts and formal symbolic manipulation, is driven by intuition, and details come later on, once the intuition has done the initial chopping, like our teeth that mechanically reduce our bites into much smaller pieces that then can be swallowed and absorbed. This is similar to the following: if you try to memorize the sentence “The quick brown fox jumps over the lazy dog” and then say it out loud without looking, this is pretty easy, but now that you memorized it, spell it out loud. This is harder, and usually, our strategy for doing it is going back and forth from “spelling” mode, to “remembering the next word” mode. Things can be substantially easier by not doing them right first, by going for the big picture first, and by filling in the gaps later. In other words, when you have to fill in a box at a Chinese buffet, put in the spring rolls first, and the rice later to fill in the empty spaces.

Let me give a concrete example of how this technique is used.

Problem. Let us say a number whose digits in base 10 are only zero and one is a “binary impostor”. So 10001 is a binary impostor and so is 1110, but not 1030. Now prove that there are infinitely many binary impostors divided by the current year.

If you read this in 2022, the problem is not too hard. 2022*5 = 10110, which is a binary impostor, and we can always add more 0s, making for infinite binary impostors. This easy case gave us the idea of finding a single multiple of the year that is a binary impostor, and then padding it with 0s. But what if you’re reading this in 2023?! The first thing to do, unless you immediately see the solution, is to go to Python and print the first 100 multiples of 2023. Huh, unfortunately, none of them is a binary impostor. What about 2024? Huh neither…

The opposite way around, print the remainder mod 2023 of randomly generated binary impostors.

import random

def to_bin(n):
    if n <= 1: return n
    return to_bin(n//2)*10 + n%2

rems = []
for _ in range(50):
    bi = to_bin(random.randint(100, 5000))
    rems.append(bi%2023)

print(sorted(rems))

Here’s what I got.

Notice something? 230 appears twice! Now I’m curious if the numbers that gave rise to these 230s look weird.

Now 230 doesn’t show up, but I see 3 different random binary impostors that are 56 mod 2023. This is perhaps useful, but I don’t see a pattern right away.

Binary impostors are constructed (see the Python code) by taking one, and adding a 1 or 0 are the end. What does that do mod 2023? I don’t even think now, just go to Python; street-fighting. I get some output, but still don’t see any patterns. What to do?! I’m going to cheat now and pretend the question was with a number smaller than 2023, because spotting patterns in these decently large numbers is not obvious.

What about 2? Mmh, but I immediately see 10 as a multiple. What about 3? The first binary impostor divisible by 3 seems to be 111, I could have thought this, but I shamelessly just coded it. Huh okay. What about 4? Then it’s 100. What about 5? Then it’s 10 again. What about 6? It’s 1110. Okay, too many similar questions, I’ll just do the little piece of code that prints for every small value of N the smallest binary impostor divisible by N.

import random

def to_bin(n):
    if n <= 1: return n
    return to_bin(n//2)*10 + n%2

rems = []
M = {}

def first_bi_div(n):
    for i in range(1, n*n*n):
        if to_bin(i)%n == 0:
            return to_bin(i)

for i in range(2, 60):
    print(first_bi_div(i))

You might ask now, why the n*n*n upper-bound? I don’t know, just made it up, seemed big enough and tractable; street-fighting.

I look at the output and now I notice something; these don’t look like the random ones I was generating earlier, they have lots of 1s in a row or lots of 0s in a row.

Question: if I consider numbers of the form 111…0000, what are they divisible by?

Okay, obvious observation, the 0s at the end will make for 2s and 5s… But 2023 is not divisible by 2 or 5, so not super helpful. At least now I know that I should look at impostors ending in 1.

Except for the first one, they have prime divisors, and 3 pops up (2 and 5 are discarded by the previous idea), 7 appears, 11 appears, and 13 appears, but I don’t see 17. I look at my previous code and it tells me the first binary impostor divisible by 17 is 11101. I now wonder if at some point 1111…..111 will be divisible by 17. Let Python figure it out. I do the first hacky code I come up with. The answer is 1111111111111111; incidentally only a single 1 more than what I tried with the factor Unix tool. Reminds me of this meme.

Okay, so I’ll keep going and checking whether the numbers of the form 111….111 are eventually divisible by anything (that is a multiple of 2 or 5).

Huh, is true up to 100 at least. Promising stuff!

It quickly found one divisible by 2023! Our original problem is done.

But now I want more, it seems to be true for any number!

This is a fairly large number of 1s though, I don’t know how much further I can push the computer… I guess I don’t really need to keep all these 1s on the computer, I’ll just see how adding an extra 1 affects the result mod 2023. I do this but don’t see an obvious pattern… At some point, the sequence of mods repeats itself naturally, so I’ll check what the period is.

The first binary impostor of the form 1111….111 divisible by 2023 has 816 ones, the next has 1632, and so on. 816 is the period. Quite cool actually, so I don’t even need the 0-padding, I’m actually showing that there are infinitely many unary impostors divisible by 2023!

What if it’s not 2023 though? Well, I realize that the sequence mod N must be periodic as well, with a period smaller than N. The problem is that perhaps in all that period it never gives me a 0 mod N. If all numbers have infinite many unary impostors divisible by them, then this never happens. If it happens, then there are no unary impostors divisible by them, but there are pairs of unary impostors that have the same remainder mod N. Now I see it, if I subtract the smaller unary impostor from the larger one, I get a binary impostor, and it has to be 0 mod N, because it’s the subtraction of two numbers that are equal mod N :)

Nice! I finally have infinitely binary impostors divisible by any given year :)

Exercise to the reader. Play around and see whether it’s true that all numbers not divisible by 2 or 5 have infinitely many unary impostors divisible by them.

Now here is how I’d probably write the proof of infinitely many binary impostors divisible by any natural N in a paper:

This sort of writing hides the attempts and failures of the mathematician behind it, and even though concision is a very positive quality, it’s worth thinking about this trade-off, and about how there’s so much more behind the scenes of a proof that a first glance shows. It’s very common to hear amongst undergrads in Math classes that “there’s no way in hell I could’ve come up with that proof”. They might be right some of the time, but other times they just ignore the highly non-linear path the authors might have taken to get where they got. It’s very dangerous for one education to think that the reason you couldn’t come up with that proof by yourself is because of some fundamental difference between you and the authors, e.g., they’re simply smarter; they might be smarter, who knows, but that doesn’t have much to do with the phenomenon at hand.

Of course, this methodology is harder to apply to more abstract problems, and it’s not a recipe for problem-solving, which remains wide open. The idea for me is to avoid “mathematicians’ block”; similar to how writers get blocked in front of the white piece of paper, the mathematician gets blocked in front of the whiteboard, the computer, or the white piece of paper. The standard advice to go past the well-documented writers block is: just sit down and write something, anything; make it as bad as you need to be able to write something, but write! Street-fighting and looking for obstacles are key aspects of my methodology to avoid the mathematicians’ block.

Chewing vs. Swallowing

Something I struggled with for a long time, and I’m just starting to be a tiny bit better about, is that sometimes one gets in the trap of trying to understand every single thing that is required to do something, prove a theorem, understand a paper, fill out a tax-form, etc. Most of the time, this is simply a procrastination strategy to avoid doing whatever we are dreading.

One could think that the best thinkers really process all the details, and understand every single thing that they’re reading, and every single previous result required to understand what they’re reading, and so on. My experience with the best philosophers and mathematicians I’ve met is quite the opposite; they swallow a lot of the stuff, rather than careful timid wise owls I associate them more with fast-moving astute foxes; they try things rapidly, look up stuff, constantly assume things, swallow other people’s results without chewing them, and recognize when what they want to do requires going back to the previous result and chew it out to obtain the missing bits.

This point is very tightly related to having clear goals; once goals are clear, this also sheds light on which parts of the process one should chew and really try to understand, and which parts of the process one can just swallow.

Do what you want — or even better, don’t do what you dread

A key component of my mathematical growth is to try as hard as I can to work on problems and approaches I really want to work on, regardless of whether other people find them silly or useless. Cultivating my own excitement for mathematics is globally more important for my career than an impactful paper, so it’s not worth losing much global excitement to win some local recognition points.

For example, I have published 2 papers at FUN with algorithms, a conference about CS applied to fun contexts. One about the game of Hangman, with Jérémy Barbay, and one about Wordle with Daniel Lokshtanov. It’s very likely that they won’t serve my career much, but I enjoyed working on them, and it cultivated my love for Maths. In hindsight, I’m pretty happy I did it; not only I had fun, but they made me a better mathematician.

The website for FUN with algorithms had at some point the following quote by the great Donald Knuth:

“…pleasure has probably been the main goal all along. But I hesitate to admit it, because computer scientists want to maintain their image as hard-working individuals who deserve high salaries. Sooner or later society will realise that certain kinds of hard work are in fact admirable even though they are more fun than just about anything else.”

A practical application of this idea for me is in the negative case; when there’s something I don’t want to do, I try really hard to not do it. I mean that I try hard to explore whether I actually really really really have to do it, “and there’s not another way?”, and see where that leads. A good fraction of the time I figure out a workaround that allows me to not do the thing I don’t want to do. The other fraction of the time, I’m confident that the dreadful thing really really has to be done, and I know of a good reason for that, which helps me do it. So if one’s to be doing something one doesn’t enjoy, at least it needs to be something one really must do.

An example I read recently was about someone on Twitter who really hated doing dishes, and so they just started buying plastic disposable silverware and plates. Maybe this is not great for environmental reasons, but the mindset is great; if I dread this thing so much, how can I not have to do it anymore?

A step beyond this; I try to collaborate as much as I can with people that share this value, as I feel more comfortable discussing research aesthetics with them. I believe research aesthetics and compatibility are real things that shouldn’t be discounted for building effective collaborations. I’m happy, for example, about my advisor Marijn Heule being explicit about working on the problems we really want to solve and see solved.

A lot of the magic happens in post-production

This part can be summarized as “don’t finish too quickly”. I learned this very recently in life, and it improved my mathematical, philosophical, and literary skills immediately; very few pieces of learning have done that for me.

The main idea is that, perhaps because of some internal mild form of anxiety, one really wants to finish the proof, the paragraph, the argument, and be done. You wrote the last sentence or said the last phrase, and you’re done. Nice.

But the thing is, a lot of the magic happens in post-production, once you sit down calmly with the material you have created and squeeze the lemon until the last drop. When you see a wonderful video on YouTube, it didn’t look like that the whole time. It probably looked like a disaster if you had been able to see the intermediate stages. The last 20% of the effort, at post-production, can really be the 80% of the “wow, this is so high-quality” effect.

So my intention here is to never be quick in brushing off a proof once I think I have it. A lot of the time for me, the real gain in understanding comes after having come up with a proof, but when I’m looking back at this disastrous creation and cleaning it up. I try to ask myself “why is it that I was able to succeed with this method here?”; I had an idea A, and it was not obvious to me before that A was going to work. Then I went ahead and tried it, and it turned out to work, but notice that if I stop here, because my proof is ready, I still don’t understand super well why it worked, it’s still not absorbed as part of my new intuition. So I try to investigate what part of the ideas was I doubtful about, what part was I not confident it, and why my doubts were justified; what was the obstacle preventing the obstacle my initial doubt was worried about.

Let’s see a philosophical example. Anselm’s ontological argument for the existence of god goes as follows.

“[Even a] fool, when he hears of … a being than which nothing greater can be conceived … understands what he hears, and what he understands is in his understanding.… And assuredly that, than which nothing greater can be conceived, cannot exist in the understanding alone. For suppose it exists in the understanding alone: then it can be conceived to exist in reality; which is greater.… Therefore, if that, than which nothing greater can be conceived, exists in the understanding alone, the very being, than which nothing greater can be conceived, is one, than which a greater can be conceived. But obviously this is impossible. Hence, there is no doubt that there exists a being, than which nothing greater can be conceived, and it exists both in the understanding and in reality.” St. Anselm, Archbishop of Canterbury (1033-1099).

A bullet-point version of the argument in modern English (that I am taking literally from the Internet Encyclopedia of Philosophy) is:

It is a conceptual truth (or, so to speak, true by definition) that God is a being than which none greater can be imagined (that is, the greatest possible being that can be imagined).

God exists as an idea in the mind.

A being that exists as an idea in the mind and in reality is, other things being equal, greater than a being that exists only as an idea in the mind.

Thus, if God exists only as an idea in the mind, then we can imagine something that is greater than God (that is, a greatest possible being that does exist).

But we cannot imagine something that is greater than God (for it is a contradiction to suppose that we can imagine a being greater than the greatest possible being that can be imagined.)

Therefore, God exists.

The first criticism I read against this argument came from a monk contemporary to Anselm: Gaunilo of Marmoutier. He basically posited that the same argument could be used to prove the existence of a perfect Island, as a perfect Island that exists in the real world is more perfect than one that exists only in the mind, and therefore the perfect Island must also exist in the real world.

I made a 3-year long intellectual mistake by thinking that Anselm’s idea was destroyed by this counter. But the thing is, the “pisland” (short for perfect island, one of my favorite philosophy-lingo) argument, is more like a gotcha, like an issue that has been raised, but it doesn’t give you good intuition on why Anslem’s argument is wrong, just on a way of exposing a potential failure mode in the argument.

The issue is actually much more complicated than this, and one really needs to stay after the credits to see it. I don’t think the failure in the argument was understood until Kant introduced the Copernican Revolution of philosophy, 700 years later.

In a nutshell, Kant tackles the argument by going against point 3., his idea being that thinking of existence as a property is a categorical mistake. In other words, the universe is not made out of objects that have a self.exists = True or self.exists = False property; existence is not a property, but a precondition for the instantiation of properties into a particular object; and properties themselves are representational concepts rather than real things. There are real things, exactly as they are, and then their properties are in our mental representations of them. Redness, as a property is only a conceptual representation; (over-simplifying) there are red things, which are things whose reality makes the “redness” mental representation toggle on in our brains. For a statistical example, even if your data looks like the following image:

clusters do not exist in the data, only in our conceptual representation of the data. Existence, however, is not even a candidate for a conceptual representation that real objects trigger, but rather a precondition for any conceptual representation. Talking about existence as a property is a type error, so to say.

However, even this is not enough! Because Anselm actually had a second formulation of the ontological argument, which uses the property of necessary existence rather than existence, which can be formalized as an actual property. Using necessary existence, Kurt Gödel, the greatest logician, came up with a formal proof in modal logic of god’s existence based on a refined version of Anselm’s second ontological argument. Funnily enough, Gödel’s argument can be verified by a computer given its formal nature, and it turns out to be consistent! (that is, if one accepts its axioms, the conclusion follows!). To the best of my knowledge, the implication of modal collapse, and the axioms themselves, are still a matter of research.

By finishing too quickly with the pisland argument, I missed on a lot more about the subject.

Stealing as much as possible

The following is a picture of the magnificent Las Meninas, by Diego Velázquez. It’s one of the most studied paintings of all time, a true masterpiece (there’s a lot to say about this painting, but I won’t go into it. Luckily for you, people with infinitely more knowledge about art and art history have extensively written about it. If you’re into Foucault, he wrote some very… Foucaultian things about it, it’s quite interesting),

But now take a look at this.

This is Picasso’s 1957 version of Las Meninas. Truly fantastic stuff. So the thing is, Picasso once said jokingly: Good artists copy, great artists steal. This is a phrase that needs to be taken very carefully to make sense of it. It’s not about defending actual intellectual property theft (please don’t do that — and double please don’t do that saying I told you to do that), but rather about the way great artists relate to their inspirations. Stealing is a way of saying appropriating fully, it is a way of saying that you don’t owe any faithfulness to your inspirations, only credit. They did it first, and that needs to be acknowledged, but you can do with it whatever you want, transform it at your wish, use it for all kinds of silly stuff. Copying has a connotation of sameness that does not allow for repurposing; copying means using the same solution for the same problem, but it is when you steal someone’s toolkit that you can use it in all kinds of new problems.

A concept I really like about good science is that it should not only contain results, but also “reusable brain stuff” (RBS), RBS is what you can steal from the paper and use, probably with tweaks, to solve other new problems. Being mindful of the reusable brain stuff in other people’s work has really changed my relation to the material I read; now I’m constantly on the look for what can I steal. Richard Feynman famously said that he kept around a dozen math or physics problems that he wanted to solve memorized in his head, and whenever he was going to a talk that showed a new math trick, or read a new paper, he would iterate over the dozen problems in his head asking the question “can this trick I’m learning now help me solve this problem?”.

Paraphrasing Pacheco, the Mexican writer, coming up with the equation is only half of the way, and the other half is done by the reader when they use it. By using them, you become in some sense their author, who invents them again in the use.

Thanks for staying here.

P.S. The excuse for this post can be said to be the following tweet, by Jérémy Barbay, in response to my public declaration of a renewed love for mathematics.

So thanks Jérémy for the encouragement! and I hope this reflection can be helpful, or interesting, to someone out there. If it is interesting to you, reader, please let me know what you think!

Or at least within the first page of Google search. I’d be happy to find other translations if anyone points them out to me.

There’s at least one reasonable way I can think for my argument to be wrong; imagine a world where understanding is fundamentally obtained through quantum leaps that cannot be broken down in intermediate steps. In this world I’m wrong, and I don’t see a simple way to prove (nor disprove for that matter) our world is not that world, but I’m pretty confident in this argument being reasonably inferred by Bayesian updating over my inner mental experience. I’d be more than happy to update these beliefs, or simply discuss them further, with the interested reader. It is also worth clarifying that this world model does not mean that our brain works in some sort of continuous fashion over a manifold of possible mental states; far from it, I mean that the resolution of our possible mental states seems good enough to abstractly model reasoning as a continuous phenomenon, similarly to how at human-scale physics, it’s reasonable to model position or speed as continuous variables, regardless of whether they actually only admit a discrete number of quantum states they can collapse into, simply because that is happening way beyond our resolution of interest. More in general, what can continuity possibly be if not the limiting idea (i.e., a purely mental construction) of discreteness being small enough to be abstracted away?