feat(useless-rewrite): Useless rewrite from `log1mexp(log1mexp(x))` to `x` #535

lmmx · 2023-12-06T22:26:39Z

Motivation for these changes

Picking up this ticket on the PyData Global 2023 OSS sprint 🏃

Optimize log1mexp(log1mexp(x)) -> x #471

The motivating issue concerned a problem with PyMC's Censored functionality, which was traced back to the need for a "useless rewrite":

replacing a $\log(1 - \exp(x))$ nested within another $\log(1 - \exp(x))$,
i.e. $\log(1 - \exp(\log(1 - \exp(x))))$,
by just $x$.
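
A short derivation, assuming $x < 0$ so that every logarithm has a positive argument, shows why the nested expression collapses:

$$
\begin{aligned}
\log\bigl(1 - \exp(\log(1 - \exp(x)))\bigr) &= \log\bigl(1 - (1 - \exp(x))\bigr) \\
&= \log(\exp(x)) \\
&= x .
\end{aligned}
$$

For $x > 0$ the inner term $1 - \exp(x)$ is negative, so the inner logarithm is undefined over the reals.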

Implementation details

🧹 refactor(nan switch): DRY out nan switch rewrite function, easier to follow important parts 🧹

The first change prepares for adding a new case by simplifying the case handling (avoiding repetition).
This is achieved with an "inner function" (a function within a function) that captures `x` and `node` from the enclosing function's scope, so they don't need to be passed as parameters and the body of each case becomes simpler.
Every case has a "nan switch", so this trick avoids repetition while keeping the variables in use clear.
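
The closure idea can be sketched roughly as follows. This is a NumPy stand-in, not the PyTensor rewrite itself: the function name `simplify_nan_switch`, the string op names, and the inner `nan_switch` helper are all hypothetical, and real rewrites operate on symbolic graph nodes rather than arrays. The case for the nested `log1mexp` assumes the valid region is `x <= 0`.

```python
import numpy as np


def simplify_nan_switch(outer, inner, x):
    """Illustrative only: evaluate outer(inner(x)) in simplified form,
    returning nan wherever the original expression would be undefined."""
    x = np.asarray(x, dtype="float64")

    def nan_switch(is_valid, simplified):
        # Inner function: `x` comes from the enclosing scope, so each case
        # below only has to state its validity condition and its result.
        return np.where(is_valid, simplified, np.array(np.nan, dtype=x.dtype))

    if outer == "exp" and inner == "log1mexp":
        return nan_switch(x <= 0, 1 - np.exp(x))
    if outer == "expm1" and inner == "log1mexp":
        return nan_switch(x <= 0, -np.exp(x))
    if outer == "log1mexp" and inner == "log1mexp":
        # Assumed condition for the new case: valid only for x <= 0.
        return nan_switch(x <= 0, x)
    return None  # no simplification known for this pair
```

Each case stays a one-liner because the shared "replace invalid entries with nan" logic lives in one place.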

✍️ Add new useless rewrite case ✍️

For this case, the previous operation and the node operation are both `log1mexp`.
The condition for the case is that x >= 0 (confirm?)

Checklist

Explain motivation and implementation 👆
Make sure that the pre-commit linting/style checks pass.
Link relevant issues, preferably in nice commit messages.
The commits correspond to relevant logical changes. Note that if they don't, we will rewrite/rebase/squash the git history before merging.
Are the changes covered by tests and docstrings?
Fill out the short summary sections 👇

Major / Breaking Changes

N/A

New features

"Useless rewrite" to optimise $\log(1 - \exp(\log(1 - \exp(x))))$ into just $x$

Bugfixes

Would resolve Optimize log1mexp(log1mexp(x)) -> x #471

Documentation

...

Maintenance

Took an opportunity to tidy up the 'nan switch' cases into a dict, which is clearer to read than the repetitive if blocks

lmmx · 2023-12-07T13:28:48Z

I think to finish this I need to add a test here

pytensor/tests/tensor/rewriting/test_math.py

Lines 1945 to 1971 in c10c376

    @pytest.mark.parametrize("exp_op", [exp, expm1])
    def test_exp_log1mexp(self, exp_op):
        # exp(log1mexp(x)) -> switch(x <= 0, 1 - exp(x), nan)
        # expm1(log1mexp(x)) -> switch(x <= 0, - exp(x), nan)
        data_valid = -np.random.random((4, 3)).astype("float32")
        data_valid[0, 0] = 0  # edge case
        data_invalid = data_valid + 1

        x = fmatrix()
        f = function([x], exp_op(log1mexp(x)), mode=self.mode)
        graph = f.maker.fgraph.toposort()
        ops_graph = [
            node
            for node in graph
            if isinstance(node.op, Elemwise)
            and isinstance(
                node.op.scalar_op, (aes.Log, aes.Log1p, aes.Log1mexp, aes.Expm1)
            )
        ]
        assert len(ops_graph) == 0

        if exp_op == exp:
            expected = 1 - np.exp(data_valid)
        else:
            expected = -np.exp(data_valid)
        np.testing.assert_almost_equal(f(data_valid), expected)
        assert np.all(np.isnan(f(data_invalid)))

ricardoV94 · 2023-12-08T17:49:57Z

pytensor/tensor/rewriting/math.py

@@ -319,7 +319,7 @@ def local_exp_log(fgraph, node):
 @register_specialize
 @node_rewriter([Elemwise])


This should be a bit better, as it will only call the rewrite on nodes with these Ops

Suggested change

-@node_rewriter([Elemwise])
+@node_rewriter([exp, expm1, log1mexp])

ricardoV94 · 2023-12-08T17:52:11Z

pytensor/tensor/rewriting/math.py

@@ -319,7 +319,7 @@ def local_exp_log(fgraph, node):
 @register_specialize
 @node_rewriter([Elemwise])
 def local_exp_log_nan_switch(fgraph, node):
-    # Rewrites of the kind exp(log...(x)) that require a `nan` switch
+    """Rewrites of the kind exp(log...(x)) that require a `nan` switch."""
    x = node.inputs[0]

    if not isinstance(node.op, Elemwise):


Not needed. (I can't select the `return None` below it for the GitHub suggestion.)

Suggested change

-    if not isinstance(node.op, Elemwise):

ricardoV94 · 2023-12-08T17:59:19Z

pytensor/tensor/rewriting/math.py

-        return [new_out]
+    x = x.owner.inputs[0]
+
+    op_map = {


This is neat but I find the old form more readable tbh? Also if I ever have to debug this rewrite I would rather have the unrolled if/elses.

I am happy with the nan_switch_helper and nesting the if/else based on the outer Ops.

ricardoV94 · 2023-12-08T18:01:57Z

I think to finish this I need to add a test here

pytensor/tests/tensor/rewriting/test_math.py

Lines 1945 to 1971 in c10c376

    @pytest.mark.parametrize("exp_op", [exp, expm1])
    def test_exp_log1mexp(self, exp_op):
        # exp(log1mexp(x)) -> switch(x <= 0, 1 - exp(x), nan)
        # expm1(log1mexp(x)) -> switch(x <= 0, - exp(x), nan)
        data_valid = -np.random.random((4, 3)).astype("float32")
        data_valid[0, 0] = 0  # edge case
        data_invalid = data_valid + 1

        x = fmatrix()
        f = function([x], exp_op(log1mexp(x)), mode=self.mode)
        graph = f.maker.fgraph.toposort()
        ops_graph = [
            node
            for node in graph
            if isinstance(node.op, Elemwise)
            and isinstance(
                node.op.scalar_op, (aes.Log, aes.Log1p, aes.Log1mexp, aes.Expm1)
            )
        ]
        assert len(ops_graph) == 0

        if exp_op == exp:
            expected = 1 - np.exp(data_valid)
        else:
            expected = -np.exp(data_valid)
        np.testing.assert_almost_equal(f(data_valid), expected)
        assert np.all(np.isnan(f(data_invalid)))

Sounds about right

ricardoV94 · 2023-12-08T18:08:48Z

The condition for the case is that x >= 0 (confirm?)

The valid condition is x <= 0, for which the inner log1mexp is defined. x > 0 should yield nan, as that would lead to taking the log of a negative number.
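
The stated domain can be checked numerically with a plain NumPy reference. This is only a sanity check under assumptions: the local `log1mexp` below is a simple stand-in written as `log(-expm1(x))`, not PyTensor's implementation, and the sample values are arbitrary.

```python
import numpy as np


def log1mexp(x):
    """Reference log(1 - exp(x)); yields nan where 1 - exp(x) is negative."""
    x = np.asarray(x, dtype="float64")
    # Silence the warnings raised for log(0) and log(negative).
    with np.errstate(divide="ignore", invalid="ignore"):
        return np.log(-np.expm1(x))


x_valid = np.array([-3.0, -1.0, -0.25, 0.0])  # x <= 0, including the edge case 0
x_invalid = np.array([0.5, 2.0])              # x > 0

roundtrip_valid = log1mexp(log1mexp(x_valid))
roundtrip_invalid = log1mexp(log1mexp(x_invalid))

# On the valid region the nested expression returns x (up to rounding);
# on the invalid region every entry is nan.
print(np.allclose(roundtrip_valid, x_valid))
print(np.all(np.isnan(roundtrip_invalid)))
```

At `x = 0` the inner call gives `-inf`, and the outer call maps that back to `0`, which is why the edge case still round-trips.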

lmmx added 4 commits December 6, 2023 22:13

refactor(nan switch): DRY out nan switch rewrite function, easier to follow important parts
8a82ba7

feat(useless-rewrite): add case for log1mexp(log1mexp(x)) -> x rewrite
65b2a00

refactor(case-logic): use utility helper functions not bool variables
a5d1827

refactor(case-logic): use a concise nested dict to store the cases clearly
c10c376

lmmx marked this pull request as ready for review December 7, 2023 13:23

ricardoV94 reviewed Dec 8, 2023

lmmx closed this by deleting the head repository Feb 10, 2025
