Open Source · 7 Languages · 900+ Problems

Project Euler Solutions

Complete solutions in C++, Python & Java — with step-by-step mathematical explanations

All Problems

Problem 901: Well Drilling

Project Euler Problem 901 Solution

EulerSolve provides an optimized solution for Project Euler Problem 901, Well Drilling, with C++, Python, Java, and a step-by-step mathematical explanation.

Problem Summary The depth sequence starts from $d_0=0,\qquad d_1=x,$ and then follows the nonlinear recurrence $d_{k+1}=e^{d_k-d_{k-1}}\qquad (k\ge 1).$ An admissible drilling schedule must be strictly increasing, so the required condition is $d_0<d_1<d_2<\cdots.$ The quantity to evaluate is $C(x)=\sum_{k\ge 1} d_k e^{-d_{k-1}}.$ The C++, Python, and Java implementations all treat the answer as the boundary point of the upper admissible branch: they locate the smallest $x$ in that branch and then evaluate $C(x)$ there. Numerically, this boundary is near $x_\star\approx 0.746542014027$, and the corresponding value is $C(x_\star)\approx 2.364497769$. Mathematical Approach Two identities make the search practical: one turns the monotonicity condition into a statement about successive gaps, and the other rewrites the objective as a rapidly convergent tail sum....

Detailed mathematical approach

Problem Summary

The depth sequence starts from

$d_0=0,\qquad d_1=x,$

and then follows the nonlinear recurrence

$d_{k+1}=e^{d_k-d_{k-1}}\qquad (k\ge 1).$

An admissible drilling schedule must be strictly increasing, so the required condition is

$d_0<d_1<d_2<\cdots.$

The quantity to evaluate is

$C(x)=\sum_{k\ge 1} d_k e^{-d_{k-1}}.$

The C++, Python, and Java implementations all treat the answer as the boundary point of the upper admissible branch: they locate the smallest $x$ in that branch and then evaluate $C(x)$ there. Numerically, this boundary is near $x_\star\approx 0.746542014027$, and the corresponding value is $C(x_\star)\approx 2.364497769$.

Mathematical Approach

Two identities make the search practical: one turns the monotonicity condition into a statement about successive gaps, and the other rewrites the objective as a rapidly convergent tail sum.

Step 1: Express Admissibility Through Gap Variables

Define the forward gaps

$\Delta_k=d_k-d_{k-1}\qquad (k\ge 1).$

The schedule is strictly increasing exactly when

$\Delta_k>0\qquad \text{for all }k\ge 1.$

From the recurrence we immediately get

$d_{k+1}=e^{\Delta_k}.$

For $k\ge 2$, since $d_k=e^{\Delta_{k-1}}$, the next gap satisfies

$\Delta_{k+1}=d_{k+1}-d_k=e^{\Delta_k}-e^{\Delta_{k-1}}.$

Because the exponential function is strictly increasing, this means

$\Delta_{k+1}>0 \iff \Delta_k>\Delta_{k-1}\qquad (k\ge 2).$

So admissibility is stronger than mere positivity: after the first two terms, the gap sequence must keep increasing as well.

Step 2: Rewrite the Objective in a Simpler Form

For $k\ge 2$, substitute the recurrence into one summand of $C(x)$:

$d_k e^{-d_{k-1}}=e^{d_{k-1}-d_{k-2}}e^{-d_{k-1}}=e^{-d_{k-2}}.$

Therefore the cost becomes

$C(x)=x+\sum_{k\ge 2} e^{-d_{k-2}}=x+\sum_{j\ge 0} e^{-d_j}.$

Expanding the first few terms gives

$C(x)=x+1+e^{-x}+e^{-e^x}+e^{-d_3}+e^{-d_4}+\cdots.$

This identity is very useful: once the depths become moderately large, the tail shrinks extremely fast.

Step 3: Why a Threshold Search Works

Every finite prefix $d_0,d_1,\dots,d_n$ depends continuously on the starting value $x$. If one sampled value of $x$ eventually produces a non-positive gap while a nearby value keeps all tested gaps positive, then a boundary point must lie between them.

The implementations exploit this by searching for the left edge of the upper admissible branch. Below that edge, some future gap becomes non-positive and strict increase fails. Above that edge, the simulated trajectory stays increasing and the depths quickly escape upward.

This is why bisection is appropriate: once a failing value and a succeeding value are bracketed, repeated midpoint testing isolates the smallest admissible $x$ on that branch.

Step 4: Why the Boundary Value Is the Relevant Candidate

On the upper admissible branch, any smaller starting value is forbidden because it eventually breaks monotonicity. That makes the left boundary $x_\star$ the natural candidate explored by the implementations.

The C++ implementation also checks a separated low admissible branch numerically and confirms that its sampled costs remain above the value found at the upper boundary. In other words, the final reported value comes from the smallest admissible $x$ on the upper branch, not from a larger interior point.

Step 5: Truncating the Infinite Sum Safely

Once some depth exceeds $80$, every later tail term in

$x+\sum_{j\ge 0} e^{-d_j}$

is at most $e^{-80}\approx 1.8\times 10^{-35}$. That is astronomically smaller than the $10^{-9}$ precision of the printed answer. So the implementations can stop the simulation after the depth crosses $80$, with a generous hard cap of $200$ recurrence steps as a safety limit.

Worked Example

Take $x=0.75$, which lies very close to the upper-branch boundary used by the implementations. The first depths are

$\begin{aligned} d_0&=0,\\ d_1&=0.75,\\ d_2&=e^{0.75}\approx 2.117000017,\\ d_3&=e^{2.117000017-0.75}\approx 3.923562400,\\ d_4&=e^{3.923562400-2.117000017}\approx 6.089478119,\\ d_5&=e^{6.089478119-3.923562400}\approx 8.722585700. \end{aligned}$

The cost identity now gives

$\begin{aligned} C(0.75)&=0.75+1+e^{-0.75}+e^{-2.117000017}+e^{-3.923562400}+e^{-6.089478119}+\cdots\\ &\approx 0.75+1+0.472366553+0.120392262+0.019770539+0.002266591+\cdots\\ &\approx 2.3648. \end{aligned}$

This already sits very close to the boundary value reported by the implementations, and it shows how quickly the tail becomes negligible.

How the Code Works

The C++, Python, and Java implementations all simulate the recurrence directly from $d_0=0$ and a trial value of $d_1=x$. During this simulation they check the gap $d_k-d_{k-1}$ at each step, and they reject the trial immediately if any gap becomes non-positive.

To locate the upper admissible branch, the implementations first perform a coarse scan on the interval $0.6\le x\le 1.5$ with step $0.01$. This finds the first transition from failure to success on that branch.

After bracketing the transition, they run $90$ bisection iterations. The midpoint is tested by the same recurrence simulation, so the boundary estimate tightens until the desired floating-point precision is reached.

Once the boundary value has been isolated, the implementations replay the recurrence and accumulate the cost. They stop when the depth exceeds $80$, because the remaining tail is far below the displayed precision.

The C++ implementation adds extra sanity checks around the boundary and compares against sampled points on the low branch, while the Python and Java implementations keep only the core search-and-evaluate procedure.

Complexity Analysis

Let $T$ be the maximum number of recurrence steps used in a single simulation. Here $T\le 200$. One admissibility test or one cost evaluation therefore costs $O(T)$ time and $O(1)$ memory.

If the coarse scan uses $S$ samples and the refinement uses $B$ bisection steps, the total running time is

$O((S+B)T).$

Since $B=O(\log(1/\varepsilon))$ for target precision $\varepsilon$, this is also $O(T\log(1/\varepsilon))$ once the initial scan interval is fixed. Because all constants are small and fixed in the implementations, the practical runtime is effectively constant and the memory usage remains $O(1)$.

Footnotes and References

Problem page: Project Euler 901
Recurrence relation: Wikipedia — Recurrence relation
Exponential function: Wikipedia — Exponential function
Bisection method: Wikipedia — Bisection method
Fixed-point iteration: Wikipedia — Fixed-point iteration