diff --git a/docs/index.html b/docs/index.html index 7a2bce33..4f5f3991 100644 --- a/docs/index.html +++ b/docs/index.html @@ -1,161 +1,174 @@ + + + + A Primer on Bézier Curves - - - - A Primer on Bézier Curves + - + - - - - + + + - - - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
- This page on GitHub - - This page on GitHub - -
+
+ This page on GitHub + + This page on GitHub + +
-
- - - submit to reddit - submit to hacker news - tweet your read - -
+
+ + + submit to reddit + submit to hacker news + tweet your read + +
- + +
+

+ A Primer on Bézier Curves +

+

A free, online book for when you really need to know how to do Bézier things.

+
+ Read this in your own language: + +

+ (Don't see your language listed, or want to see it reach 100%? + Help translate this content!) +

+
-
- -

A Primer on Bézier Curves

-

A free, online book for when you really need to know how to do Bézier things.

-
- Read this in your own language: - -

(Don't see your language listed, or want to see it reach 100%? Help translate this content!)

-
- -

- Welcome to the Primer on Bezier Curves. This is a free website/ebook dealing with both - the maths and programming aspects of Bezier Curves, covering a wide range of topics - relating to drawing and working with that curve that seems to pop up everywhere, from - Photoshop paths to CSS easing functions to Font outline descriptions. -

-

- If this is your first time here: welcome! Let me know if you were looking for anything - in particular that the primer doesn't cover over on the issue tracker! -

- -

Donations and sponsorship

+

+ Welcome to the Primer on Bezier Curves. This is a free website/ebook dealing with both the maths and programming aspects of Bezier Curves, + covering a wide range of topics relating to drawing and working with that curve that seems to pop up everywhere, from Photoshop paths to CSS + easing functions to Font outline descriptions. +

+

+ If this is your first time here: welcome! Let me know if you were looking for anything in particular that the primer doesn't cover over on the + issue tracker! +

-

- If this is a resource that you're using for research, as work reference, or even writing your own software, - please consider donating - (any amount helps) or signing up as a patron on Patreon. - I don't get paid to work on this, so if you find this site valuable, and you'd like it - to stick around for a long time to come, a lot of coffee went into writing this over the - years, and a lot more coffee will need to go into it yet: if you can spare a coffee, you'd - be helping keep a resource alive and well. -

-

- Also, if you are a company and your staff uses this book as a resource, or you use it as an - onboarding resource, then please: consider sponsoring the site! I am more than happy to work - with your finance department on sponsorship invoicing and recognition. -

- +

Donations and sponsorship

- -
+
-

- — Pomax -

- + +
- -
+
+
+

Preface

+

+ In order to draw things in 2D, we usually rely on lines, which typically get classified into two categories: straight lines, and curves. The + first of these are as easy to draw as they are easy to make a computer draw. Give a computer the first and last point in the line, and BAM! + straight line. No questions asked. +

+

+ Curves, however, are a much bigger problem. While we can draw curves with ridiculous ease freehand, computers are a bit handicapped in that + they can't draw curves unless there is a mathematical function that describes how it should be drawn. In fact, they even need this for + straight lines, but the function is ridiculously easy, so we tend to ignore that as far as computers are concerned; all lines are + "functions", regardless of whether they're straight or curves. However, that does mean that we need to come up with fast-to-compute + functions that lead to nice looking curves on a computer. There are a number of these, and in this article we'll focus on a particular + function that has received quite a bit of attention and is used in pretty much anything that can draw curves: Bézier curves. +

+

+ They're named after Pierre Bézier, who is principally responsible for making + them known to the world as a curve well-suited for design work (publishing his investigations in 1962 while working for Renault), although + he was not the first, or only one, to "invent" these type of curves. One might be tempted to say that the mathematician + Paul de Casteljau was first, as he began investigating the nature of these + curves in 1959 while working at Citroën, and came up with a really elegant way of figuring out how to draw them. However, de Casteljau did + not publish his work, making the question "who was first" hard to answer in any absolute sense. Or is it? Bézier curves are, at their core, + "Bernstein polynomials", a family of mathematical functions investigated by + Sergei Natanovich Bernstein, whose publications on them date back at + least as far as 1912. +

+

+ Anyway, that's mostly trivia, what you are more likely to care about is that these curves are handy: you can link up multiple Bézier curves + so that the combination looks like a single curve. If you've ever drawn Photoshop "paths" or worked with vector drawing programs like Flash, + Illustrator or Inkscape, those curves you've been drawing are Bézier curves. +

+

+ But what if you need to program them yourself? What are the pitfalls? How do you draw them? What are the bounding boxes, how do you + determine intersections, how can you extrude a curve, in short: how do you do everything that you might want to do with these curves? That's + what this page is for. Prepare to be mathed! +

+
+

Virtually all Bézier graphics are interactive.

+

+ This page uses interactive examples, relying heavily on Bezier.js, as well as maths + formulae which are typeset into SVG using the XeLaTeX typesetting system and + pdf2svg by David Barton. +

+

This book is open source.

+

+ This book is an open source software project, and lives on two github repositories. The first is + https://github.com/pomax/bezierinfo and is the purely-for-presentation version you are + viewing right now. The other repository is https://github.com/pomax/BezierInfo-2, + which is the development version, housing all the code that gets turned into the web version, and is also where you should file + issues if you find bugs or have ideas on what to change or add to the primer. +

+

How complicated is the maths going to be?

+

+ Most of the mathematics in this Primer are early high school maths. If you understand basic arithmetic, and you know how to read English, + you should be able to get by just fine. There will at times be far more complicated maths, but if you don't feel like digesting + them, you can safely skip over them by either skipping over the "detail boxes" in section or by just jumping to the end of a section with + maths that looks too involving. The end of sections typically simply list the conclusions so you can just work with those values directly. +

+

What language is all this example code in?

+

+ There are way too many programming languages to favour one of all others, soo all the example code in this Primer uses a form of + pseudo-code that uses a syntax that's close enough to, but not actually, modern scripting languages like JS, Python, etc. That means you + won't be able to copy-paste any of it without giving it any thought, but that's intentional: if you're reading this primer, presumably you + want to learn, and you don't learn by copy-pasting. You learn by doing things yourself, making mistakes, and then fixing + those mistakes. Now, of course, I didn't intentionally add errors in the example code just to trick you into making mistakes (that would + be horrible!) but I did intentionally keep the code from favouring one programming language over another. Don't worry though, if + you know even a single procedural programming language, you should be able to read the examples without any difficulties. +

+

Questions, comments:

+

+ If you have suggestions for new sections, hit up the Github issue tracker (also + reachable from the repo linked to in the upper right). If you have questions about the material, there's currently no comment section + while I'm doing the rewrite, but you can use the issue tracker for that as well. Once the rewrite is done, I'll add a general comment + section back in, and maybe a more topical "select this section of text and hit the 'question' button to ask a question about it" system. + We'll see. +

+

Help support the book!

+

+ If you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me + know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 100-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing! +

+
+
+
+

What's new?

+

+ This primer is a living document, and so depending on when you last look at it, there may be new content. Click the following link to expand + this section to have a look at what got added, when, or click through to the News posts for more detailed updates. (RSS feed + available) +

+ + + +
+

November 2020

+
    +
  • Added a section on finding curve/circle intersections

  • +
+

October 2020

+
    +
  • Added the Ukranian locale! Help out in getting its localization to 100%!

  • +
+

August-September 2020

+
    +
  • +

    + Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS + turned on. +

    +
  • +
+

June 2020

+
    +
  • Added automatic CI/CD using Github Actions

  • +
+

January 2020

+
    +
  • Added reset buttons to all graphics

  • +
  • Updated to preface to correctly describe the on-page maths

  • +
  • Fixed the Catmull-Rom section because it had glaring maths errors

  • +
+

August 2019

+
    +
  • Added a section on (plain) rational Bezier curves

  • +
  • Improved the Graphic component to allow for sliders

  • +
+

December 2018

+
    +
  • Added a section on curvature and calculating kappa.

  • +
  • +

    + Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this + site! +

    +
  • +
+

August 2018

+
    +
  • Added a section on finding a curve's y, if all you have is the x coordinate.

  • +
+

July 2018

+
    +
  • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

  • +
  • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

  • +
  • +

    (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

    +
  • +
+

June 2018

+
    +
  • Added a section on direct curve fitting.

  • +
  • Added source links for all graphics.

  • +
  • Added this "What's new?" section.

  • +
+

April 2017

+
    +
  • Added a section on 3d normals.

  • +
  • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

  • +
+

February 2017

+
    +
  • Finished rewriting the entire codebase for localization.

  • +
+

January 2016

+
    +
  • Added a section to explain the Bezier interval.

  • +
  • Rewrote the Primer as a React application.

  • +
+

December 2015

+
    +
  • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

  • +
  • +

    + Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics. +

    +
  • +
+

May 2015

+
    +
  • Switched over to pure JS rather than Processing-through-Processing.js

  • +
  • Added Cardano's algorithm for finding the roots of a cubic polynomial.

  • +
+

April 2015

+
    +
  • Added a section on arc length approximations.

  • +
+

February 2015

+
    +
  • Added a section on the canonical cubic Bezier form.

  • +
+

November 2014

+
    +
  • Switched to HTTPS.

  • +
+

July 2014

+
    +
  • Added the section on arc approximation.

  • +
+

April 2014

+
    +
  • Added the section on Catmull-Rom fitting.

  • +
+

November 2013

+
    +
  • Added the section on Catmull-Rom / Bezier conversion.

  • +
  • Added the section on Bezier cuves as matrices.

  • +
+

April 2013

+
    +
  • Added a section on poly-Beziers.

  • +
  • Added a section on boolean shape operations.

  • +
+

March 2013

+
    +
  • First drastic rewrite.

  • +
  • Added sections on circle approximations.

  • +
  • Added a section on projecting a point onto a curve.

  • +
  • Added a section on tangents and normals.

  • +
  • Added Legendre-Gauss numerical data tables.

  • +
+

October 2011

+
    +
  • +

    + First commit for the bezierinfo site, based on the pre-Primer webpage that covered + the basics of Bezier curves in HTML with Processing.js examples. +

    +
  • +
+
+
+
+
+

+ + A lightning introduction +

+

+ Let's start with the good stuff: when we're talking about Bézier curves, we're talking about the things that you can see in the following + graphics. They run from some start point to some end point, with their curvature influenced by one or more "intermediate" control points. + Now, because all the graphics on this page are interactive, go manipulate those curves a bit: click-drag the points, and see how their + shape changes based on what you do. +

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
-
- -

Preface

-

In order to draw things in 2D, we usually rely on lines, which typically get classified into two categories: straight lines, and curves. The first of these are as easy to draw as they are easy to make a computer draw. Give a computer the first and last point in the line, and BAM! straight line. No questions asked.

-

Curves, however, are a much bigger problem. While we can draw curves with ridiculous ease freehand, computers are a bit handicapped in that they can't draw curves unless there is a mathematical function that describes how it should be drawn. In fact, they even need this for straight lines, but the function is ridiculously easy, so we tend to ignore that as far as computers are concerned; all lines are "functions", regardless of whether they're straight or curves. However, that does mean that we need to come up with fast-to-compute functions that lead to nice looking curves on a computer. There are a number of these, and in this article we'll focus on a particular function that has received quite a bit of attention and is used in pretty much anything that can draw curves: Bézier curves.

-

They're named after Pierre Bézier, who is principally responsible for making them known to the world as a curve well-suited for design work (publishing his investigations in 1962 while working for Renault), although he was not the first, or only one, to "invent" these type of curves. One might be tempted to say that the mathematician Paul de Casteljau was first, as he began investigating the nature of these curves in 1959 while working at Citroën, and came up with a really elegant way of figuring out how to draw them. However, de Casteljau did not publish his work, making the question "who was first" hard to answer in any absolute sense. Or is it? Bézier curves are, at their core, "Bernstein polynomials", a family of mathematical functions investigated by Sergei Natanovich Bernstein, whose publications on them date back at least as far as 1912.

-

Anyway, that's mostly trivia, what you are more likely to care about is that these curves are handy: you can link up multiple Bézier curves so that the combination looks like a single curve. If you've ever drawn Photoshop "paths" or worked with vector drawing programs like Flash, Illustrator or Inkscape, those curves you've been drawing are Bézier curves.

-

But what if you need to program them yourself? What are the pitfalls? How do you draw them? What are the bounding boxes, how do you determine intersections, how can you extrude a curve, in short: how do you do everything that you might want to do with these curves? That's what this page is for. Prepare to be mathed!

-
- -

Virtually all Bézier graphics are interactive.

-

This page uses interactive examples, relying heavily on Bezier.js, as well as maths formulae which are typeset into SVG using the XeLaTeX typesetting system and pdf2svg by David Barton.

-

This book is open source.

-

This book is an open source software project, and lives on two github repositories. The first is https://github.com/pomax/bezierinfo and is the purely-for-presentation version you are viewing right now. The other repository is https://github.com/pomax/BezierInfo-2, which is the development version, housing all the code that gets turned into the web version, and is also where you should file issues if you find bugs or have ideas on what to change or add to the primer.

-

How complicated is the maths going to be?

-

Most of the mathematics in this Primer are early high school maths. If you understand basic arithmetic, and you know how to read English, you should be able to get by just fine. There will at times be far more complicated maths, but if you don't feel like digesting them, you can safely skip over them by either skipping over the "detail boxes" in section or by just jumping to the end of a section with maths that looks too involving. The end of sections typically simply list the conclusions so you can just work with those values directly.

-

What language is all this example code in?

-

There are way too many programming languages to favour one of all others, soo all the example code in this Primer uses a form of pseudo-code that uses a syntax that's close enough to, but not actually, modern scripting languages like JS, Python, etc. That means you won't be able to copy-paste any of it without giving it any thought, but that's intentional: if you're reading this primer, presumably you want to learn, and you don't learn by copy-pasting. You learn by doing things yourself, making mistakes, and then fixing those mistakes. Now, of course, I didn't intentionally add errors in the example code just to trick you into making mistakes (that would be horrible!) but I did intentionally keep the code from favouring one programming language over another. Don't worry though, if you know even a single procedural programming language, you should be able to read the examples without any difficulties.

-

Questions, comments:

-

If you have suggestions for new sections, hit up the Github issue tracker (also reachable from the repo linked to in the upper right). If you have questions about the material, there's currently no comment section while I'm doing the rewrite, but you can use the issue tracker for that as well. Once the rewrite is done, I'll add a general comment section back in, and maybe a more topical "select this section of text and hit the 'question' button to ask a question about it" system. We'll see.

-

Help support the book!

-

If you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 100-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing!

-
-
-
-

What's new?

-

This primer is a living document, and so depending on when you last look at it, there may be new content. Click the following link to expand this section to have a look at what got added, when, or click through to the News posts for more detailed updates. (RSS feed available)

- - - -

November 2020

  • Added a section on finding curve/circle intersections

    -
-

October 2020

  • Added the Ukranian locale! Help out in getting its localization to 100%!

    -
-

August-September 2020

  • Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS turned on.

    -
-

June 2020

  • Added automatic CI/CD using Github Actions

    -
-

January 2020

  • Added reset buttons to all graphics

    -
  • -
  • Updated to preface to correctly describe the on-page maths

    -
  • -
  • Fixed the Catmull-Rom section because it had glaring maths errors

    -
-

August 2019

  • Added a section on (plain) rational Bezier curves

    -
  • -
  • Improved the Graphic component to allow for sliders

    -
-

December 2018

  • Added a section on curvature and calculating kappa.

    -
  • -
  • Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this site!

    -
-

August 2018

  • Added a section on finding a curve's y, if all you have is the x coordinate.

    -
-

July 2018

  • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

    -
  • -
  • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

    -
  • -
  • (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

    -
-

June 2018

  • Added a section on direct curve fitting.

    -
  • -
  • Added source links for all graphics.

    -
  • -
  • Added this "What's new?" section.

    -
-

April 2017

  • Added a section on 3d normals.

    -
  • -
  • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

    -
-

February 2017

  • Finished rewriting the entire codebase for localization.

    -
-

January 2016

  • Added a section to explain the Bezier interval.

    -
  • -
  • Rewrote the Primer as a React application.

    -
-

December 2015

  • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

    -
  • -
  • Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics.

    -
-

May 2015

  • Switched over to pure JS rather than Processing-through-Processing.js

    -
  • -
  • Added Cardano's algorithm for finding the roots of a cubic polynomial.

    -
-

April 2015

  • Added a section on arc length approximations.

    -
-

February 2015

  • Added a section on the canonical cubic Bezier form.

    -
-

November 2014

  • Switched to HTTPS.

    -
-

July 2014

  • Added the section on arc approximation.

    -
-

April 2014

  • Added the section on Catmull-Rom fitting.

    -
-

November 2013

  • Added the section on Catmull-Rom / Bezier conversion.

    -
  • -
  • Added the section on Bezier cuves as matrices.

    -
-

April 2013

  • Added a section on poly-Beziers.

    -
  • -
  • Added a section on boolean shape operations.

    -
-

March 2013

  • First drastic rewrite.

    -
  • -
  • Added sections on circle approximations.

    -
  • -
  • Added a section on projecting a point onto a curve.

    -
  • -
  • Added a section on tangents and normals.

    -
  • -
  • Added Legendre-Gauss numerical data tables.

    -
-

October 2011

  • First commit for the bezierinfo site, based on the pre-Primer webpage that covered the basics of Bezier curves in HTML with Processing.js examples.

    -
-
-
-

A lightning introduction

-

Let's start with the good stuff: when we're talking about Bézier curves, we're talking about the things that you can see in the following graphics. They run from some start point to some end point, with their curvature influenced by one or more "intermediate" control points. Now, because all the graphics on this page are interactive, go manipulate those curves a bit: click-drag the points, and see how their shape changes based on what you do.

-
- - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
- -

These curves are used a lot in computer aided design and computer aided manufacturing (CAD/CAM) applications, as well as in graphic design programs like Adobe Illustrator and Photoshop, Inkscape, GIMP, etc. and in graphic technologies like scalable vector graphics (SVG) and OpenType fonts (TTF/OTF). A lot of things use Bézier curves, so if you want to learn more about them... prepare to get your learn on!

- -
-
-

So what makes a Bézier Curve?

-

Playing with the points for curves may have given you a feel for how Bézier curves behave, but what are Bézier curves, really? There are two ways to explain what a Bézier curve is, and they turn out to be the entirely equivalent, but one of them uses complicated maths, and the other uses really simple maths. So... let's start with the simple explanation:

-

Bézier curves are the result of linear interpolations. That sounds complicated but you've been doing linear interpolation since you were very young: any time you had to point at something between two other things, you've been applying linear interpolation. It's simply "picking a point between two points".

-

If we know the distance between those two points, and we want a new point that is, say, 20% the distance away from the first point (and thus 80% the distance away from the second point) then we can compute that really easily:

- - -

So let's look at that in action: the following graphic is interactive in that you can use your up and down arrow keys to increase or decrease the interpolation ratio, to see what happens. We start with three points, which gives us two lines. Linear interpolation over those lines gives us two points, between which we can again perform linear interpolation, yielding a single point. And that point —and all points we can form in this way for all ratios taken together— form our Bézier curve:

- - - Scripts are disabled. Showing fallback image. - - - - - -

And that brings us to the complicated maths: calculus.

-

While it doesn't look like that's what we've just done, we actually just drew a quadratic curve, in steps, rather than in a single go. One of the fascinating parts about Bézier curves is that they can both be described in terms of polynomial functions, as well as in terms of very simple interpolations of interpolations of [...]. That, in turn, means we can look at what these curves can do based on both "real maths" (by examining the functions, their derivatives, and all that stuff), as well as by looking at the "mechanical" composition (which tells us, for instance, that a curve will never extend beyond the points we used to construct it).

-

So let's start looking at Bézier curves a bit more in depth: their mathematical expressions, the properties we can derive from them, and the various things we can do to, and with, Bézier curves.

- -
-
-

The mathematics of Bézier curves

-

Bézier curves are a form of "parametric" function. Mathematically speaking, parametric functions are cheats: a "function" is actually a well defined term representing a mapping from any number of inputs to a single output. Numbers go in, a single number comes out. Change the numbers that go in, and the number that comes out is still a single number.

-

Parametric functions cheat. They basically say "alright, well, we want multiple values coming out, so we'll just use more than one function". An illustration: Let's say we have a function that maps some value, let's call it x, to some other value, using some kind of number manipulation:

- - -

The notation f(x) is the standard way to show that it's a function (by convention called f if we're only listing one) and its output changes based on one variable (in this case, x). Change x, and the output for f(x) changes.

-

So far, so good. Now, let's look at parametric functions, and how they cheat. Let's take the following two functions:

- - -

There's nothing really remarkable about them, they're just a sine and cosine function, but you'll notice the inputs have different names. If we change the value for a, we're not going to change the output value for f(b), since a isn't used in that function. Parametric functions cheat by changing that. In a parametric function all the different functions share a variable, like this:

- - -

Multiple functions, but only one variable. If we change the value for t, we change the outcome of both fa(t) and fb(t). You might wonder how that's useful, and the answer is actually pretty simple: if we change the labels fa(t) and fb(t) with what we usually mean with them for parametric curves, things might be a lot more obvious:

- - -

There we go. x/y coordinates, linked through some mystery value t.

-

So, parametric curves don't define a y coordinate in terms of an x coordinate, like normal functions do, but they instead link the values to a "control" variable. If we vary the value of t, then with every change we get two values, which we can use as (x,y) coordinates in a graph. The above set of functions, for instance, generates points on a circle: We can range t from negative to positive infinity, and the resulting (x,y) coordinates will always lie on a circle with radius 1 around the origin (0,0). If we plot it for t from 0 to 5, we get this:

- - - Scripts are disabled. Showing fallback image. - - - - - -

Bézier curves are just one out of the many classes of parametric functions, and are characterised by using the same base function for all of the output values. In the example we saw above, the x and y values were generated by different functions (one uses a sine, the other a cosine); but Bézier curves use the "binomial polynomial" for both the x and y outputs. So what are binomial polynomials?

-

You may remember polynomials from high school. They're those sums that look like this:

- - -

If the highest order term they have is , they're called "cubic" polynomials; if it's , it's a "square" polynomial; if it's just x, it's a line (and if there aren't even any terms with x it's not a polynomial!)

-

Bézier curves are polynomials of t, rather than x, with the value for t being fixed between 0 and 1, with coefficients a, b etc. taking the "binomial" form, which sounds fancy but is actually a pretty simple description for mixing values:

- - -

I know what you're thinking: that doesn't look too simple! But if we remove t and add in "times one", things suddenly look pretty easy. Check out these binomial terms:

- - -

Notice that 2 is the same as 1+1, and 3 is 2+1 and 1+2, and 6 is 3+3... As you can see, each time we go up a dimension, we simply start and end with 1, and everything in between is just "the two numbers above it, added together", giving us a simple number sequence known as Pascal's triangle. Now that's easy to remember.

-

There's an equally simple way to figure out how the polynomial terms work: if we rename (1-t) to a and t to b, and remove the weights for a moment, we get this:

- - -

It's basically just a sum of "every combination of a and b", progressively replacing a's with b's after every + sign. So that's actually pretty simple too. So now you know binomial polynomials, and just for completeness I'm going to show you the generic function for this:

- - -

And that's the full description for Bézier curves. Σ in this function indicates that this is a series of additions (using the variable listed below the Σ, starting at ...=<value> and ending at the value listed on top of the Σ).

-
+ +

+ And that's the full description for Bézier curves. Σ in this function indicates that this is a series of additions (using the variable + listed below the Σ, starting at ...=<value> and ending at the value listed on top of the Σ). +

+
+

How to implement the basis function

+

We could naively implement the basis function as a mathematical construct, using the function as our guide, like this:

-

How to implement the basis function

-

We could naively implement the basis function as a mathematical construct, using the function as our guide, like this:

- - - - - -
1 - -
2
3
4
5
- -

I say we could, because we're not going to: the factorial function is incredibly expensive. And, as we can see from the above explanation, we can actually create Pascal's triangle quite easily without it: just start at [1], then [1,1], then [1,2,1], then [1,3,3,1], and so on, with each next row fitting 1 more number than the previous row, starting and ending with "1", with all the numbers in between being the sum of the previous row's elements on either side "above" the one we're computing.

-

We can generate this as a list of lists lightning fast, and then never have to compute the binomial terms because we have a lookup table:

+ return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
- -

So what's going on here? First, we declare a lookup table with a size that's reasonably large enough to accommodate most lookups. Then, we declare a function to get us the values we need, and we make sure that if an n/k pair is requested that isn't in the LUT yet, we expand it first. Our basis function now looks like this:

+ return lut[n][k] + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + - - - - -
1 - -
2
3
4
5
- -

Perfect. Of course, we can optimize further. For most computer graphics purposes, we don't need arbitrary curves (although we will also provide code for arbitrary curves in this primer); we need quadratic and cubic curves, and that means we can drastically simplify the code:

+ return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
- -

And now we know how to program the basis function. Excellent.

-
+ return mt3 + 3*mt2*t + 3*mt*t2 + t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

So, now we know what the basis function looks like, time to add in the magic that makes Bézier curves so special: control points.

+

And now we know how to program the basis function. Excellent.

+
-
-
-

Controlling Bézier curvatures

-

Bézier curves are, like all "splines", interpolation functions. This means that they take a set of points, and generate values somewhere "between" those points. (One of the consequences of this is that you'll never be able to generate a point that lies outside the outline for the control points, commonly called the "hull" for the curve. Useful information!). In fact, we can visualize how each point contributes to the value generated by the function, so we can see which points are important, where, in the curve.

-

The following graphs show the interpolation functions for quadratic and cubic curves, with "S" being the strength of a point's contribution to the total sum of the Bézier function. Click-and-drag to see the interpolation percentages for each curve-defining point at a specific t value.

-
- - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
+

So, now we know what the basis function looks like, time to add in the magic that makes Bézier curves so special: control points.

+
+
+

+ + Controlling Bézier curvatures +

+

+ Bézier curves are, like all "splines", interpolation functions. This means that they take a set of points, and generate values somewhere + "between" those points. (One of the consequences of this is that you'll never be able to generate a point that lies outside the outline + for the control points, commonly called the "hull" for the curve. Useful information!). In fact, we can visualize how each point + contributes to the value generated by the function, so we can see which points are important, where, in the curve. +

+

+ The following graphs show the interpolation functions for quadratic and cubic curves, with "S" being the strength of a point's + contribution to the total sum of the Bézier function. Click-and-drag to see the interpolation percentages for each curve-defining point at + a specific t value. +

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
-

Also shown is the interpolation function for a 15th order Bézier function. As you can see, the start and end point contribute considerably more to the curve's shape than any other point in the control point set.

-

If we want to change the curve, we need to change the weights of each point, effectively changing the interpolations. The way to do this is about as straightforward as possible: just multiply each point with a value that changes its strength. These values are conventionally called "weights", and we can add them to our original Bézier function:

- - -

That looks complicated, but as it so happens, the "weights" are actually just the coordinate values we want our curve to have: for an nth order curve, w0 is our start coordinate, wn is our last coordinate, and everything in between is a controlling coordinate. Say we want a cubic curve that starts at (110,150), is controlled by (25,190) and (210,250) and ends at (210,30), we use this Bézier curve:

- - -

Which gives us the curve we saw at the top of the article:

- - - Scripts are disabled. Showing fallback image. - - - -

What else can we do with Bézier curves? Quite a lot, actually. The rest of this article covers a multitude of possible operations and algorithms that we can apply, and the tasks they achieve.

-
+ +

Which gives us the curve we saw at the top of the article:

+ + + Scripts are disabled. Showing fallback image. + + + +

+ What else can we do with Bézier curves? Quite a lot, actually. The rest of this article covers a multitude of possible operations and + algorithms that we can apply, and the tasks they achieve. +

+
+

How to implement the weighted basis function

+

Given that we already know how to implement basis function, adding in the control points is remarkably easy:

-

How to implement the weighted basis function

-

Given that we already know how to implement basis function, adding in the control points is remarkably easy:

- - - - - -
1 - -
2
3
4
5
- -

And now for the optimized versions:

+ return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
- -

And now we know how to program the weighted basis function.

-
+ return w[0]*mt3 + 3*w[1]*mt2*t + 3*w[2]*mt*t2 + w[3]*t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -
-
-

Controlling Bézier curvatures, part 2: Rational Béziers

-

We can further control Bézier curves by "rationalising" them: that is, adding a "ratio" value in addition to the weight value discussed in the previous section, thereby gaining control over "how strongly" each coordinate influences the curve.

-

Adding these ratio values to the regular Bézier curve function is fairly easy. Where the regular function is the following:

- - -

The function for rational Bézier curves has two more terms:

- - -

In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just using different weight. So far, nothing too special.

-

However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights.

-

This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly describing circles (which we'll see in a later section is literally impossible using standard Bézier curves).

-

But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects what gets drawn:

- - - Scripts are disabled. Showing fallback image. - - - - - - - - -

You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence relative to all other points.

-
+ +

+ In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] + then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just + using different weight. So far, nothing too special. +

+

+ However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a + fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the + "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights. +

+

+ This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve + that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly + describing circles (which we'll see in a later section is literally impossible using standard Bézier curves). +

+

+ But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an + interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with + ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate + exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects + what gets drawn: +

+ + + Scripts are disabled. Showing fallback image. + + + + + + + + +

+ You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will + want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like + with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence + relative to all other points. +

+
+

How to implement rational curves

+

Extending the code of the previous section to include ratios is almost trivial:

-

How to implement rational curves

-

Extending the code of the previous section to include ratios is almost trivial:

- - - - - - - - - - - - - - - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
- -

And that's all we have to do.

-
+ return (f[0] * w[0] + f[1] * w[1] + f[2] * w[2] + f[3] * w[3])/basis + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + -
-
-

The Bézier interval [0,1]

-

Now that we know the mathematics behind Bézier curves, there's one curious thing that you may have noticed: they always run from t=0 to t=1. Why that particular interval?

-

It all has to do with how we run from "the start" of our curve to "the end" of our curve. If we have a value that is a mixture of two other values, then the general formula for this is:

- - -

The obvious start and end values here need to be a=1, b=0, so that the mixed value is 100% value 1, and 0% value 2, and a=0, b=1, so that the mixed value is 0% value 1 and 100% value 2. Additionally, we don't want "a" and "b" to be independent: if they are, then we could just pick whatever values we like, and end up with a mixed value that is, for example, 100% value 1 and 100% value 2. In principle that's fine, but for Bézier curves we always want mixed values between the start and end point, so we need to make sure we can never set "a" and "b" to some values that lead to a mix value that sums to more than 100%. And that's easy:

- - -

With this we can guarantee that we never sum above 100%. By restricting a to values in the interval [0,1], we will always be somewhere between our two values (inclusively), and we will always sum to a 100% mix.

-

But... what if we use this form, which is based on the assumption that we will only ever use values between 0 and 1, and instead use values outside of that interval? Do things go horribly wrong? Well... not really, but we get to "see more".

-

In the case of Bézier curves, extending the interval simply makes our curve "keep going". Bézier curves are simply segments of some polynomial curve, so if we pick a wider interval we simply get to see more of the curve. So what do they look like?

-

The following two graphics show you Bézier curves rendered "the usual way", as well as the curves they "lie on" if we were to extend the t values much further. As you can see, there's a lot more "shape" hidden in the rest of the curve, and we can model those parts by moving the curve points around.

-
- - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
+ +

+ With this we can guarantee that we never sum above 100%. By restricting a to values in the interval [0,1], we will always be + somewhere between our two values (inclusively), and we will always sum to a 100% mix. +

+

+ But... what if we use this form, which is based on the assumption that we will only ever use values between 0 and 1, and instead use + values outside of that interval? Do things go horribly wrong? Well... not really, but we get to "see more". +

+

+ In the case of Bézier curves, extending the interval simply makes our curve "keep going". Bézier curves are simply segments of some + polynomial curve, so if we pick a wider interval we simply get to see more of the curve. So what do they look like? +

+

+ The following two graphics show you Bézier curves rendered "the usual way", as well as the curves they "lie on" if we were to extend the + t values much further. As you can see, there's a lot more "shape" hidden in the rest of the curve, and we can model those + parts by moving the curve points around. +

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
-

In fact, there are curves used in graphics design and computer modelling that do the opposite of Bézier curves; rather than fixing the interval, and giving you freedom to choose the coordinates, they fix the coordinates, but give you freedom over the interval. A great example of this is the "Spiro" curve, which is a curve based on part of a Cornu Spiral, also known as Euler's Spiral. It's a very aesthetically pleasing curve and you'll find it in quite a few graphics packages like FontForge and Inkscape. It has even been used in font design, for example for the Inconsolata typeface.

- -
-
-

Bézier curvatures as matrix operations

-

We can also represent Bézier curves as matrix operations, by expressing the Bézier formula as a polynomial basis function and a coefficients matrix, and the actual coordinates as a matrix. Let's look at what this means for the cubic curve, using P... to refer to coordinate values "in one or more dimensions":

- - -

Disregarding our actual coordinates for a moment, we have:

- - -

We can write this as a sum of four expressions:

- - -

And we can expand these expressions:

- - -

Furthermore, we can make all the 1 and 0 factors explicit:

- - -

And that, we can view as a series of four matrix operations:

- - -

If we compact this into a single matrix operation, we get:

- - -

This kind of polynomial basis representation is generally written with the bases in increasing order, which means we need to flip our t matrix horizontally, and our big "mixing" matrix upside down:

- - -

And then finally, we can add in our original coordinates as a single third matrix:

- - -

We can perform the same trick for the quadratic curve, in which case we end up with:

- - -

If we plug in a t value, and then multiply the matrices, we will get exactly the same values as when we evaluate the original polynomial function, or as when we evaluate the curve using progressive linear interpolation.

-

So: why would we bother with matrices? Matrix representations allow us to discover things about functions that would otherwise be hard to tell. It turns out that the curves form triangular matrices, and they have a determinant equal to the product of the actual coordinates we use for our curve. It's also invertible, which means there's a ton of properties that are all satisfied. Of course, the main question is "why is this useful to us now?", and the answer to that is that it's not immediately useful, but you'll be seeing some instances where certain curve properties can be either computed via function manipulation, or via clever use of matrices, and sometimes the matrix approach can be (drastically) faster.

-

So for now, just remember that we can represent curves this way, and let's move on.

+ +

+ If we plug in a t value, and then multiply the matrices, we will get exactly the same values as when we evaluate the original + polynomial function, or as when we evaluate the curve using progressive linear interpolation. +

+

+ So: why would we bother with matrices? Matrix representations allow us to discover things about functions that would + otherwise be hard to tell. It turns out that the curves form + triangular matrices, and they have a determinant equal to the product of the + actual coordinates we use for our curve. It's also invertible, which means there's + a ton of properties that are all satisfied. Of + course, the main question is "why is this useful to us now?", and the answer to that is that it's not immediately useful, but + you'll be seeing some instances where certain curve properties can be either computed via function manipulation, or via clever use of + matrices, and sometimes the matrix approach can be (drastically) faster. +

+

So for now, just remember that we can represent curves this way, and let's move on.

+
+
+

+ + de Casteljau's algorithm +

+

+ If we want to draw Bézier curves, we can run through all values of t from 0 to 1 and then compute the weighted basis function + at each value, getting the x/y values we need to plot. Unfortunately, the more complex the curve gets, the more expensive + this computation becomes. Instead, we can use de Casteljau's algorithm to draw curves. This is a geometric approach to curve + drawing, and it's really easy to implement. So easy, in fact, you can do it by hand with a pencil and ruler. +

+

Rather than using our calculus function to find x/y values for t, let's do this instead:

+
    +
  • treat t as a ratio (which it is). t=0 is 0% along a line, t=1 is 100% along a line.
  • +
  • Take all lines between the curve's defining points. For an order n curve, that's n lines.
  • +
  • + Place markers along each of these line, at distance t. So if t is 0.2, place the mark at 20% from the start, + 80% from the end. +
  • +
  • Now form lines between those points. This gives n-1 lines.
  • +
  • Place markers along each of these line at distance t.
  • +
  • Form lines between those points. This'll be n-2 lines.
  • +
  • Place markers, form lines, place markers, etc.
  • +
  • + Repeat this until you have only one line left. The point t on that line coincides with the original curve point at + t. +
  • +
+

+ To see this in action, move the slider for the following sketch to changes which curve point is explicitly evaluated using de Casteljau's + algorithm. +

+ + + Scripts are disabled. Showing fallback image. + + + + + +
+

How to implement de Casteljau's algorithm

+

+ Let's just use the algorithm we just specified, and implement that as a function that can take a list of curve-defining points, and a + t value, and draws the associated point on the curve for that t value: +

-
-
-

de Casteljau's algorithm

-

If we want to draw Bézier curves, we can run through all values of t from 0 to 1 and then compute the weighted basis function at each value, getting the x/y values we need to plot. Unfortunately, the more complex the curve gets, the more expensive this computation becomes. Instead, we can use de Casteljau's algorithm to draw curves. This is a geometric approach to curve drawing, and it's really easy to implement. So easy, in fact, you can do it by hand with a pencil and ruler.

-

Rather than using our calculus function to find x/y values for t, let's do this instead:

-
    -
  • treat t as a ratio (which it is). t=0 is 0% along a line, t=1 is 100% along a line.
  • -
  • Take all lines between the curve's defining points. For an order n curve, that's n lines.
  • -
  • Place markers along each of these line, at distance t. So if t is 0.2, place the mark at 20% from the start, 80% from the end.
  • -
  • Now form lines between those points. This gives n-1 lines.
  • -
  • Place markers along each of these line at distance t.
  • -
  • Form lines between those points. This'll be n-2 lines.
  • -
  • Place markers, form lines, place markers, etc.
  • -
  • Repeat this until you have only one line left. The point t on that line coincides with the original curve point at t.
  • -
-

To see this in action, move the slider for the following sketch to changes which curve point is explicitly evaluated using de Casteljau's algorithm.

- - - Scripts are disabled. Showing fallback image. - - - - - -
- -

How to implement de Casteljau's algorithm

-

Let's just use the algorithm we just specified, and implement that as a function that can take a list of curve-defining points, and a t value, and draws the associated point on the curve for that t value:

- - - - - - - - -
1 - -
2
3
4
5
6
7
8
- -

And done, that's the algorithm implemented. Although: usually you don't get the luxury of overloading the "+" operator, so let's also give the code for when you need to work with x and y values separately:

+ drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
- -

So what does this do? This draws a point, if the passed list of points is only 1 point long. Otherwise it will create a new list of points that sit at the t ratios (i.e. the "markers" outlined in the above algorithm), and then call the draw function for this new list.

-
+ drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + +

+ So what does this do? This draws a point, if the passed list of points is only 1 point long. Otherwise it will create a new list of + points that sit at the t ratios (i.e. the "markers" outlined in the above algorithm), and then call the draw function for this + new list. +

+ +
+
+

+ + Simplified drawing +

+

+ We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, + a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines. +

+

+ We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the + number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we + can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we + lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or + curvature alignment. +

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
-
-
-

Simplified drawing

-

We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines.

-

We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or curvature alignment.

-
- - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
+

+ Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. + You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the + cubic curve), a higher number is required to capture the curvature changes properly. +

+
+

How to implement curve flattening

+

Let's just use the algorithm we just specified, and implement that:

-

Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the cubic curve), a higher number is required to capture the curvature changes properly.

-
- -

How to implement curve flattening

-

Let's just use the algorithm we just specified, and implement that:

- - - - - - - -
1 - -
2
3
4
5
6
7
- -

And done, that's the algorithm implemented. That just leaves drawing the resulting "curve" as a sequence of lines:

+ return coordinates; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + - - - - - - -
1 - -
2
3
4
5
6
7
- -

We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

-
+ coord = _coord + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -
-
-

Splitting curves

-

Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point.

- - - Scripts are disabled. Showing fallback image. - - - - - -
+

We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

+
+
+
+

+ + Splitting curves +

+

+ Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken + together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the + points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our + on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point. +

+ + + Scripts are disabled. Showing fallback image. + + + + + +
+

implementing curve splitting

+

We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

-

implementing curve splitting

-

We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

- - - - - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
- -

After running this function for some value t, the left and right arrays will contain all the coordinates for two new curves - one to the "left" of our t value, the other on the "right". These new curves will have the same order as the original curve, and can be overlaid exactly on the original curve.

-
+ drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + -
-
-

Splitting curves using matrices

-

Another way to split curves is to exploit the matrix representation of a Bézier curve. In the section on matrices, we saw that we can represent curves as matrix multiplications. Specifically, we saw these two forms for the quadratic and cubic curves respectively: (we'll reverse the Bézier coefficients vector for legibility)

- - -

and

- - -

Let's say we want to split the curve at some point t = z, forming two new (obviously smaller) Bézier curves. To find the coordinates for these two Bézier curves, we can use the matrix representation and some linear algebra. First, we separate out the actual "point on the curve" information into a new matrix multiplication:

- - -

and

- - -

If we could compact these matrices back to the form [t values] · [Bézier matrix] · [column matrix], with the first two staying the same, then that column matrix on the right would be the coordinates of a new Bézier curve that describes the first segment, from t = 0 to t = z. As it turns out, we can do this quite easily, by exploiting some simple rules of linear algebra (and if you don't care about the derivations, just skip to the end of the box for the results!).

-
- -

Deriving new hull coordinates

-

Deriving the two segments upon splitting a curve takes a few steps, and the higher the curve order, the more work it is, so let's look at the quadratic curve first:

- - - - - - - - -

We can do this because [M · M-1] is the identity matrix. It's a bit like multiplying something by x/x in calculus: it doesn't do anything to the function, but it does allow you to rewrite it to something that may be easier to work with, or can be broken up differently. In the same way, multiplying our matrix by [M · M-1] has no effect on the total formula, but it does allow us to change the matrix sequence [something · M] to a sequence [M · something], and that makes a world of difference: if we know what [M-1 · Z · M] is, we can apply that to our coordinates, and be left with a proper matrix representation of a quadratic Bézier curve (which is [T · M · P]), with a new set of coordinates that represent the curve from t = 0 to t = z. So let's get computing:

- - -

Excellent! Now we can form our new quadratic curve:

- - - - - - -

Brilliant: if we want a subcurve from t = 0 to t = z, we can keep the first coordinate the same (which makes sense), our control point becomes a z-ratio mixture of the original control point and the start point, and the new end point is a mixture that looks oddly similar to a Bernstein polynomial of degree two. These new coordinates are actually really easy to compute directly!

-

Of course, that's only one of the two curves. Getting the section from t = z to t = 1 requires doing this again. We first observe that in the previous calculation, we actually evaluated the general interval [0,z]. We were able to write it down in a more simple form because of the zero, but what we actually evaluated, making the zero explicit, was:

- - - - -

If we want the interval [z,1], we will be evaluating this instead:

- - - - -

We're going to do the same trick of multiplying by the identity matrix, to turn [something · M] into [M · something]:

- - -

So, our final second curve looks like:

- - - - - - -

Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to compute directly!

-
+ +

+ Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio + mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein + polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to + compute directly! +

+ -

So, using linear algebra rather than de Casteljau's algorithm, we have determined that, for any quadratic curve split at some value t = z, we get two subcurves that are described as Bézier curves with simple-to-derive coordinates:

- - -

and

- - -

We can do the same for cubic curves. However, I'll spare you the actual derivation (don't let that stop you from writing that out yourself, though) and simply show you the resulting new coordinate sets:

- - -

and

- - -

So, looking at our matrices, did we really need to compute the second segment matrix? No, we didn't. Actually having one segment's matrix means we implicitly have the other: push the values of each row in the matrix Q to the right, with zeroes getting pushed off the right edge and appearing back on the left, and then flip the matrix vertically. Presto, you just "calculated" Q'.

-

Implementing curve splitting this way requires less recursion, and is just straight arithmetic with cached values, so can be cheaper on systems where recursion is expensive. If you're doing computation with devices that are good at matrix multiplication, chopping up a Bézier curve with this method will be a lot faster than applying de Casteljau.

- -
-
-

Lowering and elevating curve order

-

One interesting property of Bézier curves is that an nth order curve can always be perfectly represented by an (n+1)th order curve, by giving the higher-order curve specific control points.

-

If we have a curve with three points, then we can create a curve with four points that exactly reproduces the original curve. First, we give it the same start and end points, and for its two control points we pick "1/3rd start + 2/3rd control" and "2/3rd control + 1/3rd end". Now we have exactly the same curve as before, except represented as a cubic curve rather than a quadratic curve.

-

The general rule for raising an nth order curve to an (n+1)th order curve is as follows (observing that the start and end weights are the same as the start and end weights for the old curve):

- - -

However, this rule also has as direct consequence that you cannot generally safely lower a curve from nth order to (n-1)th order, because the control points cannot be "pulled apart" cleanly. We can try to, but the resulting curve will not be identical to the original, and may in fact look completely different.

-

However, there is a surprisingly good way to ensure that a lower order curve looks "as close as reasonably possible" to the original curve: we can optimise the "least-squares distance" between the original curve and the lower order curve, in a single operation (also explained over on Sirver's Castle). However, to use it, we'll need to do some calculus work and then switch over to linear algebra. As mentioned in the section on matrix representations, some things can be done much more easily with matrices than with calculus functions, and this is one of those things. So... let's go!

-

We start by taking the standard Bézier function, and condensing it a little:

- - -

Then, we apply one of those silly (actually, super useful) calculus tricks: since our t value is always between zero and one (inclusive), we know that (1-t) plus t always sums to 1. As such, we can express any value as a sum of t and 1-t:

- - -

So, with that seemingly trivial observation, we rewrite that Bézier function by splitting it up into a sum of a (1-t) and t component:

- - -

So far so good. Now, to see why we did this, let's write out the (1-t) and t parts, and see what that gives us. I promise, it's about to make sense. We start with (1-t):

- - -

So by using this seemingly silly trick, we can suddenly express part of our nth order Bézier function in terms of an (n+1)th order Bézier function. And that sounds a lot like raising the curve order! Of course we need to be able to repeat that trick for the t part, but that's not a problem:

- - -

So, with both of those changed from an order n expression to an order (n+1) expression, we can put them back together again. Now, where the order n function had a summation from 0 to n, the order n+1 function uses a summation from 0 to n+1, but this shouldn't be a problem as long as we can add some new terms that "contribute nothing". In the next section on derivatives, there is a discussion about why "higher terms than there is a binomial for" and "lower than zero terms" both "contribute nothing". So as long as we can add terms that have the same form as the terms we need, we can just include them in the summation, they'll sit there and do nothing, and the resulting function stays identical to the lower order curve.

-

Let's do this:

- - -

And this is where we switch over from calculus to linear algebra, and matrices: we can now express this relation between Bézier(n,t) and Bézier(n+1,t) as a very simple matrix multiplication:

- - -

where the matrix M is an n+1 by n matrix, and looks like:

- - -

That might look unwieldy, but it's really just a mostly-zeroes matrix, with a very simply fraction on the diagonal, and an even simpler fraction to the left of it. Multiplying a list of coordinates with this matrix means we can plug the resulting transformed coordinates into the one-order-higher function and get an identical looking curve.

-

Not too bad!

-

Equally interesting, though, is that with this matrix operation established, we can now use an incredibly powerful and ridiculously simple way to find out a "best fit" way to reverse the operation, called the normal equation. What it does is minimize the sum of the square differences between one set of values and another set of values. Specifically, if we can express that as some function A x = b, we can use it. And as it so happens, that's exactly what we're dealing with, so:

- - -

The steps taken here are:

-
    -
  1. We have a function in a form that the normal equation can be used with, so
  2. -
  3. apply the normal equation!
  4. -
  5. Then, we want to end up with just Bn on the left, so we start by left-multiply both sides such that we'll end up with lots of stuff on the left that simplified to "a factor 1", which in matrix maths is the identity matrix.
  6. -
  7. In fact, by left-multiplying with the inverse of what was already there, we've effectively "nullified" (but really, one-inified) that big, unwieldy block into the identity matrix I. So we substitute the mess with I, and then
  8. -
  9. because multiplication with the identity matrix does nothing (like multiplying by 1 does nothing in regular algebra), we just drop it.
  10. -
-

And we're done: we now have an expression that lets us approximate an n+1th order curve with a lower nth order curve. It won't be an exact fit, but it's definitely a best approximation. So, let's implement these rules for raising and lowering curve order to a (semi) random curve, using the following graphic. Select the sketch, which has movable control points, and press your up and down arrow keys to raise or lower the curve order.

- - - Scripts are disabled. Showing fallback image. - - - - - - - -
-
-

Derivatives

-

There's a number of useful things that you can do with Bézier curves based on their derivative, and one of the more amusing observations about Bézier curves is that their derivatives are, in fact, also Bézier curves. In fact, the differentiation of a Bézier curve is relatively straightforward, although we do need a bit of math.

-

First, let's look at the derivative rule for Bézier curves, which is:

- - -

which we can also write (observing that b in this formula is the same as our w weights, and that n times a summation is the same as a summation where each term is multiplied by n) as:

- - -

Or, in plain text: the derivative of an nth degree Bézier curve is an (n-1)th degree Bézier curve, with one fewer term, and new weights w'0...w'n-1 derived from the original weights as n(wi+1 - wi). So for a 3rd degree curve, with four weights, the derivative has three new weights: w'0 = 3(w1-w0), w'1 = 3(w2-w1) and w'2 = 3(w3-w2).

-
- -

"Slow down, why is that true?"

-

Sometimes just being told "this is the derivative" is nice, but you might want to see why this is indeed the case. As such, let's have a look at the proof for this derivative. First off, the weights are independent of the full Bézier function, so the derivative involves only the derivative of the polynomial basis function. So, let's find that:

- - -

Applying the product and chain rules gives us:

- - -

Which is hard to work with, so let's expand that properly:

- - -

Now, the trick is to turn this expression into something that has binomial coefficients again, so we want to end up with things that look like "x! over y!(x-y)!". If we can do that in a way that involves terms of n-1 and k-1, we'll be on the right track.

- - -

And that's the first part done: the two components inside the parentheses are actually regular, lower-order Bézier expressions:

- - -

Now to apply this to our weighted Bézier curves. We'll write out the plain curve formula that we saw earlier, and then work our way through to its derivative:

- - -

If we expand this (with some color to show how terms line up), and reorder the terms by increasing values for k we see the following:

- - -

Two of these terms fall way: the first term falls away because there is no -1st term in a summation. As such, it always contributes "nothing", so we can safely completely ignore it for the purpose of finding the derivative function. The other term is the very last term in this expansion: one involving Bn-1,n. This term would have a binomial coefficient of [i choose i+1], which is a non-existent binomial coefficient. Again, this term would contribute "nothing", so we can ignore it, too. This means we're left with:

- - -

And that's just a summation of lower order curves:

- - -

We can rewrite this as a normal summation, and we're done:

- - -
+ + -

Let's rewrite that in a form similar to our original formula, so we can see the difference. We will first list our original formula for Bézier curves, and then the derivative:

- - - - -

What are the differences? In terms of the actual Bézier curve, virtually nothing! We lowered the order (rather than n, it's now n-1), but it's still the same Bézier function. The only real difference is in how the weights change when we derive the curve's function. If we have four points A, B, C, and D, then the derivative will have three points, the second derivative two, and the third derivative one:

- - -

We can keep performing this trick for as long as we have more than one weight. Once we have one weight left, the next step will see k = 0, and the result of our "Bézier function" summation is zero, because we're not adding anything at all. As such, a quadratic curve has no second derivative, a cubic curve has no third derivative, and generalized: an nth order curve has n-1 (meaningful) derivatives, with any further derivative being zero.

- -
-
-

Tangents and normals

-

If you want to move objects along a curve, or "away from" a curve, the two vectors you're most interested in are the tangent vector and normal vector for curve points. These are actually really easy to find. For moving and orienting along a curve, we use the tangent, which indicates the direction of travel at specific points, and is literally just the first derivative of our curve:

- - -

This gives us the directional vector we want. We can normalize it to give us uniform directional vectors (having a length of 1.0) at each point, and then do whatever it is we want to do based on those directions:

- - -

The tangent is very useful for moving along a line, but what if we want to move away from the curve instead, perpendicular to the curve at some point t? In that case we want the normal vector. This vector runs at a right angle to the direction of the curve, and is typically of length 1.0, so all we have to do is rotate the normalized directional vector and we're done:

- - -
- -

Rotating coordinates is actually very easy, if you know the rule for it. You might find it explained as "applying a rotation matrix, which is what we'll look at here, too. Essentially, the idea is to take the circles over which we can rotate, and simply "sliding the coordinates" over these circles by the desired -angle. If we want a quarter circle turn, we take the coordinate, slide it along the circle by a quarter turn, and done.

-

To turn any point (x,y) into a rotated point (x',y') (over 0,0) by some angle φ, we apply this nice and easy computation:

- - -

Which is the "long" version of the following matrix transformation:

- - -

And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1.

-

But why does this work? Why this matrix multiplication? Wikipedia (technically, Thomas Herter and Klaus Lott) tells us that a rotation matrix can be -treated as a sequence of three (elementary) shear operations. When we combine this into a single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. DataGenetics have an excellent article about this very thing: it's really quite cool, and I strongly recommend taking a quick break from this primer to read that article.

-
+ +

+ And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even + easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1. +

+

+ But why does this work? Why this matrix multiplication? + Wikipedia (technically, Thomas Herter and Klaus + Lott) tells us that a rotation matrix can be treated as a sequence of three (elementary) shear operations. When we combine this into a + single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. + DataGenetics have an excellent article about this very thing: it's + really quite cool, and I strongly recommend taking a quick break from this primer to read that article. +

+ -

The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant).

-
- - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
+

+ The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the + normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant). +

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
+
+
+

+ + Working with 3D normals +

+

+ Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things + this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but + for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, + it's not "super hard", but there are more steps involved and we should have a look at those. +

+

+ Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. + However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, + not a single vector, so we basically need to define what "the" normal is in the 3D case. +

+

+ The "naïve" approach is to construct what is known as the + Frenet normal, where we follow a simple recipe that works in + many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are + perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each + point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the + local tangent, as well as the tangents "right next to it". +

+

+ Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the + vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know + lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same + logic we used for the 2D case: "just rotate it 90 degrees". +

+

So let's do that! And in a twist surprise, we can do this in four lines:

+
    +
  • a = normalize(B'(t))
  • +
  • b = normalize(a + B''(t))
  • +
  • r = normalize(b × a)
  • +
  • normal = normalize(r × a)
  • +
+

Let's unpack that a little:

+
    +
  • + We start by taking the normalized vector for the derivative at some point on the + curve. We normalize it so the maths is less work. Less work is good. +
  • +
  • + Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and + just had the same derivative and second derivative from that point on. +
  • +
  • + This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which + means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the + cross product. (Note that while that operation uses the × operator, it's most + definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent + a quarter circle to get our normal, just like we did in the 2D case. +
  • +
  • + Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal + vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second + time, and immediately get our normal vector. +
  • +
+

+ And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can + move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor + position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc: +

+ + + Scripts are disabled. Showing fallback image. + + + + + +

+ However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the + curve" between t=0.65 and t=0.75... Why is it doing that? +

+

+ As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are + "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute + normals that just... look good. +

+

Thankfully, Frenet normals are not our only option.

+

+ Another option is to take a slightly more algorithmic approach and compute a form of + Rotation Minimising Frame + (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational + axis, and the normal vector, centered on an on-curve point. +

+

+ These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we + could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at + the same time that you're building lookup tables for your curve. +

+

+ The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by + applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are: +

+
    +
  • Take a point on the curve for which we know the RM frame already,
  • +
  • take a next point on the curve for which we don't know the RM frame yet, and
  • +
  • + reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous + points as a "mirror". +
  • +
  • + This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal + that's slightly off-kilter, so: +
  • +
  • + reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror". +
  • +
  • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
  • +
+

So, let's write some code for that!

+
+

Implementing Rotation Minimising Frames

+

+ We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it + yields a frame with properties: +

-
-
-

Working with 3D normals

-

Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, it's not "super hard", but there are more steps involved and we should have a look at those.

-

Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, not a single vector, so we basically need to define what "the" normal is in the 3D case.

-

The "naïve" approach is to construct what is known as the Frenet normal, where we follow a simple recipe that works in many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the local tangent, as well as the tangents "right next to it".

-

Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same logic we used for the 2D case: "just rotate it 90 degrees".

-

So let's do that! And in a twist surprise, we can do this in four lines:

-
    -
  • a = normalize(B'(t))
  • -
  • b = normalize(a + B''(t))
  • -
  • r = normalize(b × a)
  • -
  • normal = normalize(r × a)
  • -
-

Let's unpack that a little:

-
    -
  • We start by taking the normalized vector for the derivative at some point on the curve. We normalize it so the maths is less work. Less work is good.
  • -
  • Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and just had the same derivative and second derivative from that point on.
  • -
  • This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the cross product. (Note that while that operation uses the × operator, it's most definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent a quarter circle to get our normal, just like we did in the 2D case.
  • -
  • Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second time, and immediately get our normal vector.
  • -
-

And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc:

- - - Scripts are disabled. Showing fallback image. - - - - - -

However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the curve" between t=0.65 and t=0.75... Why is it doing that?

-

As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute normals that just... look good.

-

Thankfully, Frenet normals are not our only option.

-

Another option is to take a slightly more algorithmic approach and compute a form of Rotation Minimising Frame (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational axis, and the normal vector, centered on an on-curve point.

-

These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at the same time that you're building lookup tables for your curve.

-

The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are:

-
    -
  • Take a point on the curve for which we know the RM frame already,
  • -
  • take a next point on the curve for which we don't know the RM frame yet, and
  • -
  • reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous points as a "mirror".
  • -
  • This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal that's slightly off-kilter, so:
  • -
  • reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror".
  • -
  • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
  • -
-

So, let's write some code for that!

-
- -

Implementing Rotation Minimising Frames

-

We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it yields a frame with properties:

- - - - - - -
1 - -
2
3
4
5
6
- -

Then, we can write a function that generates a sequence of RM frames in the following manner:

+} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
- -

Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more code to get much better looking normals.

-
+ frames.add(x1) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + -

Speaking of better looking, what does this actually look like? Let's revisit that earlier curve, but this time use rotation minimising frames rather than Frenet frames:

- - - Scripts are disabled. Showing fallback image. - - - - - -

That looks so much better!

-

For those reading along with the code: we don't even strictly speaking need a Frenet frame to start with: we could, for instance, treat the z-axis as our initial axis of rotation, so that our initial normal is (0,0,1) × tangent, and then take things from there, but having that initial "mathematically correct" frame so that the initial normal seems to line up based on the curve's orientation in 3D space is just nice.

+

+ Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more + code to get much better looking normals. +

+ -
-
-

Component functions

-

One of the first things people run into when they start using Bézier curves in their own programs is "I know how to draw the curve, but how do I determine the bounding box?". It's actually reasonably straightforward to do so, but it requires having some knowledge on exploiting math to get the values we need. For bounding boxes, we aren't actually interested in the curve itself, but only in its "extremities": the minimum and maximum values the curve has for its x- and y-axis values. If you remember your calculus (provided you ever took calculus, otherwise it's going to be hard to remember) we can determine function extremities using the first derivative of that function, but this poses a problem, since our function is parametric: every axis has its own function.

-

The solution: compute the derivative for each axis separately, and then fit them back together in the same way we do for the original.

-

Let's look at how a parametric Bézier curve "splits up" into two normal functions, one for the x-axis and one for the y-axis. Note the leftmost figure is again an interactive curve, without labeled axes (you get coordinates in the graph instead). The center and rightmost figures are the component functions for computing the x-axis value, given a value for t (between 0 and 1 inclusive), and the y-axis value, respectively.

-

If you move points in a curve sideways, you should only see the middle graph change; likewise, moving points vertically should only show a change in the right graph.

- - - Scripts are disabled. Showing fallback image. - - - -

 

- - - Scripts are disabled. Showing fallback image. - - - - -
-
-

Finding extremities: root finding

-

Now that we understand (well, superficially anyway) the component functions, we can find the extremities of our Bézier curve by finding maxima and minima on the component functions, by solving the equation B'(t) = 0. We've already seen that the derivative of a Bézier curve is a simpler Bézier curve, but how do we solve the equality? Fairly easily, actually, until our derivatives are 4th order or higher... then things get really hard. But let's start simple:

-

Quadratic curves: linear derivatives.

-

The derivative of a quadratic Bézier curve is a linear Bézier curve, interpolating between just two terms, which means finding the solution for "where is this line 0" is effectively trivial by rewriting it to a function of t and solving. First we turn our quadratic Bézier function into a linear one, by following the rule mentioned at the end of the derivatives section:

- - -

And then we turn this into our solution for t using basic arithmetics:

- - -

Done.

-

Although with the caveat that if b-a is zero, there is no solution and we probably shouldn't try to perform that division.

-

Cubic curves: the quadratic formula.

-

The derivative of a cubic Bézier curve is a quadratic Bézier curve, and finding the roots for a quadratic polynomial means we can apply the Quadratic formula. If you've seen it before, you'll remember it, and if you haven't, it looks like this:

- - -

So, if we can rewrite the Bézier component function as a plain polynomial, we're done: we just plug in the values into the quadratic formula, check if that square root is negative or not (if it is, there are no roots) and then just compute the two values that come out (because of that plus/minus sign we get two). Any value between 0 and 1 is a root that matters for Bézier curves, anything below or above that is irrelevant (because Bézier curves are only defined over the interval [0,1]). So, how do we convert?

-

First we turn our cubic Bézier function into a quadratic one, by following the rule mentioned at the end of the derivatives section:

- - -

And then, using these v values, we can find out what our a, b, and c should be:

- - -

This gives us three coefficients {a, b, c} that are expressed in terms of v values, where the v values are expressions of our original coordinate values, so we can do some substitution to get:

- - -

Easy-peasy. We can now almost trivially find the roots by plugging those values into the quadratic formula.

-

And as a cubic curve, there is also a meaningful second derivative, which we can compute by simple taking the derivative of the derivative.

-

Quartic curves: Cardano's algorithm.

-

We haven't really looked at them before now, but the next step up would be a Quartic curve, a fourth degree Bézier curve. As expected, these have a derivative that is a cubic function, and now things get much harder. Cubic functions don't have a "simple" rule to find their roots, like the quadratic formula, and instead require quite a bit of rewriting to a form that we can even start to try to solve.

-

Back in the 16th century, before Bézier curves were a thing, and even before calculus itself was a thing, Gerolamo Cardano figured out that even if the general cubic function is really hard to solve, it can be rewritten to a form for which finding the roots is "easier" (even if not "easy"):

- - -

We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather than three: this makes things considerably easier to solve because it lets us use regular calculus to find the values that satisfy the equation.

-

Now, there is one small hitch: as a cubic function, the solutions may be complex numbers rather than plain numbers... And Cardano realised this, centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number.

-

So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at Ken J. Ward's page for solving the cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a read-through.

-
+ +

+ We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather + than three: this makes things considerably easier to solve because it lets us use + regular calculus to find the values that satisfy the equation. +

+

+ Now, there is one small hitch: as a cubic function, the solutions may be + complex numbers rather than plain numbers... And Cardano realised this, + centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these + numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we + have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're + going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number. +

+

+ So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at + Ken J. Ward's page for solving the + cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the + complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a + read-through. +

+
+

Implementing Cardano's algorithm for finding all real roots

+

+ The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" + number space (denoted with ℝ), this is perfectly fine in the + "complex" number space (denoted with ℂ). And, as it so happens, Cardano is + also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm! +

-

Implementing Cardano's algorithm for finding all real roots

-

The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" number space (denoted with ℝ), this is perfectly fine in the "complex" number space (denoted with ℂ). And, as it so happens, Cardano is also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm!

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
1 - -
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
- -
+} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + + 37 + + + 38 + + + 39 + + + 40 + + + 41 + + + 42 + + + 43 + + + 44 + + + 45 + + + 46 + + + 47 + + + 48 + + + 49 + + + 50 + + + 51 + + + 52 + + + 53 + + + 54 + + + 55 + + + 56 + + + 57 + + + 58 + + + 59 + + + 60 + + + 61 + + + 62 + + + 63 + + + 64 + + + 65 + + + 66 + + + 67 + + + 68 + + + 69 + + + 70 + + + 71 + + + 72 + + + 73 + + + 74 + + + 75 + + + 76 + + + 77 + + + 78 + + + 79 + + + 80 + + + 81 + + +
-

And that's it. The maths is complicated, but the code is pretty much just "follow the maths, while caching as many values as we can to prevent recomputing things as much as possible" and now we have a way to find all roots for a cubic function and can just move on with using that to find extremities of our curves.

-

And of course, as a quartic curve also has meaningful second and third derivatives, we can quite easily compute those by using the derivative of the derivative (of the derivative), just as for cubic curves.

-

Quintic and higher order curves: finding numerical solutions

-

And this is where thing stop, because we cannot find the roots for polynomials of degree 5 or higher using algebra (a fact known as the Abel–Ruffini theorem). Instead, for occasions like these, where algebra simply cannot yield an answer, we turn to numerical analysis.

-

That's a fancy term for saying "rather than trying to find exact answers by manipulating symbols, find approximate answers by describing the underlying process as a combination of steps, each of which can be assigned a number via symbolic manipulation". For example, trying to mathematically compute how much water fits in a completely crazy three dimensional shape is very hard, even if it got you the perfect, precise answer. A much easier approach, which would be less perfect but still entirely useful, would be to just grab a buck and start filling the shape until it was full: just count the number of buckets of water you used. And if we want a more precise answer, we can use smaller buckets.

-

So that's what we're going to do here, too: we're going to treat the problem as a sequence of steps, and the smaller we can make each step, the closer we'll get to that "perfect, precise" answer. And as it turns out, there is a really nice numerical root-finding algorithm, called the Newton-Raphson root finding method (yes, after that Newton), which we can make use of. The Newton-Raphson approach consists of taking our impossible-to-solve function f(x), picking some initial value x (literally any value will do), and calculating f(x). We can think of that value as the "height" of the function at x. If that height is zero, we're done, we have found a root. If it isn't, we calculate the tangent line at f(x) and calculate at which x value its height is zero (which we've already seen is very easy). That will give us a new x and we repeat the process until we find a root.

-

Mathematically, this means that for some x, at step n=1, we perform the following calculation until fy(x) is zero, so that the next t is the same as the one we already have:

- - -

(The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

-

Now, this works well only if we can pick good starting points, and our curve is continuously differentiable and doesn't have oscillations. Glossing over the exact meaning of those terms, the curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: which starting points do we pick?

-

As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as high as you'll ever need to go.

-

In conclusion:

-

So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red:

- - - Scripts are disabled. Showing fallback image. - - - -

And for cubic curves, that means first and second derivatives, in red and purple respectively:

- - - Scripts are disabled. Showing fallback image. - - - + +

(The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

+

+ Now, this works well only if we can pick good starting points, and our curve is + continuously differentiable and doesn't have + oscillations. Glossing over the exact meaning of those terms, the + curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: + which starting points do we pick? +

+

+ As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from + t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may + pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: + there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order + curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as + high as you'll ever need to go. +

+

In conclusion:

+

+ So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those + roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red: +

+ + + Scripts are disabled. Showing fallback image. + + +

And for cubic curves, that means first and second derivatives, in red and purple respectively:

+ + + Scripts are disabled. Showing fallback image. + + +
+
+

+ + Bounding boxes +

+

+ If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four + values we need to box in our curve: +

+

Computing the bounding box for a Bézier curve:

+
    +
  1. Find all t value(s) for the curve derivative's x- and y-roots.
  2. +
  3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
  4. +
  5. + Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original + functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct. +
  6. +
+

+ Applying this approach to our previous root finding, we get the following + axis-aligned bounding boxes (with all curve extremity points + shown on the curve): +

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
-
-
-

Bounding boxes

-

If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four values we need to box in our curve:

-

Computing the bounding box for a Bézier curve:

-
    -
  1. Find all t value(s) for the curve derivative's x- and y-roots.
  2. -
  3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
  4. -
  5. Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct.
  6. -
-

Applying this approach to our previous root finding, we get the following axis-aligned bounding boxes (with all curve extremity points shown on the curve):

-
- - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
- -

We can construct even nicer boxes by aligning them along our curve, rather than along the x- and y-axis, but in order to do so we first need to look at how aligning works.

- -
-
-

Aligning curves

-

While there are an incredible number of curves we can define by varying the x- and y-coordinates for the control points, not all curves are actually distinct. For instance, if we define a curve, and then rotate it 90 degrees, it's still the same curve, and we'll find its extremities in the same spots, just at different draw coordinates. As such, one way to make sure we're working with a "unique" curve is to "axis-align" it.

-

Aligning also simplifies a curve's functions. We can translate (move) the curve so that the first point lies on (0,0), which turns our n term polynomial functions into n-1 term functions. The order stays the same, but we have less terms. Then, we can rotate the curves so that the last point always lies on the x-axis, too, making its coordinate (...,0). This further simplifies the function for the y-component to an n-2 term function. For instance, if we have a cubic curve such as this:

- - -

Then translating it so that the first coordinate lies on (0,0), moving all x coordinates by -120, and all y coordinates by -160, gives us:

- - -

If we then rotate the curve so that its end point lies on the x-axis, the coordinates (integer-rounded for illustrative purposes here) become:

- - -

If we drop all the zero-terms, this gives us:

- - -

We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae:

- - - Scripts are disabled. Showing fallback image. - - - -

 

- - - Scripts are disabled. Showing fallback image. - - - + +

+ We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning + our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae: +

+ + + Scripts are disabled. Showing fallback image. + + +

 

+ + + Scripts are disabled. Showing fallback image. + + +
+
+

+ + Tight bounding boxes +

+

+ With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, + recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding + box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T. +

+

We now have nice tight bounding boxes for our curves:

+
+ + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
-
-
-

Tight bounding boxes

-

With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T.

-

We now have nice tight bounding boxes for our curves:

-
- - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
- -

These are, strictly speaking, not necessarily the tightest possible bounding boxes. It is possible to compute the optimal bounding box by determining which spanning lines we need to effect a minimal box area, but because of the parametric nature of Bézier curves this is actually a rather costly operation, and the gain in bounding precision is often not worth it.

- -
-
-

Curve inflections

-

Now that we know how to align a curve, there's one more thing we can calculate: inflection points. Imagine we have a variable size circle that we can slide up against our curve. We place it against the curve and adjust its radius so that where it touches the curve, the curvatures of the curve and the circle are the same, and then we start to slide the circle along the curve - for quadratic curves, we can always do this without the circle behaving oddly: we might have to change the radius of the circle as we slide it along, but it'll always sit against the same side of the curve.

-

But what happens with cubic curves? Imagine we have an S curve and we place our circle at the start of the curve, and start sliding it along. For a while we can simply adjust the radius and things will be fine, but once we get to the midpoint of that S, something odd happens: the circle "flips" from one side of the curve to the other side, in order for the curvatures to keep matching. This is called an inflection, and we can find out where those happen relatively easily.

-

What we need to do is solve a simple equation:

- - -

What we're saying here is that given the curvature function C(t), we want to know for which values of t this function is zero, meaning there is no "curvature", which will be exactly at the point between our circle being on one side of the curve, and our circle being on the other side of the curve. So what does C(t) look like? Actually something that seems not too hard:

- - -

The function C(t) is the cross product between the first and second derivative functions for the parametric dimensions of our curve. And, as already shown, derivatives of Bézier curves are just simpler Bézier curves, with very easy to compute new coefficients, so this should be pretty easy.

-

However as we've seen in the section on aligning, aligning lets us simplify things a lot, by completely removing the contributions of the first coordinate from most mathematical evaluations, and removing the last y coordinate as well by virtue of the last point lying on the x-axis. So, while we can evaluate C(t) = 0 for our curve, it'll be much easier to first axis-align the curve and then evaluating the curvature function.

-
- -

Let's derive the full formula anyway

-

Of course, before we do our aligned check, let's see what happens if we compute the curvature function without axis-aligning. We start with the first and second derivatives, given our basis functions:

- - -

And of course the same functions for y:

- - -

Asking a computer to now compose the C(t) function for us (and to expand it to a readable form of simple terms) gives us this rather overly complicated set of arithmetic expressions:

- - -

That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all disappear if we axis-align our curve, which is why aligning is a great idea.

-
+ +

+ That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all + disappear if we axis-align our curve, which is why aligning is a great idea. +

+ -

Aligning our curve so that three of the eight coefficients become zero, and observing that scale does not affect finding t values, we end up with the following simple term function for C(t):

- - -

That's a lot easier to work with: we see a fair number of terms that we can compute and then cache, giving us the following simplification:

- - -

This is a plain quadratic curve, and we know how to solve C(t) = 0; we use the quadratic formula:

- - -

We can easily compute this value if the discriminator isn't a negative number (because we only want real roots, not complex roots), and if x is not zero, because divisions by zero are rather useless.

-

Taking that into account, we compute t, we disregard any t value that isn't in the Bézier interval [0,1], and we now know at which t value(s) our curve will inflect.

- - - Scripts are disabled. Showing fallback image. - - - - -
-
-

The canonical form (for cubic curves)

-

While quadratic curves are relatively simple curves to analyze, the same cannot be said of the cubic curve. As a curvature is controlled by more than one control point, it exhibits all kinds of features like loops, cusps, odd colinear features, and as many as two inflection points because the curvature can change direction up to three times. Now, knowing what kind of curve we're dealing with means that some algorithms can be run more efficiently than if we have to implement them as generic solvers, so is there a way to determine the curve type without lots of work?

-

As it so happens, the answer is yes, and the solution we're going to look at was presented by Maureen C. Stone from Xerox PARC and Tony D. deRose from the University of Washington in their joint paper "A Geometric Characterization of Parametric Cubic curves". It was published in 1989, and defines curves as having a "canonical" form (i.e. a form that all curves can be reduced to) from which we can immediately tell what features a curve will have. So how does it work?

-

The first observation that makes things work is that if we have a cubic curve with four points, we can apply a linear transformation to these points such that three of the points end up on (0,0), (0,1) and (1,1), with the last point then being "somewhere". After applying that transformation, the location of that last point can then tell us what kind of curve we're dealing with. Specifically, we see the following breakdown:

- - - Scripts are disabled. Showing fallback image. - - - -

This is a fairly funky image, so let's see what the various parts of it mean...

-

We see the three fixed points at (0,0), (0,1) and (1,1). The various regions and boundaries indicate what property the original curve will have, if the fourth point is in/on that region or boundary. Specifically, if the fourth point is...

-
    -
  1. ...anywhere inside the red zone, but not on its boundaries, the curve will be self-intersecting (yielding a loop). We won't know where it self-intersects (in terms of t values), but we are guaranteed that it does.

    -
  2. -
  3. ...on the left (red) edge of the red zone, the curve will have a cusp. We again don't know where, but we know there is one. This edge is described by the function:

    - - -
  4. -
  5. ...on the almost circular, lower right (pink) edge, the curve's end point touches the curve, forming a loop. This edge is described by the function:

    - - -
  6. -
  7. ...on the top (blue) edge, the curve's start point touches the curve, forming a loop. This edge is described by the function:

    - - -
  8. -
  9. ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    -
  10. -
  11. ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two inflections (switching from concave to convex and then back again, or from convex to concave and then back again).

    -
  12. -
  13. ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

    -
  14. -
-

Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

-
+ + +
  • +

    ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    +
  • +
  • +

    + ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two + inflections (switching from concave to convex and then back again, or from convex to concave and then back again). +

    +
  • +
  • ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

  • + +

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    +
    +

    Wait, where do those lines come from?

    +

    + Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, + and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield + formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic + expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the + properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval. +

    +

    + The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general + cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to + it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general + curve does over the interval t=0 to t=1. +

    +

    + For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you + can probably follow along with this paper, although you might have to read it a few times before all the bits "click". +

    +
    -

    Wait, where do those lines come from?

    -

    Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval.

    -

    The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general curve does over the interval t=0 to t=1.

    -

    For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you can probably follow along with this paper, although you might have to read it a few times before all the bits "click".

    -
    - -

    So now the question becomes: how do we manipulate our curve so that it fits this canonical form, with three fixed points, and one "free" point? Enter linear algebra. Don't worry, I'll be doing all the math for you, as well as show you what the effect is on our curves, but basically we're going to be using linear algebra, rather than calculus, because "it's way easier". Sometimes a calculus approach is very hard to work with, when the equivalent geometrical solution is super obvious.

    -

    The approach is going to start with a curve that doesn't have all-colinear points (so we need to make sure the points don't all fall on a straight line), and then applying three graphics operations that you will probably have heard of: translation (moving all points by some fixed x- and y-distance), scaling (multiplying all points by some x and y scale factor), and shearing (an operation that turns rectangles into parallelograms).

    -

    Step 1: we translate any curve by -p1.x and -p1.y, so that the curve starts at (0,0). We're going to make use of an interesting trick here, by pretending our 2D coordinates are 3D, with the z coordinate simply always being 1. This is an old trick in graphics to overcome the limitations of 2D transformations: without it, we can only turn (x,y) coordinates into new coordinates of the form (ax + by, cx + dy), which means we can't do translation, since that requires we end up with some kind of (x + a, y + b). If we add a bogus z coordinate that is always 1, then we can suddenly add arbitrary values. For example:

    - - -

    Sweet! z stays 1, so we can effectively ignore it entirely, but we added some plain values to our x and y coordinates. So, if we want to subtract p1.x and p1.y, we use:

    - - -

    Running all our coordinates through this transformation gives a new set of coordinates, let's call those U, where the first coordinate lies on (0,0), and the rest is still somewhat free. Our next job is to make sure point 2 ends up lying on the x=0 line, so what we want is a transformation matrix that, when we run it, subtracts x from whatever x we currently have. This is called shearing, and the typical x-shear matrix and its transformation looks like this:

    - - -

    So we want some shearing value that, when multiplied by y, yields -x, so our x coordinate becomes zero. That value is simply -x/y, because *-x/y * y = -x*. Done:

    - - -

    Now, running this on all our points generates a new set of coordinates, let's call those V, which now have point 1 on (0,0) and point 2 on (0, some-value), and we wanted it at (0,1), so we need to do some scaling to make sure it ends up at (0,1). Additionally, we want point 3 to end up on (1,1), so we can also scale x to make sure its x-coordinate will be 1 after we run the transform. That means we'll be x-scaling by 1/point3x, and y-scaling by point2y. This is really easy:

    - - -

    Then, finally, this generates a new set of coordinates, let's call those W, of which point 1 lies on (0,0), point 2 lies on (0,1), and point three lies on (1, ...) so all that's left is to make sure point 3 ends up at (1,1) - but we can't scale! Point 2 is already in the right place, and y-scaling would move it out of (0,1) again, so our only option is to y-shear point three, just like how we x-sheared point 2 earlier. In this case, we do the same trick, but with y/x rather than x/y because we're not x-shearing but y-shearing. Additionally, we don't actually want to end up at zero (which is what we did before) so we need to shear towards an offset, in this case 1:

    - - -

    And this generates our final set of four coordinates. Of these, we already know that points 1 through 3 are (0,0), (0,1) and (1,1), and only the last coordinate is "free". In fact, given any four starting coordinates, the resulting "transformation mapped" coordinate will be:

    - - -

    Okay, well, that looks plain ridiculous, but: notice that every coordinate value is being offset by the initial translation, and also notice that a lot of terms in that expression are repeated. Even though the maths looks crazy as a single expression, we can just pull this apart a little and end up with an easy-to-calculate bit of code!

    -

    First, let's just do that translation step as a "preprocessing" operation so we don't have to subtract the values all the time. What does that leave?

    - - -

    Suddenly things look a lot simpler: the mapped x is fairly straight forward to compute, and we see that the mapped y actually contains the mapped x in its entirety, so we'll have that part already available when we need to evaluate it. In fact, let's pull out all those common factors to see just how simple this is:

    - - -

    That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it look!

    -
    + +

    + That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it + look! +

    +
    +

    How do you track all that?

    +

    + Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply + use Mathematica. Tracking all this math by hand is insane, and we invented computers, + literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the + program do the math for me. And real math, too, with symbols, not with numbers. In fact, + here's the Mathematica notebook if you want to see how + this works for yourself. +

    +

    + Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's + $344 for home use, up from $295 when I original wrote this, but it's + also free when you buy a $35 raspberry pi. Obviously, I bought a + raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do + it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for. +

    +
    -

    How do you track all that?

    -

    Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply use Mathematica. Tracking all this math by hand is insane, and we invented computers, literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the program do the math for me. And real math, too, with symbols, not with numbers. In fact, here's the Mathematica notebook if you want to see how this works for yourself.

    -

    Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's $344 for home use, up from $295 when I original wrote this, but it's also free when you buy a $35 raspberry pi. Obviously, I bought a raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for.

    -
    - -

    So, let's write up a sketch that'll show us the canonical form for any curve drawn in blue, overlaid on our canonical map, so that we can immediately tell which features our curve must have, based on where the fourth coordinate is located on the map:

    - - - Scripts are disabled. Showing fallback image. - - - - -
    -
    -

    Finding Y, given X

    -

    One common task that pops up in things like CSS work, or parametric equalizers, or image leveling, or any other number of applications where Bézier curves are used as control curves in a way that there is really only ever one "y" value associated with one "x" value, you might want to cut out the middle man, as it were, and compute "y" directly based on "x". After all, the function looks simple enough, finding the "y" value should be simple too, right? Unfortunately, not really. However, it is possible and as long as you have some code in place to help, it's not a lot of a work either.

    -

    We'll be tackling this problem in two stages: the first, which is the hard part, is figuring out which "t" value belongs to any given "x" value. For instance, have a look at the following graphic. On the left we have a Bézier curve that looks for all intents and purposes like it fits our criteria: every "x" has one and only one associated "y" value. On the right we see the function for just the "x" values: that's a cubic curve, but not a really crazy cubic curve. If you move the graphic's slider, you will see a red line drawn that corresponds to the x coordinate: this is a vertical line in the left graphic, and a horizontal line on the right.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Now, if you look more closely at that right graphic, you'll notice something interesting: if we treat the red line as "the x axis", then the point where the function crosses our line is really just a root for the cubic function x(t) through a shifted "x-axis"... and we've already seen how to calculate roots, so let's just run cubic root finding - and not even the complicated cubic case either: because of the kind of curve we're starting with, we know there is only root, simplifying the code we need!

    -

    First, let's look at the function for x(t):

    - - -

    We can rewrite this to a plain polynomial form, by just fully writing out the expansion and then collecting the polynomial factors, as:

    - - -

    Nothing special here: that's a standard cubic polynomial in "power" form (i.e. all the terms are ordered by their power of t). So, given that a, b, c, d, and x(t) are all known constants, we can trivially rewrite this (by moving the x(t) across the equal sign) as:

    - - -

    You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using Cardano's algorithm, and we're left with some rather short code:

    + +

    + You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they + all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using + Cardano's algorithm, and we're left with some rather short code: +

    - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So the procedure is fairly straight forward: pick an x, find the associated t value, evaluate our curve for that t value, which gives us the curve's {x,y} coordinate, which means we know y for this x. Move the slider for the following graphic to see this in action:

    - - - Scripts are disabled. Showing fallback image. - - - - - +y = curve.get(t).y + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Arc length

    -

    How long is a Bézier curve? As it turns out, that's not actually an easy question, because the answer requires maths that —much like root finding— cannot generally be solved the traditional way. If we have a parametric curve with fx(t) and fy(t), then the length of the curve, measured from start point to some point t = z, is computed using the following seemingly straight forward (if a bit overwhelming) formula:

    - - -

    or, more commonly written using Leibnitz notation as:

    - - -

    This formula says that the length of a parametric curve is in fact equal to the area underneath a function that looks a remarkable amount like Pythagoras' rule for computing the diagonal of a straight angled triangle. This sounds pretty simple, right? Sadly, it's far from simple... cutting straight to after the chase is over: for quadratic curves, this formula generates an unwieldy computation, and we're simply not going to implement things that way. For cubic Bézier curves, things get even more fun, because there is no "closed form" solution, meaning that due to the way calculus works, there is no generic formula that allows you to calculate the arc length. Let me just repeat this, because it's fairly crucial: for cubic and higher Bézier curves, there is no way to solve this function if you want to use it "for all possible coordinates".

    -

    Seriously: It cannot be done.

    -

    So we turn to numerical approaches again. The method we'll look at here is the Gauss quadrature. This approximation is a really neat trick, because for any nth degree polynomial it finds approximated values for an integral really efficiently. Explaining this procedure in length is way beyond the scope of this page, so if you're interested in finding out why it works, I can recommend the University of South Florida video lecture on the procedure, linked in this very paragraph. The general solution we're looking for is the following:

    - - -

    In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the better the approximation:

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    + +

    + In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips + sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The + more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the + better the approximation: +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -

    Now, infinitely many terms to sum and infinitely thin rectangles are not something that computers can work with, so instead we're going to approximate the infinite summation by using a sum of a finite number of "just thin" rectangular strips. As long as we use a high enough number of thin enough rectangular strips, this will give us an approximation that is pretty close to what the real value is.

    -

    So, the trick is to come up with useful rectangular strips. A naive way is to simply create n strips, all with the same width, but there is a far better way using special values for C and f(t) depending on the value of n, which indicates how many strips we'll use, and it's called the Legendre-Gauss quadrature.

    -

    This approach uses strips that are not spaced evenly, but instead spaces them in a special way based on describing the function as a polynomial (the more strips, the more accurate the polynomial), and then computing the exact integral for that polynomial. We're essentially performing arc length computation on a flattened curve, but flattening it based on the intervals dictated by the Legendre-Gauss solution.

    -
    - -

    Note that one requirement for the approach we'll use is that the integral must run from -1 to 1. That's no good, because we're dealing with Bézier curves, and the length of a section of curve applies to values which run from 0 to "some value smaller than or equal to 1" (let's call that value z). Thankfully, we can quite easily transform any integral interval to any other integral interval, by shifting and scaling the inputs. Doing so, we get the following:

    - - -

    That may look a bit more complicated, but the fraction involving z is a fixed number, so the summation, and the evaluation of the f(t) values are still pretty simple.

    -

    So, what do we need to perform this calculation? For one, we'll need an explicit formula for f(t), because that derivative notation is handy on paper, but not when we have to implement it. We'll also need to know what these Ci and ti values should be. Luckily, that's less work because there are actually many tables available that give these values, for any n, so if we want to approximate our integral with only two terms (which is a bit low, really) then these tables would tell us that for n=2 we must use the following values:

    - - -

    Which means that in order for us to approximate the integral, we must plug these values into the approximate function, which gives us:

    - - -

    We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the Bézier curve functions Bx(t) and By(t).

    -
    + +

    + We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the + Bézier curve functions Bx(t) and By(t). +

    + -

    If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the start/end points on the inside?

    - - - Scripts are disabled. Showing fallback image. - - - +

    + If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), + we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, + with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make + a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the + start/end points on the inside? +

    + + + Scripts are disabled. Showing fallback image. + + + +
    +
    +

    + + Approximated arc length +

    +

    + Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length + instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This + will come with an error, but this can be made arbitrarily small by increasing the segment count. +

    +

    + If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal + effort: +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
    -
    -
    -

    Approximated arc length

    -

    Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This will come with an error, but this can be made arbitrarily small by increasing the segment count.

    -

    If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal effort:

    -
    - - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
    - -

    You may notice that even though the error in length is actually pretty significant in absolute terms, even at a low number of segments we get a length that agrees with the true length when it comes to just the integer part of the arc length. Quite often, approximations can drastically speed things up!

    - -
    -
    -

    Curvature of a curve

    -

    If we have two curves, and we want to line them in up in a way that "looks right", what would we use as metric to let a computer decide what "looks right" means?

    -

    For instance, we can start by ensuring that the two curves share an end coordinate, so that there is no "gap" between the end of one and the start of the next curve, but that won't guarantee that things look right: both curves can be going in wildly different directions, and the resulting joined geometry will have a corner in it, rather than a smooth transition from one curve to the next.

    -

    What we want is to ensure that the curvature at the transition from one curve to the next "looks good". So, we start with a shared coordinate, and then also require that derivatives for both curves match at that coordinate. That way, we're assured that their tangents line up, which must mean the curve transition is perfectly smooth. We can even make the second, third, etc. derivatives match up for better and better transitions.

    -

    Problem solved!

    -

    However, there's a problem with this approach: if we think about this a little more, we realise that "what a curve looks like" and its derivative values are pretty much entirely unrelated. After all, the section on reordering curves showed us that the same looking curve can have an infinite number of curve expressions of arbitrarily high Bézier degree, and each of those will have wildly different derivative values.

    -

    So what we really want is some kind of expression that's not based on any particular expression of t, but is based on something that is invariant to the kind of function(s) we use to draw our curve. And the prime candidate for this is our curve expression, reparameterised for distance: no matter what order of Bézier curve we use, if we were able to rewrite it as a function of distance-along-the-curve, all those different degree Bézier functions would end up being the same function for "coordinate at some distance D along the curve".

    -

    We've seen this before... that's the arc length function.

    -

    So you might think that in order to find the curvature of a curve, we now need to solve the arc length function itself, and that this would be quite a problem because we just saw that there is no way to actually do that. Thankfully, we don't. We only need to know the form of the arc length function, which we saw above and is fairly simple, rather than needing to solve the arc length function. If we start with the arc length expression and the run through the steps necessary to determine its derivative (with an alternative, shorter demonstration of how to do this found over on Stackexchange), then the integral that was giving us so much problems in solving the arc length function disappears entirely (because of the fundamental theorem of calculus), and what we're left with us some surprisingly simple maths that relates curvature (denoted as κ, "kappa") to—and this is the truly surprising bit—a specific combination of derivatives of our original function.

    -

    Let me highlight what just happened, because it's pretty special:

    -
      -
    1. we wanted to make curves line up, and initially thought to match the curves' derivatives, but
    2. -
    3. that turned out to be a really bad choice, so instead
    4. -
    5. we picked a function that is basically impossible to work with, and then worked with that, which
    6. -
    7. gives us a simple formula that is and expression using the curves' derivatives.
    8. -
    -

    That's crazy!

    -

    But that's also one of the things that makes maths so powerful: even if your initial ideas are off the mark, you might be much closer than you thought you were, and the journey from "thinking we're completely wrong" to "actually being remarkably close to being right" is where we can find a lot of insight.

    -

    So, what does the function look like? This:

    - - -

    Which is really just a "short form" that glosses over the fact that we're dealing with functions of t, so let's expand that a tiny bit:

    - - -

    And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so evaluating this function for any t value is just a matter of basic arithematics.

    -

    In fact, let's just implement it right now:

    + +

    + And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any + (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that + looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier + curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so + evaluating this function for any t value is just a matter of basic arithematics. +

    +

    In fact, let's just implement it right now:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    That was easy! (Well okay, that "not a number" value will need to be taken into account by downstream code, but that's a reality of programming anyway)

    -

    With all of that covered, let's line up some curves! The following graphic gives you two curves that look identical, but use quadratic and cubic functions, respectively. As you can see, despite their derivatives being necessarily different, their curvature (thanks to being derived based on maths that "ignores" specific function derivative, and instead gives a formula that smooths out any differences) is exactly the same. And because of that, we can put them together such that the point where they overlap has the same curvature for both curves, giving us the smoothest transition.

    - - - Scripts are disabled. Showing fallback image. - - - -

    One thing you may have noticed in this sketch is that sometimes the curvature looks fine, but seems to be pointing in the wrong direction, making it hard to line up the curves properly. A way around that, of course, is to show the curvature on both sides of the curve, so let's just do that. But let's take it one step further: we can also compute the associated "radius of curvature", which gives us the implicit circle that "fits" the curve's curvature at any point, using what is possibly the simplest bit of maths found in this entire primer:

    - - -

    So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" our curve at some point that we can control by using a slider:

    - - - Scripts are disabled. Showing fallback image. - - - - - + +

    + So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" + our curve at some point that we can control by using a slider: +

    + + + Scripts are disabled. Showing fallback image. + + + + + +
    +
    +

    + + Tracing a curve at fixed distance intervals +

    +

    + Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance + intervals over time, like a train along a track, and you want to use Bézier curves. +

    +

    Now you have a problem.

    +

    + The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a + track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick + t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In + fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another. +

    +

    + The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be + straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To + make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for + this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length + function, which can have more inflection points. +

    + + + Scripts are disabled. Showing fallback image. + + +

    + So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through + the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and + then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of + points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two + points, but if we have a high enough number of samples, we don't even need to bother. +

    +

    + So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t + curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks + like, by coloring each section of curve between two distance markers differently: +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    +

    + However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to + constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT). +

    +
    +
    +

    + + Intersections +

    +

    + Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to + work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to + perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box + overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the + actual curve. +

    +

    + We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by + implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both + lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution. +

    +

    Line-line intersections

    +

    + If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are + an intervals on by linear algebra, using the procedure outlined in this + top coder + article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line + segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments. +

    +

    + The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on + (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection + point). +

    + + + Scripts are disabled. Showing fallback image. + + + +
    +

    Implementing line-line intersections

    +

    + Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, + but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe + you're using point structs for the line. Let's get coding: +

    -
    -
    -

    Tracing a curve at fixed distance intervals

    -

    Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance intervals over time, like a train along a track, and you want to use Bézier curves.

    -

    Now you have a problem.

    -

    The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another.

    -

    The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length function, which can have more inflection points.

    - - - Scripts are disabled. Showing fallback image. - - - -

    So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two points, but if we have a high enough number of samples, we don't even need to bother.

    -

    So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks like, by coloring each section of curve between two distance markers differently:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    -

    However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT).

    - -
    -
    -

    Intersections

    -

    Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the actual curve.

    -

    We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution.

    -

    Line-line intersections

    -

    If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are an intervals on by linear algebra, using the procedure outlined in this top coder article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments.

    -

    The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection point).

    - - - Scripts are disabled. Showing fallback image. - - - -
    - -

    Implementing line-line intersections

    -

    Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe you're using point structs for the line. Let's get coding:

    - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    - -
    + return lli4(line1.p1, line1.p2, line2.p1, line2.p2) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + -

    What about curve-line intersections?

    -

    Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the section on finding extremities.

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    +

    What about curve-line intersections?

    +

    + Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we + translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in + a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a + curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the + section on finding extremities. +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -

    Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and curve splitting.

    +

    + Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the + curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and + curve splitting. +

    +
    +
    +

    + + Curve/curve intersection +

    +

    + Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" + technique: +

    +
      +
    1. + Take two curves C1 and C2, and treat them as a pair. +
    2. +
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. +
    5. + With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2). +
    6. +
    7. + For each pair, check whether their bounding boxes overlap. +
        +
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. +
      3. If there is overlap, rerun all steps for this pair.
      4. +
      +
    8. +
    9. + Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we + might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value + (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative). +
    10. +
    +

    + This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then + taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections. +

    +

    + The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click + the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value + that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the + algorithm, so you can try this with lots of different curves. +

    +

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    + + + Scripts are disabled. Showing fallback image. + + + + + + +

    + Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into + two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at + t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each + iteration, and each successive step homing in on the curve's self-intersection points. +

    +
    +
    +

    + + The projection identity +

    +

    + De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to + draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. + Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point + around to change the curve's shape. +

    +

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    +

    + In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want + to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know + has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for + reasons that will become obvious. +

    +

    + So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following + graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which + taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what + happens to that value? +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
    -
    -
    -

    Curve/curve intersection

    -

    Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" technique:

    -
      -
    1. Take two curves C1 and C2, and treat them as a pair.
    2. -
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. -
    5. With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2).
    6. -
    7. For each pair, check whether their bounding boxes overlap.
        -
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. -
      3. If there is overlap, rerun all steps for this pair.
      4. -
      -
    8. -
    9. Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative).
    10. -
    -

    This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections.

    -

    The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the algorithm, so you can try this with lots of different curves.

    -

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    - - - Scripts are disabled. Showing fallback image. - - - - - - -

    Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each iteration, and each successive step homing in on the curve's self-intersection points.

    - -
    -
    -

    The projection identity

    -

    De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point around to change the curve's shape.

    -

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    -

    In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for reasons that will become obvious.

    -

    So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what happens to that value?

    -
    - - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
    - -

    So these graphics show us several things:

    -
      -
    1. a point at the tip of the curve construction's "hat": let's call that A, as well as
    2. -
    3. our on-curve point give our chosen t value: let's call that B, and finally,
    4. -
    5. a point that we get by projecting A, through B, onto the line between the curve's start and end points: let's call that C.
    6. -
    7. for both quadratic and cubic curves, two points e1 and e2, which represent the single-to-last step in de Casteljau's algorithm: in the last step, we find B at (1-t) * e1 + t * e2.
    8. -
    9. for cubic curves, also the points v1 and v2, which together with A represent the first step in de Casteljau's algorithm: in the next step, we find e1 and e2.
    10. -
    -

    These three values A, B, and C allow us to derive an important identity formula for quadratic and cubic Bézier curves: for any point on the curve with some t value, the ratio of distances from A to B and B to C is fixed: if some t value sets up a C that is 20% away from the start and 80% away from the end, then it doesn't matter where the start, end, or control points are; for that t value, C will always lie at 20% from the start and 80% from the end point. Go ahead, pick an on-curve point in either graphic and then move all the other points around: if you only move the control points, start and end won't move, and so neither will C, and if you move either start or end point, C will move but its relative position will not change.

    -

    So, how can we compute C? We start with our observation that C always lies somewhere between the start and end points, so logically C will have a function that interpolates between those two coordinates:

    - - -

    If we can figure out what the function u(t) looks like, we'll be done. Although we do need to remember that this u(t) will have a different form depending on whether we're working with quadratic or cubic curves. Running through the maths (with thanks to Boris Zbarsky) shows us the following two formulae:

    - - -

    And

    - - -

    So, if we know the start and end coordinates and the t value, we know C without having to calculate the A or even B coordinates. In fact, we can do the same for the ratio function. As another function of t, we technically don't need to know what A or B or C are. It, too, can be expressed as a pure function of t.

    -

    We start by observing that, given A, B, and C, the following always holds:

    - - -

    Working out the maths for this, we see the following two formulae for quadratic and cubic curves:

    - - -

    And

    - - -

    Which now leaves us with some powerful tools: given three points (start, end, and "some point on the curve"), as well as a t value, we can construct curves. We can compute C using the start and end points and our u(t) function, and once we have C, we can use our on-curve point (B) and the ratio(t) function to find A:

    - - -

    With A found, finding e1 and e2 for quadratic curves is a matter of running the linear interpolation with t between start and A to yield e1, and between A and end to yield e2. For cubic curves, there is no single pair of points that can act as e1 and e2 (there are infinitely many, because the tangent at B is a free parameter for cubic curves) so as long as the distance ratio between e1 to B and B to e2 is the Bézier ratio (1-t):t, we are free to pick any pair, after which we can reverse engineer v1 and v2:

    - - -

    And then reverse engineer the curve's control points:

    - - -

    So: if we have a curve's start and end points, as well as some third point B that we want the curve to pass through, then for any t value we implicitly know all the ABC values, which (combined with an educated guess on appropriate e1 and e2 coordinates for cubic curves) gives us the necessary information to reconstruct a curve's "de Casteljau skeleton". Which means that we can now do several things: we can "fit" curves using only three points, which means we can also "mold" curves by moving an on-curve point but leaving its start and end points, and then reconstruct the curve based on where we moved the on-curve point to. These are very useful things, and we'll look at both in the next few sections.

    - -
    -
    -

    Creating a curve from three points

    -

    Given the preceding section, you might be wondering if we can use that knowledge to just "create" curves by placing some points and having the computer do the rest, to which the answer is: that's exactly what we can now do!

    -

    For quadratic curves, things are pretty easy. Technically, we'll need a t value in order to compute the ratio function used in computing the ABC coordinates, but we can just as easily approximate one by treating the distance between the start and B point, and B and end point as a ratio, using

    - - -

    With this code in place, creating a quadratic curve from three points is literally just computing the ABC values, and using A as our curve's control point:

    - - - Scripts are disabled. Showing fallback image. - - - -

    For cubic curves we need to do a little more work, but really only just a little. We're first going to assume that a decent curve through the three points should approximate a circular arc, which first requires knowing how to fit a circle to three points. You may remember (if you ever learned it!) that a line between two points on a circle is called a chord, and that one property of chords is that the line from the center of any chord, perpendicular to that chord, passes through the center of the circle.

    -

    That means that if we have three points on a circle, we have three (different) chords, and consequently, three (different) lines that go from those chords through the center of the circle: if we find two of those lines, then their intersection will be our circle's center, and the circle's radius will—by definition!—be the distance from the center to any of our three points:

    - - - Scripts are disabled. Showing fallback image. - - - -

    With that covered, we now also know the tangent line to our point B, because the tangent to any point on the circle is a line through that point, perpendicular to the line from that point to the center. That just leaves marking appropriate points e1 and e2 on that tangent, so that we can construct a new cubic curve hull. We use the approach as we did for quadratic curves to automatically determine a reasonable t value, and then our e1 and e2 coordinates must obey the standard de Casteljau rule for linear interpolation:

    - - -

    Where d is the total length of the line segment from e1 to e2. So how long do we make that? There are again all kinds of approaches we can take, and a simple-but-effective one is to set the length of that segment to "one third the length of the baseline". This forces e1 and e2 to always be the "linear curve" distance apart, which means if we place our three points on a line, it will actually look like a line. Nice! The last thing we'll need to do is make sure to flip the sign of d depending on which side of the baseline our B is located, so we don't end up creating a funky curve with a loop in it. To do this, we can use the atan2 function:

    - - -

    This angle φ will be between 0 and π if B is "above" the baseline (rotating all three points so that the start is on the left and the end is the right), so we can use a relatively straight forward check to make sure we're using the correct sign for our value d:

    - - -

    The result of this approach looks as follows:

    - - - Scripts are disabled. Showing fallback image. - - - -

    It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; more on that later), we're only trying to come up with some reasonable e1 and e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives us:

    - - - Scripts are disabled. Showing fallback image. - - - -

    That looks perfectly serviceable!

    -

    Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at implementing curve molding in the section after that, so read on!

    + +

    The result of this approach looks as follows:

    + + + Scripts are disabled. Showing fallback image. + + + +

    + It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, + we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; + more on that later), we're only trying to come up with some reasonable e1 and + e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives + us: +

    + + + Scripts are disabled. Showing fallback image. + + + +

    That looks perfectly serviceable!

    +

    + Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" + curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding + the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at + implementing curve molding in the section after that, so read on! +

    +
    +
    +

    + + Projecting a point onto a Bézier curve +

    +

    + Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're + trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the + curve our cursor will be closest to. So, how do we project points onto a curve? +

    +

    + If the Bézier curve is of low enough order, we might be able to + work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a + numerical approach again. We'll be finding our ideal t value using a + binary search. First, we do a coarse distance-check based on + t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast: +

    -
    -
    -

    Projecting a point onto a Bézier curve

    -

    Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the curve our cursor will be closest to. So, how do we project points onto a curve?

    -

    If the Bézier curve is of low enough order, we might be able to work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a numerical approach again. We'll be finding our ideal t value using a binary search. First, we do a coarse distance-check based on t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    After this runs, we know that LUT[i] is the coordinate on the curve in our LUT that is closest to the point we want to project, so that's a pretty good initial guess as to what the best projection onto our curve is. To refine it, we note that LUT[i] is a better guess than both LUT[i-1] and LUT[i+1], but there might be an even better projection somewhere else between those two values, so that's what we're going to be testing for, using a variation of the binary search.

    -
      -
    1. we start with our point p, and the t values t1=LUT[i-1].t and t2=LUT[i+1].t, which span an interval v = t2-t1.
    2. -
    3. we test this interval in five spots: the start, middle, and end (which we already have), and the two points in between the middle and start/end points
    4. -
    5. we then check which of these five points is the closest to our original point p, and then repeat step 1 with the points before and after the closest point we just found.
    6. -
    -

    This makes the interval we check smaller and smaller at each iteration, and we can keep running the three steps until the interval becomes so small as to lead to distances that are, for all intents and purposes, the same for all points.

    -

    So, let's see that in action: in this case, I'm going to arbitrarily say that if we're going to run the loop until the interval is smaller than 0.001, and show you what that means for projecting your mouse cursor or finger tip onto a rather complex Bézier curve (which, of course, you can reshape as you like). Also shown are the original three points that our coarse check finds.

    - - - Scripts are disabled. Showing fallback image. - - - + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + -
    -
    -

    Intersections with a circle

    -

    It might seem odd to cover this subject so much later than the line/line, line/curve, and curve/curve intersection topics from several sections earlier, but the reason we can't cover circle/curve intersections is that we can't really discuss circle/curve intersection until we've covered the kind of lookup table (LUT) walking that the section on projecting a point onto a curve uses. To see why, let's look at what we would have to do if we wanted to find the intersections between a curve and a circle using calculus.

    -

    First, we observe that "finding intersections" in this case means that, given a circle defined by a center point c = (x,y) and a radius r, we want to find all points on the Bezier curve for which the distance to the circle's center point is equal to the circle radius, which by definition means those points lie on the circle, and so count as intersections. In maths, that means we're trying to solve:

    - - -

    Which seems simple enough. Unfortunately, when we expand that dist function, things get a lot more problematic:

    - - -

    And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the Abel-Ruffini theorem that we saw before, we can't solve this by just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left with a monstrous function to solve.

    -

    So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a bit more code to make sure we find all of them, but let's look at the steps involved:

    + +

    + And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the + Abel-Ruffini theorem that we saw before, we can't solve this by + just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to + actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and + all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left + with a monstrous function to solve. +

    +

    + So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the + on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance + r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a + bit more code to make sure we find all of them, but let's look at the steps involved: +

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    - -

    This is very similar to the code in the previous section, with an extra input r for the circle radius, and a minor change in the "distance for this coordinate": rather than just distance(coordinate, p) we want to know the difference between that distance and the circle radius. After all, if that difference is zero, then the distance from the coordinate to the circle center is exactly the radius, so the coordinate lies on both the curve and the circle.

    -

    So far so good.

    -

    However, we also want to make sure we find all the points, not just a single one, so we need a little more code for that:

    + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    After running this code, values will be the list of all LUT coordinates that are closest to the distance r: we can use those values to run the same kind of refinement lookup we used for point projection (with the caveat that we're now not checking for smallest distance, but for "distance closest to r"), and we'll have all our intersection points. Of course, that does require explaining what findClosest does: rather than looking for a global minimum, we're now interested in finding a local minimum, so instead of checking a single point and looking at its distance value, we check three points ("current", "previous" and "before previous") and then check whether they form a local minimum:

    + start = i + 2; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    In words: given a start index, the circle center and radius, and our LUT, we check where (closest to our start index) we can find a local minimum for the difference between "the distance from the curve to the circle center", and the circle's radius. We track this by looking at three values (associated with the indices index-2, index-1, and index), and we know we've found a local minimum if the three values show that the middle value (pd1) is less than either value beside it. When we do, we can set our "best guess, relative to start" as index-1. Of course, since we're now checking values relative to some start value, we might not find another candidate value at all, in which case we return start - 1, so that a simple "is the result less than start?" lets us determine that there are no more intersections to find.

    -

    Finally, while not necessary for point projection, there is one more step we need to perform when we run the binary refinement function on our candidate LUT indices, because we've so far only been testing using distances "closest to the radius of the circle", and that's actually not good enough... we need distances that are the radius of the circle. So, after running the refinement for each of these indices, we need to discard any final value that isn't the circle radius. And because we're working with floating point numbers, what this really means is that we need to discard any value that's a pixel or more "off". Or, if we want to get really fancy, "some small epsilon value".

    -

    Based on all of that, the following graphic shows this off for the standard cubic curve (which you can move the coordinates around for, of course) and a circle with a controllable radius centered on the graphic's center, using the code approach described above.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    And of course, for the full details, click that "view source" link.

    + return start + i + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + -
    -
    -

    Molding a curve

    -

    Armed with knowledge of the "ABC" relation, point-on-curve projection, and guestimating reasonable looking helper values for cubic curve construction, we can finally cover curve molding: updating a curve's shape interactively, by dragging points on the curve around.

    -

    For quadratic curve, this is a really simple trick: we project our cursor onto the curve, which gives us a t value and initial B coordinate. We don't even need the latter: with our t value and "wherever the cursor is" as target B, we can compute the associated C:

    - - -

    And then the associated A:

    - - -

    And we're done, because that's our new quadratic control point!

    - - - Scripts are disabled. Showing fallback image. - - - -

    As before, cubic curves are a bit more work, because while it's easy to find our initial t value and ABC values, getting those all-important e1 and e2 coordinates is going to pose a bit of a problem... in the section on curve creation, we were free to pick an appropriate t value ourselves, which allowed us to find appropriate e1 and e2 coordinates. That's great, but when we're curve molding we don't have that luxury: whatever point we decide to start moving around already has its own t value, and its own e1 and e2 values, and those may not make sense for the rest of the curve.

    -

    For example, let's see what happens if we just "go with what we get" when we pick a point and start moving it around, preserving its t value and e1/e2 coordinates:

    - - - Scripts are disabled. Showing fallback image. - - - -

    That looks reasonable, close to the original point, but the further we drag our point, the less "useful" things become. Especially if we drag our point across the baseline, rather than turning into a nice curve.

    -

    One way to combat this might be to combine the above approach with the approach from the creating curves section: generate both the "unchanged t/e1/e2" curve, as well as the "idealized" curve through the start/cursor/end points, with idealized t value, and then interpolating between those two curves:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    The slide controls the "falloff distance" relative to where the original point on the curve is, so that as we drag our point around, it interpolates with a bias towards "preserving t/e1/e2" closer to the original point, and bias towards "idealized" form the further away we move our point, with anything that's further than our falloff distance simply being the idealized curve. We don't even try to interpolate at that point.

    -

    A more advanced way to try to smooth things out is to implement continuous molding, where we constantly update the curve as we move around, and constantly change what our B point is, based on constantly projecting the cursor on the curve as we're updating it - this is, you won't be surprised to learn, tricky, and beyond the scope of this section: interpolation (with a reasonable distance) will do for now!

    - -
    -
    -

    Curve fitting

    -

    Given the previous section, one question you might have is "what if I don't want to guess t values?". After all, plenty of graphics packages do automated curve fitting, so how can we implement that in a way that just finds us reasonable t values all on its own?

    -

    And really this is just a variation on the question "how do I get the curve through these X points?", so let's look at that. Specifically, let's look at the answer: "curve fitting". This is in fact a rather rich field in geometry, applying to anything from data modelling to path abstraction to "drawing", so there's a fair number of ways to do curve fitting, but we'll look at one of the most common approaches: something called a least squares polynomial regression. In this approach, we look at the number of points we have in our data set, roughly determine what would be an appropriate order for a curve that would fit these points, and then tackle the question "given that we want an nth order curve, what are the coordinates we can find such that our curve is "off" by the least amount?".

    -

    Now, there are many ways to determine how "off" points are from the curve, which is where that "least squares" term comes in. The most common tool in the toolbox is to minimise the squared distance between each point we have, and the corresponding point on the curve we end up "inventing". A curve with a snug fit will have zero distance between those two, and a bad fit will have non-zero distances between every such pair. It's a workable metric. You might wonder why we'd need to square, rather than just ensure that distance is a positive value (so that the total error is easy to compute by just summing distances) and the answer really is "because it tends to be a little better". There's lots of literature on the web if you want to deep-dive the specific merits of least squared error metrics versus least absolute error metrics, but those are well beyond the scope of this material.

    -

    So let's look at what we end up with in terms of curve fitting if we start with the idea of performing least squares Bézier fitting. We're going to follow a procedure similar to the one described by Jim Herold over on his "Least Squares Bézier Fit" article, and end with some nice interactive graphics for doing some curve fitting.

    -

    Before we begin, we're going to use the curve in matrix form. In the section on matrices, I mentioned that some things are easier if we use the matrix representation of a Bézier curve rather than its calculus form, and this is one of those things.

    -

    As such, the first step in the process is expressing our Bézier curve as powers/coefficients/coordinate matrix T x M x C, by expanding the Bézier functions.

    -
    - -

    Revisiting the matrix representation

    -

    Rewriting Bézier functions to matrix form is fairly easy, if you first expand the function, and then arrange them into a multiple line form, where each line corresponds to a power of t, and each column is for a specific coefficient. First, we expand the function:

    - - -

    And then we (trivially) rearrange the terms across multiple lines:

    - - -

    This rearrangement has "factors of t" at each row (the first row is t⁰, i.e. "1", the second row is t¹, i.e. "t", the third row is t²) and "coefficient" at each column (the first column is all terms involving "a", the second all terms involving "b", the third all terms involving "c").

    -

    With that arrangement, we can easily decompose this as a matrix multiplication:

    - - -

    We can do the same for the cubic curve, of course. We know the base function for cubics:

    - - -

    So we write out the expansion and rearrange:

    - - -

    Which we can then decompose:

    - - -

    And, of course, we can do this for quartic curves too (skipping the expansion step):

    - - -

    And so and on so on. Now, let's see how to use these T, M, and C, to do some curve fitting.

    -
    + +

    + And so and on so on. Now, let's see how to use these T, M, and C, to do some curve + fitting. +

    + -

    Let's get started: we're going to assume we picked the right order curve: for n points we're fitting an n-1th order curve, so we "start" with a vector P that represents the coordinates we already know, and for which we want to do curve fitting:

    - - -

    Next, we need to figure out appropriate t values for each point in the curve, because we need something that lets us tie "the actual coordinate" to "some point on the curve". There's a fair number of different ways to do this (and a large part of optimizing "the perfect fit" is about picking appropriate t values), but in this case let's look at two "obvious" choices:

    -
      -
    1. equally spaced t values, and
    2. -
    3. t values that align with distance along the polygon.
    4. -
    -

    The first one is really simple: if we have n points, then we'll just assign each point i a t value of (i-1)/(n-1). So if we have four points, the first point will have t=(1-1)/(4-1)=0/3, the second point will have t=(2-1)/(4-1)=1/3, the third point will have t=2/3, and the last point will be t=1. We're just straight up spacing the t values to match the number of points we have.

    -

    The second one is a little more interesting: since we're doing polynomial regression, we might as well exploit the fact that our base coordinates just constitute a collection of line segments. At the first point, we're fixing t=0, and the last point, we want t=1, and anywhere in between we're simply going to say that t is equal to the distance along the polygon, scaled to the [0,1] domain.

    -

    To get these values, we first compute the general "distance along the polygon" matrix:

    - - -

    Where length() is literally just that: the length of the line segment between the point we're looking at, and the previous point. This isn't quite enough, of course: we still need to make sure that all the values between i=1 and i=n fall in the [0,1] interval, so we need to scale all values down by whatever the total length of the polygon is:

    - - -

    And now we can move on to the actual "curve fitting" part: what we want is a function that lets us compute "ideal" control point values such that if we build a Bézier curve with them, that curve passes through all our original points. Or, failing that, have an overall error distance that is as close to zero as we can get it. So, let's write out what the error distance looks like.

    -

    As mentioned before, this function is really just "the distance between the actual coordinate, and the coordinate that the curve evaluates to for the associated t value", which we'll square to get rid of any pesky negative signs:

    - - -

    Since this function only deals with individual coordinates, we'll need to sum over all coordinates in order to get the full error function. So, we literally just do that; the total error function is simply the sum of all these individual errors:

    - - -

    And here's the trick that justifies using matrices: while we can work with individual values using calculus, with matrices we can compute as many values as we make our matrices big, all at the "same time", We can replace the individual terms pi with the full P coordinate matrix, and we can replace Bézier(si) with the matrix representation T x M x C we talked about before, which gives us:

    - - -

    In which we can replace the rather cumbersome "squaring" operation with a more conventional matrix equivalent:

    - - -

    Here, the letter T is used instead of the number 2, to represent the matrix transpose; each row in the original matrix becomes a column in the transposed matrix instead (row one becomes column one, row two becomes column two, and so on).

    -

    This leaves one problem: T isn't actually the matrix we want: we don't want symbolic t values, we want the actual numerical values that we computed for S, so we need to form a new matrix, which we'll call 𝕋, that makes use of those, and then use that 𝕋 instead of T in our error function:

    - - -

    Which, because of the first and last values in S, means:

    - - -

    Now we can properly write out the error function as matrix operations:

    - - -

    So, we have our error function: we now need to figure out the expression for where that function has minimal value, e.g. where the error between the true coordinates and the coordinates generated by the curve fitting is smallest. Like in standard calculus, this requires taking the derivative, and determining where that derivative is zero:

    - - -
    + +
    +

    Where did this derivative come from?

    +

    + That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I + straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know. +

    +

    + So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific + case, I + posted a question + to Math.stackexchange, and received a answer that goes into way more detail than I had + hoped to receive. +

    +

    + Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean + maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that + true?". There are answers. They might just require some time to come to understand. +

    +
    -

    Where did this derivative come from?

    -

    That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know.

    -

    So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific case, I posted a question to Math.stackexchange, and received a answer that goes into way more detail than I had hoped to receive.

    -

    Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that true?". There are answers. They might just require some time to come to understand.

    -
    - -

    Now, given the above derivative, we can rearrange the terms (following the rules of matrix algebra) so that we end up with an expression for C:

    - - -

    Here, the "to the power negative one" is the notation for the matrix inverse. But that's all we have to do: we're done. Starting with P and inventing some t values based on the polygon the coordinates in P define, we can compute the corresponding Bézier coordinates C that specify a curve that goes through our points. Or, if it can't go through them exactly, as near as possible.

    -

    So before we try that out, how much code is involved in implementing this? Honestly, that answer depends on how much you're going to be writing yourself. If you already have a matrix maths library available, then really not that much code at all. On the other hand, if you are writing this from scratch, you're going to have to write some utility functions for doing your matrix work for you, so it's really anywhere from 50 lines of code to maybe 200 lines of code. Not a bad price to pay for being able to fit curves to pre-specified coordinates.

    -

    So let's try it out! The following graphic lets you place points, and will start computing exact-fit curves once you've placed at least three. You can click for more points, and the code will simply try to compute an exact fit using a Bézier curve of the appropriate order. Four points? Cubic Bézier. Five points? Quartic. And so on. Of course, this does break down at some point: depending on where you place your points, it might become mighty hard for the fitter to find an exact fit, and things might actually start looking horribly off once there's enough points for compound floating point rounding errors to start making a difference (which is around 10~11 points).

    - - - Scripts are disabled. Showing fallback image. - - - - - - -

    You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be able to freely manipulate them and see what the effect on your curve is.

    - -
    -
    -

    Bézier curves and Catmull-Rom curves

    -

    Taking an excursion to different splines, the other common design curve is the Catmull-Rom spline, which unlike Bézier curves pass through each control point, so they offer a kind of "built-in" curve fitting.

    -

    In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what looks like a single Catmull-Rom curve is actually a spline: a single curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is defined by the point's coordinates, and the tangent for those points, the latter of which can trivially be derived from knowing the previous and next point:

    - + +

    + You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio + along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get + direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be + able to freely manipulate them and see what the effect on your curve is. +

    +
    +
    +

    + + Bézier curves and Catmull-Rom curves +

    +

    + Taking an excursion to different splines, the other common design curve is the + Catmull-Rom spline, which unlike Bézier curves + pass through each control point, so they offer a kind of "built-in" curve fitting. +

    +

    + In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets + you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the + tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what + looks like a single Catmull-Rom curve is actually a spline: a single + curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, + and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is + defined by the point's coordinates, and the tangent for those points, the latter of which + can trivially be derived from knowing the + previous and next point: +

    + - -

    One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on that in the next section.

    -

    In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of variations that make things considerable more complex):

    + +

    + One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up + with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent + should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on + that in the next section. +

    +

    + In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of + variations that make things + considerable more + complex): +

    - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Now, since a Catmull-Rom curve is a form of cubic Hermite spline, and as cubic Bézier curves are also a form of cubic Hermite spline, we run into an interesting bit of maths programming: we can convert one to the other and back, and the maths for doing so is surprisingly simple!

    -

    The main difference between Catmull-Rom curves and Bézier curves is "what the points mean":

    -
      -
    • A cubic Bézier curve is defined by a start point, a control point that implies the tangent at the start, a control point that implies the tangent at the end, and an end point, plus a characterizing matrix that we can multiply by that point vector to get on-curve coordinates.
    • -
    • A Catmull-Rom curve is defined by a start point, a tangent that for that starting point, an end point, and a tangent for that end point, plus a characteristic matrix that we can multiple by the point vector to get on-curve coordinates.
    • -
    -

    Those are very similar, so let's see exactly how similar they are. We've already see the matrix form for Bézier curves, so how different is the matrix form for Catmull-Rom curves?:

    - - -

    That's pretty dang similar. So the question is: how can we convert that expression with Catmull-Rom matrix and vector into an expression of the Bézier matrix and vector? The short answer is of course "by using linear algebra", but the longer answer is the rest of this section, and involves some maths that you may not even care for: if you just want to know the (incredibly simple) conversions between the two curve forms, feel free to skip to the end of the following explanation, but if you want to how we can get one from the other... let's get mathing!

    -
    - -

    Deriving the conversion formulae

    -

    In order to convert between Catmull-Rom curves and Bézier curves, we need to know two things. Firstly, how to express the Catmull-Rom curve using a "set of four coordinates", rather than a mix of coordinates and tangents, and secondly, how to convert those Catmull-Rom coordinates to and from Bézier form.

    -

    We start with the first part, to figure out how we can go from Catmull-Rom V coordinates to Bézier P coordinates, by applying "some matrix T". We don't know what that T is yet, but we'll get to that:

    - - -

    So, this mapping says that in order to map a Catmull-Rom "point + tangent" vector to something based on an "all coordinates" vector, we need to determine the mapping matrix such that applying T yields P2 as start point, P3 as end point, and two tangents based on the lines between P1 and P3, and P2 nd P4, respectively.

    -

    Computing T is really more "arranging the numbers":

    - - -

    Thus:

    - - -

    However, we're not quite done, because Catmull-Rom curves have that "tension" parameter, written as τ (a lowercase"tau"), which is a scaling factor for the tangent vectors: the bigger the tension, the smaller the tangents, and the smaller the tension, the bigger the tangents. As such, the tension factor goes in the denominator for the tangents, and before we continue, let's add that tension factor into both our coordinate vector representation, and mapping matrix T:

    - - -

    With the mapping matrix properly done, let's rewrite the "point + tangent" Catmull-Rom matrix form to a matrix form in terms of four coordinates, and see what we end up with:

    - - -

    Replace point/tangent vector with the expression for all-coordinates:

    - - -

    and merge the matrices:

    - - -

    This looks a lot like the Bézier matrix form, which as we saw in the chapter on Bézier curves, should look like this:

    - - -

    So, if we want to express a Catmull-Rom curve using a Bézier curve, we'll need to turn this Catmull-Rom bit:

    - - -

    Into something that looks like this:

    - - -

    And the way we do that is with a fairly straight forward bit of matrix rewriting. We start with the equality we need to ensure:

    - - -

    Then we remove the coordinate vector from both sides without affecting the equality:

    - - -

    Then we can "get rid of" the Bézier matrix on the right by left-multiply both with the inverse of the Bézier matrix:

    - - -

    A matrix times its inverse is the matrix equivalent of 1, and because "something times 1" is the same as "something", so we can just outright remove any matrix/inverse pair:

    - - -

    And now we're basically done. We just multiply those two matrices and we know what V is:

    - - -

    We now have the final piece of our function puzzle. Let's run through each step.

    -
      -
    1. Start with the Catmull-Rom function:
    2. -
    - - -
      -
    1. rewrite to pure coordinate form:
    2. -
    - - -
      -
    1. rewrite for "normal" coordinate vector:
    2. -
    - - -
      -
    1. merge the inner matrices:
    2. -
    - - -
      -
    1. rewrite for Bézier matrix form:
    2. -
    - - -
      -
    1. and transform the coordinates so we have a "pure" Bézier expression:
    2. -
    - - -

    And we're done: we finally know how to convert these two curves!

    -
    + +

    And we're done: we finally know how to convert these two curves!

    + -

    If we have a Catmull-Rom curve defined by four coordinates P1 through P4, then we can draw that curve using a Bézier curve that has the vector:

    - - -

    Similarly, if we have a Bézier curve defined by four coordinates P1 through P4, we can draw that using a standard tension Catmull-Rom curve with the following coordinate values:

    - - -

    Or, if your API allows you to specify Catmull-Rom curves using plain coordinates:

    - - - -
    -
    -

    Creating a Catmull-Rom curve from three points

    -

    Much shorter than the previous section: we saw that Catmull-Rom curves need at least 4 points to draw anything sensible, so how do we create a Catmull-Rom curve from three points?

    -

    Short and sweet: we don't.

    -

    We run through the maths that lets us create a cubic Bézier curve, and then convert its coordinates to Catmull-Rom form using the conversion formulae we saw above.

    - -
    -
    -

    Forming poly-Bézier curves

    -

    Much like lines can be chained together to form polygons, Bézier curves can be chained together to form poly-Béziers, and the only trick required is to make sure that:

    -
      -
    1. the end point of each section is the starting point of the following section, and
    2. -
    3. the derivatives across that dual point line up.
    4. -
    -

    Unless you want sharp corners, of course. Then you don't even need 2.

    -

    We'll cover three forms of poly-Bézier curves in this section. First, we'll look at the kind that just follows point 1. where the end point of a segment is the same point as the start point of the next segment. This leads to poly-Béziers that are pretty hard to work with, but they're the easiest to implement:

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    Dragging the control points around only affects the curve segments that the control point belongs to, and moving an on-curve point leaves the control points where they are, which is not the most useful for practical modelling purposes. So, let's add in the logic we need to make things a little better. We'll start by linking up control points by ensuring that the "incoming" derivative at an on-curve point is the same as it's "outgoing" derivative:

    - - -

    We can effect this quite easily, because we know that the vector from a curve's last control point to its last on-curve point is equal to the derivative vector. If we want to ensure that the first control point of the next curve matches that, all we have to do is mirror that last control point through the last on-curve point. And mirroring any point A through any point B is really simple:

    - - -

    So let's implement that and see what it gets us. The following two graphics show a quadratic and a cubic poly-Bézier curve again, but this time moving the control points around moves others, too. However, you might see something unexpected going on for quadratic curves...

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    As you can see, quadratic curves are particularly ill-suited for poly-Bézier curves, as all the control points are effectively linked. Move one of them, and you move all of them. Not only that, but if we move the on-curve points, it's possible to get a situation where a control point cannot satisfy the constraint that it's the reflection of its two neighbouring control points... This means that we cannot use quadratic poly-Béziers for anything other than really, really simple shapes. And even then, they're probably the wrong choice. Cubic curves are pretty decent, but the fact that the derivatives are linked means we can't manipulate curves as well as we might if we relaxed the constraints a little.

    -

    So: let's relax the requirement a little.

    -

    We can change the constraint so that we still preserve the angle of the derivatives across sections (so transitions from one section to the next will still look natural), but give up the requirement that they should also have the same vector length. Doing so will give us a much more useful kind of poly-Bézier curve:

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    Cubic curves are now better behaved when it comes to dragging control points around, but the quadratic poly-Bézier still has the problem that moving one control points will move the control points and may ending up defining "the next" control point in a way that doesn't work. Quadratic curves really aren't very useful to work with...

    -

    Finally, we also want to make sure that moving the on-curve coordinates preserves the relative positions of the associated control points. With that, we get to the kind of curve control that you might be familiar with from applications like Photoshop, Inkscape, Blender, etc.

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    Again, we see that cubic curves are now rather nice to work with, but quadratic curves have a new, very serious problem: we can move an on-curve point in such a way that we can't compute what needs to "happen next". Move the top point down, below the left and right points, for instance. There is no way to preserve correct control points without a kink at the bottom point. Quadratic curves: just not that good...

    -

    A final improvement is to offer fine-level control over which points behave which, so that you can have "kinks" or individually controlled segments when you need them, with nicely well-behaved curves for the rest of the path. Implementing that, is left as an exercise for the reader.

    - -
    -
    -

    Curve offsetting

    -

    Perhaps you're like me, and you've been writing various small programs that use Bézier curves in some way or another, and at some point you make the step to implementing path extrusion. But you don't want to do it pixel based; you want to stay in the vector world. You find that extruding lines is relatively easy, and tracing outlines is coming along nicely (although junction caps and fillets are a bit of a hassle), and then you decide to do things properly and add Bézier curves to the mix. Now you have a problem.

    -

    Unlike lines, you can't simply extrude a Bézier curve by taking a copy and moving it around, because of the curvatures; rather than a uniform thickness, you get an extrusion that looks too thin in places, if you're lucky, but more likely will self-intersect. The trick, then, is to scale the curve, rather than simply copying it. But how do you scale a Bézier curve?

    -

    Bottom line: you can't. So you cheat. We're not going to do true curve scaling, or rather curve offsetting, because that's impossible. Instead we're going to try to generate 'looks good enough' offset curves.

    -
    - -

    "What do you mean, you can't? Prove it."

    -

    First off, when I say "you can't," what I really mean is "you can't offset a Bézier curve with another Bézier curve", not even by using a really high order curve. You can find the function that describes the offset curve, but it won't be a polynomial, and as such it cannot be represented as a Bézier curve, which has to be a polynomial. Let's look at why this is:

    -

    From a mathematical point of view, an offset curve O(t) is a curve such that, given our original curve B(t), any point on O(t) is a fixed distance d away from coordinate B(t). So let's math that:

    - - -

    However, we're working in 2D, and d is a single value, so we want to turn it into a vector. If we want a point distance d "away" from the curve B(t) then what we really mean is that we want a point at d times the "normal vector" from point B(t), where the "normal" is a vector that runs perpendicular ("at a right angle") to the tangent at B(t). Easy enough:

    - - -

    Now this still isn't very useful unless we know what the formula for N(t) is, so let's find out. N(t) runs perpendicular to the original curve tangent, and we know that the tangent is simply B'(t), so we could just rotate that 90 degrees and be done with it. However, we need to ensure that N(t) has the same magnitude for every t, or the offset curve won't be at a uniform distance, thus not being an offset curve at all. The easiest way to guarantee this is to make sure N(t) always has length 1, which we can achieve by dividing B'(t) by its magnitude:

    - - -

    Determining the length requires computing an arc length, and this is where things get Tricky with a capital T. First off, to compute arc length from some start a to end b, we must use the formula we saw earlier. Noting that "length" is usually denoted with double vertical bars:

    - - -

    So if we want the length of the tangent, we plug in B'(t), with t = 0 as start and -t = 1 as end:

    - - -

    And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is screwing everything up, because it turns our nice polynomials into things that are no longer polynomials.

    -

    There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine because they do, but they don't; remember that only the base function has binomial coefficients. That's before we factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call them circles, and they have much simpler functions than Bézier curves.

    -

    So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve.

    -

    And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the real offset curve would look like, if we could compute it.

    -
    + +

    + And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is + screwing everything up, because it turns our nice polynomials into things that are no longer polynomials. +

    +

    + There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with + unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine + because they do, but they don't; remember that only the base function has binomial coefficients. That's before we + factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make + all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call + them circles, and they have much simpler functions than Bézier curves. +

    +

    + So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means + N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means + that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for + O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve. +

    +

    + And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier + curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, + much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the + real offset curve would look like, if we could compute it. +

    + -

    So, you cannot offset a Bézier curve perfectly with another Bézier curve, no matter how high-order you make that other Bézier curve. However, we can chop up a curve into "safe" sub-curves (where "safe" means that all the control points are always on a single side of the baseline, and the midpoint of the curve at t=0.5 is roughly in the center of the polygon defined by the curve coordinates) and then point-scale each sub-curve with respect to its scaling origin (which is the intersection of the point normals at the start and end points).

    -

    A good way to do this reduction is to first find the curve's extreme points, as explained in the earlier section on curve extremities, and use these as initial splitting points. After this initial split, we can check each individual segment to see if it's "safe enough" based on where the center of the curve is. If the on-curve point for t=0.5 is too far off from the center, we simply split the segment down the middle. Generally this is more than enough to end up with safe segments.

    -

    The following graphics show off curve offsetting, and you can use the slider to control the distance at which the curve gets offset. The curve first gets reduced to safe segments, each of which is then offset at the desired distance. Especially for simple curves, particularly easily set up for quadratic curves, no reduction is necessary, but the more twisty the curve gets, the more the curve needs to be reduced in order to get segments that can safely be scaled.

    - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -

    You may notice that this may still lead to small 'jumps' in the sub-curves when moving the curve around. This is caused by the fact that we're still performing a naive form of offsetting, moving the control points the same distance as the start and end points. If the curve is large enough, this may still lead to incorrect offsets.

    - -
    -
    -

    Graduated curve offsetting

    -

    What if we want to do graduated offsetting, starting at some distance s but ending at some other distance e? Well, if we can compute the length of a curve (which we can if we use the Legendre-Gauss quadrature approach) then we can also determine how far "along the line" any point on the curve is. With that knowledge, we can offset a curve so that its offset curve is not uniformly wide, but graduated between with two different offset widths at the start and end.

    -

    Like normal offsetting we cut up our curve in sub-curves, and then check at which distance along the original curve each sub-curve starts and ends, as well as to which point on the curve each of the control points map. This gives us the distance-along-the-curve for each interesting point in the sub-curve. If we call the total length of all sub-curves seen prior to seeing "the current" sub-curve S (and if the current sub-curve is the first one, S is zero), and we call the full length of our original curve L, then we get the following graduation values:

    -
      -
    • start: map S from interval (0,L) to interval (s,e)
    • -
    • c1: map(<strong>S+d1</strong>, 0,L, s,e), d1 = distance along curve to projection of c1
    • -
    • c2: map(<strong>S+d2</strong>, 0,L, s,e), d2 = distance along curve to projection of c2
    • -
    • ...
    • -
    • end: map(<strong>S+length(subcurve)</strong>, 0,L, s,e)
    • -
    -

    At each of the relevant points (start, end, and the projections of the control points onto the curve) we know the curve's normal, so offsetting is simply a matter of taking our original point, and moving it along the normal vector by the offset distance for each point. Doing so will give us the following result (these have with a starting width of 0, and an end width of 40 pixels, but can be controlled with your up and down arrow keys):

    - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - - -
    -
    -

    Circles and quadratic Bézier curves

    -

    Circles and Bézier curves are very different beasts, and circles are infinitely easier to work with than Bézier curves. Their formula is much simpler, and they can be drawn more efficiently. But, sometimes you don't have the luxury of using circles, or ellipses, or arcs. Sometimes, all you have are Bézier curves. For instance, if you're doing font design, fonts have no concept of geometric shapes, they only know straight lines, and Bézier curves. OpenType fonts with TrueType outlines only know quadratic Bézier curves, and OpenType fonts with Type 2 outlines only know cubic Bézier curves. So how do you draw a circle, or an ellipse, or an arc?

    -

    You approximate.

    -

    We already know that Bézier curves cannot model all curves that we can think of, and this includes perfect circles, as well as ellipses, and their arc counterparts. However, we can certainly approximate them to a degree that is visually acceptable. Quadratic and cubic curves offer us different curvature control, so in order to approximate a circle we will first need to figure out what the error is if we try to approximate arcs of increasing degree with quadratic and cubic curves, and where the coordinates even lie.

    -

    Since arcs are mid-point-symmetrical, we need the control points to set up a symmetrical curve. For quadratic curves this means that the control point will be somewhere on a line that intersects the baseline at a right angle. And we don't get any choice on where that will be, since the derivatives at the start and end point have to line up, so our control point will lie at the intersection of the tangents at the start and end point.

    -

    First, let's try to fit the quadratic curve onto a circular arc. In the following sketch you can move the mouse around over a unit circle, to see how well, or poorly, a quadratic curve can approximate the arc from (1,0) to where your mouse cursor is:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    As you can see, things go horribly wrong quite quickly; even trying to approximate a quarter circle using a quadratic curve is a bad idea. An eighth of a turns might look okay, but how okay is okay? Let's apply some maths and find out. What we're interested in is how far off our on-curve coordinates are with respect to a circular arc, given a specific start and end angle. We'll be looking at how much space there is between the circular arc, and the quadratic curve's midpoint.

    -

    We start out with our start and end point, and for convenience we will place them on a unit circle (a circle around 0,0 with radius 1), at some angle φ:

    - - -

    What we want to find is the intersection of the tangents, so we want a point C such that:

    - - -

    i.e. we want a point that lies on the vertical line through S (at some distance a from S) and also lies on the tangent line through E (at some distance b from E). Solving this gives us:

    - - -

    First we solve for b:

    - - -

    which yields:

    - - -

    which we can then substitute in the expression for a:

    - - -

    A quick check shows that plugging these values for a and b into the expressions for Cx and Cy give the same x/y coordinates for both "a away from A" and "b away from B", so let's continue: now that we know the coordinate values for C, we know where our on-curve point T for t=0.5 (or angle φ/2) is, because we can just evaluate the Bézier polynomial, and we know where the circle arc's actual point P is for angle φ/2:

    - - -

    We compute T, observing that if t=0.5, the polynomial values (1-t)², 2(1-t)t, and t² are 0.25, 0.5, and 0.25 respectively:

    - - -

    Which, worked out for the x and y components, gives:

    - - -

    And the distance between these two is the standard Euclidean distance:

    - - -

    So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter circle and eighth circle?

    -
    - - plotted for 0 ≤ φ ≤ π: - - - plotted for 0 ≤ φ ≤ ½π: - - - - - plotted for 0 ≤ φ ≤ ¼π: -
    + +

    + So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter + circle and eighth circle? +

    + + + + + + + + +
    + + plotted for 0 ≤ φ ≤ π: + + + plotted for 0 ≤ φ ≤ ½π: + + + + + plotted for 0 ≤ φ ≤ ¼π: +
    -

    We now see why the eighth circle arc looks decent, but the quarter circle arc doesn't: an error of roughly 0.06 at t=0.5 means we're 6% off the mark... we will already be off by one pixel on a circle with pixel radius 17. Any decent sized quarter circle arc, say with radius 100px, will be way off if approximated by a quadratic curve! For the eighth circle arc, however, the error is only roughly 0.003, or 0.3%, which explains why it looks so close to the actual eighth circle arc. In fact, if we want a truly tiny error, like 0.001, we'll have to contend with an angle of (rounded) 0.593667, which equates to roughly 34 degrees. We'd need 11 quadratic curves to form a full circle with that precision! (technically, 10 and ten seventeenth, but we can't do partial curves, so we have to round up). That's a whole lot of curves just to get a shape that can be drawn using a simple function!

    -

    In fact, let's flip the function around, so that if we plug in the precision error, labelled ε, we get back the maximum angle for that precision:

    - - -

    And frankly, things are starting to look a bit ridiculous at this point, we're doing way more maths than we've ever done, but thankfully this is as far as we need the maths to take us: If we plug in the precisions 0.1, 0.01, 0.001 and 0.0001 we get the radians values 1.748, 1.038, 0.594 and 0.3356; in degrees, that means we can cover roughly 100 degrees (requiring four curves), 59.5 degrees (requiring six curves), 34 degrees (requiring 11 curves), and 19.2 degrees (requiring a whopping nineteen curves).

    -

    The bottom line? Quadratic curves are kind of lousy if you want circular (or elliptical, which are circles that have been squashed in one dimension) curves. We can do better, even if it's just by raising the order of our curve once. So let's try the same thing for cubic curves.

    - -
    -
    -

    Circular arcs and cubic Béziers

    -

    Let's look at approximating circles and circular arcs using cubic Béziers. How much better is that?

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    At cursory glance, a fair bit better, but let's find out how much better by looking at how to construct the Bézier curve.

    -

    A construction diagram for a cubic approximation of a circular arc

    -

    The start and end points are trivial, but the mid point requires a bit of work, but it's mostly basic trigonometry once we know the angle θ for our circular arc: if we scale our circular arc to a unit circle, we can always start our arc, with radius 1, at (1,0) and then given our arc angle θ, we also know that the circular arc has length θ (because unit circles are nice that way). We also know our end point, because that's just (cos(θ), sin(θ)), and so the challenge is to figure out what control points we need in order for the curve at t=0.5 to exactly touch the circular arc at the angle θ/2:

    -

    So let's again formally describe this:

    - - -

    Only P3 isn't quite straight-forward here, and its description is based on the fact that the triangle (origin, P4, P3) is a right angled triangle, with the distance between the origin and P4 being 1 (because we're working with a unit circle), and the distance between P4 and P3 being k, so that we can represent P3 as "The point P4 plus the vector from the origin to P4 but then rotated a quarter circle, counter-clockwise, and scaled by k".

    -

    With that, we can determine the y-coordinates for A, B, e1, and e2, after which we have all the information we need to determine what the value of k is. We can find these values by using (no surprise here) linear interpolation between known points, as A is midway between P2 and P3, e1 is between A and "midway between P1 and P2" (which is "half height" P2), and so forth:

    - - -

    Which now gives us two identities for B, because in addition to determining B through linear interpolation, we also know that B's y coordinate is just sin(θ/2): we started this exercise by saying we were going to approximate the circular arc using a Bézier curve that had its midpoint, which is point B, touching the unit circle at the arc's half-angle, by definition making B the point at (cos(θ/2), sin(θ/2)).

    -

    This means we can equate the two identities we now have for By and solve for k.

    -
    - -

    Deriving k

    -

    Solving for k is fairly straight forward, but it's a fair few steps, and if you just the immediate result: using a tool like Wolfram Alpha is definitely the way to go. That said, let's get going:

    - - -

    And finally, we can take further advantage of several trigonometric identities to drastically simplify our formula for k:

    - - -

    And we're done.

    -
    + +

    And we're done.

    + -

    So, the distance of our control points to the start/end points can be expressed as a number that we get from an almost trivial expression involving the circular arc's angle:

    - - -

    Which means that for any circular arc with angle θ and radius r, our Bézier approximation based on three points of incidence is:

    - - -

    Which also gives us the commonly found value of 0.55228 for quarter circles, based on them having an angle of half π:

    - - -

    And thus giving us the following Bézier coordinates for a quarter circle of radius r:

    - - -
    + +
    +

    So, how accurate is this?

    +

    + Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points + that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum + deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but + rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we + get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff. +

    +

    + So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, + finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around + that t value, repeating as many times as necessary to get us an arbitrarily precise value of t: +

    -

    So, how accurate is this?

    -

    Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff.

    -

    So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around that t value, repeating as many times as necessary to get us an arbitrarily precise value of t:

    - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    Plus, how often do you get to write a function with that name?

    -

    Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity first, and then coming back down as we approach 2π)

    - - -
    - - - - - -
    - error plotted for 0 ≤ φ ≤ 2π - - error plotted for 0 ≤ φ ≤ π - - error plotted for 0 ≤ φ ≤ ½π -
    + return getMostWrongT(radius, bezier, worst_t - stepsize, worst_t + stepsize) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again:

    -

    +

    Plus, how often do you get to write a function with that name?

    +

    + Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see + what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity + first, and then coming back down as we approach 2π) +

    + + + + + + + + + + + + + +
    + + + + + +
    + error plotted for 0 ≤ φ ≤ 2π + + error plotted for 0 ≤ φ ≤ π + + error plotted for 0 ≤ φ ≤ ½π +
    -

    Yeah... the error of a cubic approximation for a quarter circle turns out to be two orders of magnitude better. At approximately 0.00027 (or: just shy of being 2.7 pixels off for a circle with a radius of 10,000 pixels) the increase in precision over quadratic curves is quite spectacular - certainly good enough that no one in their right mind should ever use quadratic curves.

    -
    +

    + That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we + can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again: +

    +

    -

    So that's it, kappa is 4/3 · tan(θ/4) , we're done! ...or are we?

    -

    Can we do better?

    -

    Technically: yes, we can. But I'm going to prefix this section with "we can, and we should investigate that possibility, but let me warn you up front that the result is only better if we're going to hard-code the values". We're about to get into the weeds and the standard three-points-of-incidence value is so good already that for most applications, trying to do better won't make any sense at all.

    -

    So with that said: what we calculated above is an upper bound for a best fit Bézier curve for a circular arc: anywhere we don't touch the circular arc in our approximation, we've "overshot" the arc. What if we dropped our value for k just a little, so that the curve starts out as an over-estimation, but then crosses the circular arc, yielding an region of underestimation, and then crosses the circular arc again, with another region of overestimation. This might give us a lower overall error, so let's see what we can do.

    -

    First, let's express the total error (given circular arc angle θ, and some k) using standard calculus notation:

    - - -

    This says that the error function for a given angle and value of k is equal to the "infinite" sum of differences between our curve and the circular arc, as we run t from 0 to 1, using an infinitely small step size. between subsequent t values.

    -

    Now, since we want to find the minimal error, that means we want to know where along this function things go from "error is getting progressively less" to "error is increasing again", which means we want to know where its derivative is zero, which as mathematical expression looks like:

    - - -

    And here we have the most direct application of the Fundamental Theorem of Calculus: the derivative and integral are each other's inverse operations, so they cancel out, leaving us with our original function:

    - - -

    And now we just solve for that... oh wait. We've seen this before. In order to solve this, we'd end up needing to solve this:

    - - -

    And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section on arc lengths—there is no symbolic solution for this equasion. Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer!

    -
    + +

    + And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section + on arc lengths—there is no symbolic solution for this equasion. + Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer! +

    +
    +

    Iterating on a solution

    +

    + By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the + value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's + center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just + binary search our way to the most accurate value for c that gets us that middle case. +

    +

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    -

    Iterating on a solution

    -

    By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just binary search our way to the most accurate value for c that gets us that middle case.

    -

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    - - - - -
    1 - -
    2
    3
    4
    - -

    And then the binary search algorithm, which can be found in pretty much any CS textbook, as well as more online articles, tutorials, and blog posts than you can ever read in a life time:

    + return binarySearch(radius, angle, points, lowerBound, upperBound) + + + + 2 + + + 3 + + + 4 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Using the following radialError function, which samples the curve's approximation of the circular arc over several points (although the first and last point will never contribute anything, so we skip them):

    + return binarySearch(radius, angle, points, value, upperBound) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a vector, we can get its length to the origin using a magnitude call.

    -

    Examining the result

    -

    Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, for each angle, plot what the difference between the circular arc and the Bézier approximation looks like:

    -

    image-20210419085430711

    -

    Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from 0 to θ:

    -

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    - - -
    - - - - - -
    - max deflection using unit scale - - max deflection at 10x scale - - max deflection at 100x scale -
    + return err + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    -

    As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference.

    -
    +

    + In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a + vector, we can get its length to the origin using a magnitude call. +

    +

    Examining the result

    +

    + Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, + for each angle, plot what the difference between the circular arc and the Bézier approximation looks like: +

    +

    image-20210419085430711

    +

    + Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the + plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from + 0 to θ: +

    +

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    + + + + + + + + + + + + + +
    + + + + + +
    + max deflection using unit scale + + max deflection at 10x scale + + max deflection at 100x scale +
    -

    At this point it should be clear that while, yes, there are improvement to be had, they're essentially insignificant while also being much more computationally expensive.

    -

    TL;DR: just tell me which value I should be using

    -

    It depends on what we need to do. If we just want the best value for quarter circles, and we're going to hard code the value for k, then there is no reason to hard-code the constant k=4/3*tan(pi/8) when you can just as easily hard-code the constant as k=0.551784777779014 instead.

    -

    If you need "the" value for quarter circles, use 0.551785 instead of 0.55228

    -

    However, for dynamic arc approximation, in code that tries to fit circular paths using Bézier paths instead, it should be fairly obvious that the simple function involving a tangent computation, two divisions, and one multiplication, is vastly more performant than running all the code we ended writing just to get a 25% lower error value, and most certainly worth preferring over getting the "more accurate" value.

    -

    If you need to fit Béziers to circular arcs on the fly, use 4/3 * tan(θ/4)

    -

    However, always remember that if you're writing for humans, you can typically use the best of both worlds: as the user interacts with their curves, you should draw their curves instead of drawing approximations of them. If they need to draw circles or circular arcs, draw those, and only approximate them with a Bézier curve when the data needs to be exported to a format that doesn't support those. Ideally with a preview mechanism that highlights where the errors will be, and how large they will be.

    -

    If you're writing code for graphics design by humans, use circular arcs for circular arcs

    -

    And that's it. We have pretty well exhausted this subject. There are different metrics we could use to find "different best k values", like trying to match arc length (e.g. when we're optimizing for material cost), or minimizing the area between the circular arc and the Bézier curve (e.g. when we're optimizing for inking), or minimizing the rate of change of the Bézier's curvature (e.g. when we're optimizing for curve traversal) and they all yield values that are so similar that it's almost certainly not worth it. (For instance, for quarter circle approximations those values are 0.551777, 0.5533344, and 0.552184 respectively. Much like the 0.551785 we get from minimizing the maximum deflection, none of these values are significantly better enough to prefer them over the upper bound value).

    +

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    +

    + As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by + 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an + almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference. +

    +
    -
    -
    -

    Approximating Bézier curves with circular arcs

    -

    Let's look at doing the exact opposite of the previous section: rather than approximating circular arc using Bézier curves, let's approximate Bézier curves using circular arcs.

    -

    We already saw in the section on circle approximation that this will never yield a perfect equivalent, but sometimes you need circular arcs, such as when you're working with fabrication machinery, or simple vector languages that understand lines and circles, but not much else.

    -

    The approach is fairly simple: pick a starting point on the curve, and pick two points that are further along the curve. Determine the circle that goes through those three points, and see if it fits the part of the curve we're trying to approximate. Decent fit? Try spacing the points further apart. Bad fit? Try spacing the points closer together. Keep doing this until you've found the "good approximation/bad approximation" boundary, record the "good" arc, and then move the starting point up to overlap the end point we previously found. Rinse and repeat until we've covered the entire curve.

    -

    We already saw how to fit a circle through three points in the section on creating a curve from three points, and finding the arc through those points is straight-forward: pick one of the three points as start point, pick another as an end point, and the arc has to necessarily go from the start point, to the end point, over the remaining point.

    -

    So, how can we convert a Bézier curve into a (sequence of) circular arc(s)?

    -
      -
    • Start at t=0
    • -
    • Pick two points further down the curve at some value m = t + n and e = t + 2n
    • -
    • Find the arc that these points define
    • -
    • Determine how close the found arc is to the curve:
        -
      • Pick two additional points e1 = t + n/2 and e2 = t + n + n/2.
      • -
      • These points, if the arc is a good approximation of the curve interval chosen, should - lie on the circle, so their distance to the center of the circle should be the - same as the distance from any of the three other points to the center.
      • -
      • For point points, determine the (absolute) error between the radius of the circle, and the -actual distance from the center of the circle to the point on the curve.
      • -
      • If this error is too high, we consider the arc bad, and try a smaller interval.
      • -
      -
    • -
    -

    The result of this is shown in the next graphic: we start at a guaranteed failure: s=0, e=1. That's the entire curve. The midpoint is simply at t=0.5, and then we start performing a binary search.

    -
      -
    1. We start with low=0, mid=0.5 and high=1
    2. -
    3. That'll fail, so we retry with the interval halved: {0, 0.25, 0.5}
        -
      • If that arc's good, we move back up by half distance: {0, 0.375, 0.75}.
      • -
      • However, if the arc was still bad, we move down by half the distance: {0, 0.125, 0.25}.
      • -
      -
    4. -
    5. We keep doing this over and over until we have two arcs, in sequence, of which the first arc is good, and the second arc is bad. When we find that pair, we've found the boundary between a good approximation and a bad approximation, and we pick the good arc.
    6. -
    -

    The following graphic shows the result of this approach, with a default error threshold of 0.5, meaning that if an arc is off by a combined half pixel over both verification points, then we treat the arc as bad. This is an extremely simple error policy, but already works really well. Note that the graphic is still interactive, and you can use your up and down arrow keys keys to increase or decrease the error threshold, to see what the effect of a smaller or larger error threshold is.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    With that in place, all that's left now is to "restart" the procedure by treating the found arc's end point as the new to-be-determined arc's starting point, and using points further down the curve. We keep trying this until the found end point is for t=1, at which point we are done. Again, the following graphic allows for up and down arrow key input to increase or decrease the error threshold, so you can see how picking a different threshold changes the number of arcs that are necessary to reasonably approximate a curve:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    So... what is this good for? Obviously, if you're working with technologies that can't do curves, but can do lines and circles, then the answer is pretty straightforward, but what else? There are some reasons why you might need this technique: using circular arcs means you can determine whether a coordinate lies "on" your curve really easily (simply compute the distance to each circular arc center, and if any of those are close to the arc radii, at an angle between the arc start and end, bingo, this point can be treated as lying "on the curve"). Another benefit is that this approximation is "linear": you can almost trivially travel along the arcs at fixed speed. You can also trivially compute the arc length of the approximated curve (it's a bit like curve flattening). The only thing to bear in mind is that this is a lossy equivalence: things that you compute based on the approximation are guaranteed "off" by some small value, and depending on how much precision you need, arc approximation is either going to be super useful, or completely useless. It's up to you to decide which, based on your application!

    - -
    -
    -

    B-Splines

    -

    No discussion on Bézier curves is complete without also giving mention of that other beast in the curve design space: B-Splines. Easily confused to mean Bézier splines, that's not actually what they are; they are "basis function" splines, which makes a lot of difference, and we'll be looking at those differences in this section. We're not going to dive as deep into B-Splines as we have for Bézier curves (that would be an entire primer on its own) but we'll be looking at how B-Splines work, what kind of maths is involved in computing them, and how to draw them based on a number of parameters that you can pick for individual B-Splines.

    -

    First off: B-Splines are piecewise, polynomial interpolation curves, where the "single curve" is built by performing polynomial interpolation over a set of points, using a sliding window of a fixed number of points. For instance, a "cubic" B-Spline defined by twelve points will have its curve built by evaluating the polynomial interpolation of four points, and the curve can be treated as a lot of different sections, each controlled by four points at a time, such that the full curve consists of smoothly connected sections defined by points {1,2,3,4}, {2,3,4,5}, ..., {8,9,10,11}, and finally {9,10,11,12}, for eight sections.

    -

    What do they look like? They look like this! Tap on the graphic to add more points, and move points around to see how they map to the spline curve drawn.

    - - - Scripts are disabled. Showing fallback image. - - - -

    The important part to notice here is that we are not doing the same thing with B-Splines that we do for poly-Béziers or Catmull-Rom curves: both of the latter simply define new sections as literally "new sections based on new points", so a 12 point cubic poly-Bézier curve is actually impossible, because we start with a four point curve, and then add three more points for each section that follows, so we can only have 4, 7, 10, 13, 16, etc. point Poly-Béziers. Similarly, while Catmull-Rom curves can grow by adding single points, this addition of a single point introduces three implicit Bézier points. Cubic B-Splines, on the other hand, are smooth interpolations of each possible curve involving four consecutive points, such that at any point along the curve except for our start and end points, our on-curve coordinate is defined by four control points.

    -

    Consider the difference to be this:

    -
      -
    • for Bézier curves, the curve is defined as an interpolation of points, but:
    • -
    • for B-Splines, the curve is defined as an interpolation of curves.
    • -
    -

    In fact, let's look at that again, but this time with the base curves shown, too. Each consecutive four points define one curve:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have a look at how things work.

    -

    How to compute a B-Spline curve: some maths

    -

    Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the end, just like for Bézier curves), by evaluating the following function:

    - + +

    + In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have + a look at how things work. +

    +

    How to compute a B-Spline curve: some maths

    +

    + Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic + B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the + end, just like for Bézier curves), by evaluating the following function: +

    + - -

    Which, honestly, doesn't tell us all that much. All we can see is that a point on a B-Spline curve is defined as "a mix of all the control points, weighted somehow", where the weighting is achieved through the N(...) function, subscripted with an obvious parameter i, which comes from our summation, and some magical parameter k. So we need to know two things: 1. what does N(t) do, and 2. what is that k? Let's cover both, in reverse order.

    -

    The parameter k represents the "knot interval" over which a section of curve is defined. As we learned earlier, a B-Spline curve is itself an interpolation of curves, and we can treat each transition where a control point starts or stops influencing the total curvature as a "knot on the curve". -Doing so for a degree d B-Spline with n control point gives us d + n + 1 knots, defining d + n intervals along the curve, and it is these intervals that the above k subscript to the N() function applies to.

    -

    Then the N() function itself. What does it look like?

    - - -

    So this is where we see the interpolation: N(t) for an (i,k) pair (that is, for a step in the above summation, on a specific knot interval) is a mix between N(t) for (i,k-1) and N(t) for (i+1,k-1), so we see that this is a recursive iteration where i goes up, and k goes down, so it seem reasonable to expect that this recursion has to stop at some point; obviously, it does, and specifically it does so for the following i/k values:

    - - -

    And this function finally has a straight up evaluation: if a t value lies within a knot-specific interval once we reach a k=1 value, it "counts", otherwise it doesn't. We did cheat a little, though, because for all these values we need to scale our t value first, so that it lies in the interval bounded by knots[d] and knots[n], which are the start point and end point where curvature is controlled by exactly order control points. For instance, for degree 3 (=order 4) and 7 control points, with knot vector [1,2,3,4,5,6,7,8,9,10,11], we map t from [the interval 0,1] to the interval [4,8], and then use that value in the functions above, instead.

    -

    Can we simplify that?

    -

    We can, yes.

    -

    People far smarter than us have looked at this work, and two in particular — Maurice Cox and Carl de Boor — came to a mathematically pleasing solution: to compute a point P(t), we can compute this point by evaluating d(t) on a curve section between knots i and i+1:

    - - -

    This is another recursive function, with k values decreasing from the curve order to 1, and the value α (alpha) defined by:

    - - -

    That looks complicated, but it's not. Computing alpha is just a fraction involving known, plain numbers. And, once we have our alpha value, we also have (1-alpha) because it's a trivial subtraction. Computing the d() function is thus mostly a matter of computing pretty simple arithmetical statements, with some caching of results so we can refer to them as we recurve. While the recursion might see computationally expensive, the total algorithm is cheap, as each step only involves very simple maths.

    -

    Of course, the recursion does need a stop condition:

    - - -

    So, we actually see two stopping conditions: either i becomes 0, in which case d() is zero, or k becomes zero, in which case we get the same "either 1 or 0" that we saw in the N() function above.

    -

    Thanks to Cox and de Boor, we can compute points on a B-Spline pretty easily using the same kind of linear interpolation we saw in de Casteljau's algorithm. For instance, if we write out d() for i=3 and k=3, we get the following recursion diagram:

    - - -

    That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. It's really quite elegant.

    -

    One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then summing back up "to the left". We can just start on the right and work our way left immediately.

    -

    Running the computation

    -

    Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on this Michigan Tech page, but an easier to read version is implemented by b-spline.js, so we'll look at its code.

    -

    Given an input value t, we first map the input to a value from the domain [0,1] to the domain [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped t value lies on:

    + +

    + That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a + mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are + themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. + It's really quite elegant. +

    +

    + One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the + d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so + in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point + vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then + summing back up "to the left". We can just start on the right and work our way left immediately. +

    +

    Running the computation

    +

    + Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, + and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on + this Michigan Tech page, but an easier to read version + is implemented by b-spline.js, so we'll look at its code. +

    +

    + Given an input value t, we first map the input to a value from the domain [0,1] to the domain + [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped + t value lies on: +

    - - -
    1 - -
    2
    3
    - -

    after running this code, s is the index for the section the point will lie on. We then run the algorithm mentioned on the MU page (updated to use this description's variable names):

    +} + + + + 2 + + + 3 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those v[i-1] don't try to use an array index that doesn't exist)

    -

    Open vs. closed paths

    -

    Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need more than just a single point.

    -

    Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than separate coordinate objects.

    -

    Manipulating the curve through the knot vector

    -

    The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at:

    -
      -
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. -
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. -
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. -
    7. we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a uniform vector in between.
    8. -
    -

    Uniform B-Splines

    -

    The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to the curvature.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve.

    -

    The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get rid of these, by being clever about how we apply the following uniformity-breaking approach instead...

    -

    Reducing local curve complexity by collapsing intervals

    -

    Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost and the curve "kinks".

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Open-Uniform B-Splines

    -

    By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the curve not starting and ending where we'd kind of like it to:

    -

    For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences that determine the curve shape.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Non-uniform B-Splines

    -

    This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than that any value knots[k+1] should be greater than or equal to knots[k].

    -

    One last thing: Rational B-Splines

    -

    While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like for rational Bézier curves.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in arbitrary-precision 2D design due to the level of control a NURBS curve offers designers.

    -

    While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the last.

    -

    Extending our implementation to cover rational splines

    -

    The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the extended dimension.

    -

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    -

    We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how many dimensions it needs to interpolate.

    -

    In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight information.

    -

    Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing (x'/w', y'/w'). And that's it, we're done!

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    - +

    + (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the + interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those + v[i-1] don't try to use an array index that doesn't exist) +

    +

    Open vs. closed paths

    +

    + Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last + point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first + and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As + such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly + more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need + more than just a single point. +

    +

    + Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for + points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same + coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing + references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than + separate coordinate objects. +

    +

    Manipulating the curve through the knot vector

    +

    + The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As + mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the + values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on + intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting + things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at: +

    +
      +
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. +
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. +
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. +
    7. + we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a + uniform vector in between. +
    8. +
    +

    Uniform B-Splines

    +

    + The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire + curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or + [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines + the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to + the curvature. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every + interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve. +

    +

    + The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can + perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get + rid of these, by being clever about how we apply the following uniformity-breaking approach instead... +

    +

    Reducing local curve complexity by collapsing intervals

    +

    + Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the + sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, + and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost + and the curve "kinks". +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    Open-Uniform B-Splines

    +

    + By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the + curve not starting and ending where we'd kind of like it to: +

    +

    + For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in + which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values + N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector + [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences + that determine the curve shape. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    Non-uniform B-Splines

    +

    + This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to + pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than + that any value knots[k+1] should be greater than or equal to knots[k]. +

    +

    One last thing: Rational B-Splines

    +

    + While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's + "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a + ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a + control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like + for rational Bézier curves. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the + NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, + the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an + important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in + arbitrary-precision 2D design due to the level of control a NURBS curve offers designers. +

    +

    + While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform + Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, + they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the + last. +

    +

    Extending our implementation to cover rational splines

    +

    + The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the + control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to + one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the + extended dimension. +

    +

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    +

    + We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate + interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how + many dimensions it needs to interpolate. +

    +

    + In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the + final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight + information. +

    +

    + Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing + (x'/w', y'/w'). And that's it, we're done! +

    +
    +
    + -

    Comments and questions

    -

    First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing.

    -

    With that said, on to the comments!

    -
    +

    + + Comments and questions +

    +

    + First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how + to let me know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing. +

    +

    With that said, on to the comments!

    +
    +
    +
    +
    +
    -
    - -
    + -
    - - - - - - - - - \ No newline at end of file + + + diff --git a/docs/ja-JP/index.html b/docs/ja-JP/index.html index 4477e090..4d97811b 100644 --- a/docs/ja-JP/index.html +++ b/docs/ja-JP/index.html @@ -1,161 +1,177 @@ + + + + ベジェ曲線入門 - - - - ベジェ曲線入門 + - + - - - - + - - - + - :root[lang="ja-JP"] { - font-family: "ヒラギノ角ゴ Pro W3", "Hiragino Kaku Gothic Pro", "Osaka", "メイリオ", "Meiryo", "MS Pゴシック", "MS PGothic", "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 15.6px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    - + +
    +

    + ベジェ曲線入門 +

    +

    A free, online book for when you really need to know how to do Bézier things.

    +
    + Read this in your own language: + +

    + (Don't see your language listed, or want to see it reach 100%? + Help translate this content!) +

    +
    -
    - -

    ベジェ曲線入門

    -

    A free, online book for when you really need to know how to do Bézier things.

    -
    - Read this in your own language: - -

    (Don't see your language listed, or want to see it reach 100%? Help translate this content!)

    -
    - -

    - Welcome to the Primer on Bezier Curves. This is a free website/ebook dealing with both - the maths and programming aspects of Bezier Curves, covering a wide range of topics - relating to drawing and working with that curve that seems to pop up everywhere, from - Photoshop paths to CSS easing functions to Font outline descriptions. -

    -

    - If this is your first time here: welcome! Let me know if you were looking for anything - in particular that the primer doesn't cover over on the issue tracker! -

    - -

    Donations and sponsorship

    +

    + Welcome to the Primer on Bezier Curves. This is a free website/ebook dealing with both the maths and programming aspects of Bezier Curves, + covering a wide range of topics relating to drawing and working with that curve that seems to pop up everywhere, from Photoshop paths to CSS + easing functions to Font outline descriptions. +

    +

    + If this is your first time here: welcome! Let me know if you were looking for anything in particular that the primer doesn't cover over on the + issue tracker! +

    -

    - If this is a resource that you're using for research, as work reference, or even writing your own software, - please consider donating - (any amount helps) or signing up as a patron on Patreon. - I don't get paid to work on this, so if you find this site valuable, and you'd like it - to stick around for a long time to come, a lot of coffee went into writing this over the - years, and a lot more coffee will need to go into it yet: if you can spare a coffee, you'd - be helping keep a resource alive and well. -

    -

    - Also, if you are a company and your staff uses this book as a resource, or you use it as an - onboarding resource, then please: consider sponsoring the site! I am more than happy to work - with your finance department on sponsorship invoicing and recognition. -

    - +

    Donations and sponsorship

    - -
    +
    -

    - — Pomax -

    - + +
    - -
    +
    +
    +

    まえがき

    +

    + 2次元上になにかを描くとき、普通は線を使いますが、これは直線と曲線の2つに分類することができます。直線を描くのはとても簡単で、これをコンピュータに描かせるのも容易です。直線の始点と終点をコンピュータに与えてやれば、ポン!直線が描けました。疑問の余地もありません。 +

    +

    + しかしながら、曲線の方はもっと大きな問題です。私たちはフリーハンドでいとも簡単に曲線を描くことができますが、コンピュータの方は少し不利です。曲線の描き方を表した数学的な関数が与えられないと、コンピュータは曲線を描くことができないのです。実際には、直線でさえも関数が必要になります。直線の関数はとても簡単なので、わたしたちはよく無視してしまいますが、コンピュータにとっては直線であれ曲線であれ、線はすべて「関数」なのです。しかしこれは、コンピュータで速く計算できて、きれいな曲線が得られるような関数を見つける必要がある、ということになります。そのような関数はたくさんありますが、多くの関心を集め続け、そしてどんな場面でも使われている、ある特定の関数に対してこの記事では焦点を絞ります。この関数は「ベジエ」曲線を描きます。 +

    +

    + ベジエ曲線はPierre Bézierから名付けられました。この曲線がデザイン作業に適していることを世界に知らしめたのが、彼なのです(ルノーに勤務し、1962年にその研究を発表しました)。ただし、この曲線を「発明」したのは彼が最初で唯一というわけではありません。数学者Paul de Casteljauはシトロエンで働いていた1959年、この曲線の性質について研究し、ベジエ曲線の非常にエレガントな描き方を発見しました。これが最初だと言う人もいます。しかしながら、de + Casteljauは自分の成果を発表しなかったため、「誰が最初か?」という問いに答えるのがとても難しくなっています。またベジエ曲線は、核心的にはSergei Natanovich Bernsteinが研究した「ベルンシュタイン多項式」という数学関数の一種ですが、こちらの公刊に関しては少なくとも1912年まで遡ることができます。いずれにせよ、これらはほとんど瑣末なことです。より注目すべきなのは、ベジエ曲線は取り扱いに便利だいうことです。たとえば複数のベジエ曲線を繋いで、1つの曲線に見えるようにすることができます。もしあなたがPhotoshopで「パス」を描いたり、FlashやIllustrator、Inkscapeのようなベクタードローイングソフトを使ったことがあるのであれば、そこで描いてきた曲線はベジエ曲線です。 +

    +

    + では、これを自分でプログラムしなければならないとなったらどうでしょう?ハマりどころは何でしょうか?どうやってベジエ曲線を描くのでしょう?バウンディングボックスとは何で、どうやって交点を求め、どうやったら曲線を押し出せるのでしょうか?つまるところ、ベジエ曲線に対して行いたいあらゆる操作は、どのようにすればいいのでしょう?このページはそれに答えるためにあります。数学にとりかかりましょう! +

    +
    +

    追伸:コーヒーをおごってくれませんか?

    +

    + この本のことを印刷するほど気に入ったのであれば、あなたは「何かお礼をする方法はないか」と考えているかもしれません。この質問に対する答えはこうです:いつでもわたしにコーヒーをおごってください。あなたが住んでいるところの、コーヒー1杯の値段でかまいません。この本にかなりの金額を支払いたいのであれば、非常に高価なコーヒーをおごってくれてもかまいません + =) +

    +

    + この本は小さな入門からはじまり、印刷85ページ以上に相当するようなベジエ曲線の電子書籍へと、年々成長してきています。そして、多くのコーヒーがその執筆に費やされてきました。わたしは執筆に使った時間が惜しいとは思いませんが、もう少しコーヒーがあれば、ずっと書き続けることができるのです!https://pomax.github.io/bezierinfo/にアクセスしてオンライン版のまえがきにあるリンクをクリックし、コーヒー代を寄付してください。 +

    +
    -
    - -

    まえがき

    -

    2次元上になにかを描くとき、普通は線を使いますが、これは直線と曲線の2つに分類することができます。直線を描くのはとても簡単で、これをコンピュータに描かせるのも容易です。直線の始点と終点をコンピュータに与えてやれば、ポン!直線が描けました。疑問の余地もありません。

    -

    しかしながら、曲線の方はもっと大きな問題です。私たちはフリーハンドでいとも簡単に曲線を描くことができますが、コンピュータの方は少し不利です。曲線の描き方を表した数学的な関数が与えられないと、コンピュータは曲線を描くことができないのです。実際には、直線でさえも関数が必要になります。直線の関数はとても簡単なので、わたしたちはよく無視してしまいますが、コンピュータにとっては直線であれ曲線であれ、線はすべて「関数」なのです。しかしこれは、コンピュータで速く計算できて、きれいな曲線が得られるような関数を見つける必要がある、ということになります。そのような関数はたくさんありますが、多くの関心を集め続け、そしてどんな場面でも使われている、ある特定の関数に対してこの記事では焦点を絞ります。この関数は「ベジエ」曲線を描きます。

    -

    ベジエ曲線はPierre Bézierから名付けられました。この曲線がデザイン作業に適していることを世界に知らしめたのが、彼なのです(ルノーに勤務し、1962年にその研究を発表しました)。ただし、この曲線を「発明」したのは彼が最初で唯一というわけではありません。数学者Paul de Casteljauはシトロエンで働いていた1959年、この曲線の性質について研究し、ベジエ曲線の非常にエレガントな描き方を発見しました。これが最初だと言う人もいます。しかしながら、de Casteljauは自分の成果を発表しなかったため、「誰が最初か?」という問いに答えるのがとても難しくなっています。またベジエ曲線は、核心的にはSergei Natanovich Bernsteinが研究した「ベルンシュタイン多項式」という数学関数の一種ですが、こちらの公刊に関しては少なくとも1912年まで遡ることができます。いずれにせよ、これらはほとんど瑣末なことです。より注目すべきなのは、ベジエ曲線は取り扱いに便利だいうことです。たとえば複数のベジエ曲線を繋いで、1つの曲線に見えるようにすることができます。もしあなたがPhotoshopで「パス」を描いたり、FlashやIllustrator、Inkscapeのようなベクタードローイングソフトを使ったことがあるのであれば、そこで描いてきた曲線はベジエ曲線です。

    -

    では、これを自分でプログラムしなければならないとなったらどうでしょう?ハマりどころは何でしょうか?どうやってベジエ曲線を描くのでしょう?バウンディングボックスとは何で、どうやって交点を求め、どうやったら曲線を押し出せるのでしょうか?つまるところ、ベジエ曲線に対して行いたいあらゆる操作は、どのようにすればいいのでしょう?このページはそれに答えるためにあります。数学にとりかかりましょう!

    -
    - -

    追伸:コーヒーをおごってくれませんか?

    -

    この本のことを印刷するほど気に入ったのであれば、あなたは「何かお礼をする方法はないか」と考えているかもしれません。この質問に対する答えはこうです:いつでもわたしにコーヒーをおごってください。あなたが住んでいるところの、コーヒー1杯の値段でかまいません。この本にかなりの金額を支払いたいのであれば、非常に高価なコーヒーをおごってくれてもかまいません =)

    -

    この本は小さな入門からはじまり、印刷85ページ以上に相当するようなベジエ曲線の電子書籍へと、年々成長してきています。そして、多くのコーヒーがその執筆に費やされてきました。わたしは執筆に使った時間が惜しいとは思いませんが、もう少しコーヒーがあれば、ずっと書き続けることができるのです!https://pomax.github.io/bezierinfo/にアクセスしてオンライン版のまえがきにあるリンクをクリックし、コーヒー代を寄付してください。

    -
    - -
    - -

    注:ベジエの図はすべてインタラクティブになっています。

    -

    このページではBezier.jsを活用し、インタラクティブな例示を行っています。

    - -

    本書はオープンソースです。

    -

    この本はオープンソースソフトウェアのプロジェクトで、2つのGitHubリポジトリ上に存在しています。1つ目のhttps://github.com/pomax/bezierinfoは表示のためだけに用意されたバージョンで、あなたが今読んでいるものです。もう一方のhttps://github.com/pomax/BezierInfo-2は開発版で、すべてのHTML・JavaScript・CSSが含まれています。どちらのリポジトリもフォークすることができますし、あなたの好きなように使ってかまいません。ただし、これを自分の成果だと騙って売ることはもちろん除きます。=)

    -

    数学はどこまで難しくなりますか?

    -

    この入門に出てくる数学は、大半が高校初年度程度です。基本的な計算を理解していて、英語の読み方が分かっていれば、こなすことができるはずです。時にはずっと難しい数学も出てきますが、もし理解できないように感じたら、そこは読み飛ばしても大丈夫です。節の中の「詳細欄」を飛ばしてもいいですし、厄介そうな数学があれば節の最後まで読み飛ばしてもかまいません。各節の最後にはたいてい結論を並べてありますので、これを直に利用することもできます。

    -

    質問・コメント:

    -

    新しい節の提案があれば、GitHubのissueトラッカーをクリックしてください(右上にあるリポジトリのリンクからでもたどり着けます)。改訂中のため、現在はこのページにはコメント欄がありませんが、内容に関する質問がある場合にもissueトラッカーを使ってかまいません。改訂が完了したら、総合的なコメント欄を復活させる予定です。あるいは、「質問対象の節を選択して『質問』ボタンをクリック」するような項目別のシステムになるかもしれません。いずれわかります。

    -

    コーヒーをおごってくれませんか?

    -

    この本が気に入った場合や、取り組んでいたことに役に立つと思った場合、あるいは、この本への感謝をわたしに伝えるにはどうすればいいのかわからない場合。そのような場合には、わたしにコーヒーをおごってください。あなたが住んでいるところのコーヒー1杯の値段でかまいません。この本は小さな入門からはじまり、印刷で70ページほどに相当するようなベジエ曲線の読み物へと、年々成長してきています。そして、多くのコーヒーが執筆に費やされてきました。わたしは執筆に使った時間が惜しいとは思いませんが、もう少しコーヒーがあれば、ずっと書き続けることができるのです!

    -
    -
    -
    -

    What's new?

    -

    This primer is a living document, and so depending on when you last look at it, there may be new content. Click the following link to expand this section to have a look at what got added, when, or click through to the News posts for more detailed updates. (RSS feed available)

    - - - -

    November 2020

    • Added a section on finding curve/circle intersections

      -
    -

    October 2020

    • Added the Ukranian locale! Help out in getting its localization to 100%!

      -
    -

    August-September 2020

    • Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS turned on.

      -
    -

    June 2020

    • Added automatic CI/CD using Github Actions

      -
    -

    January 2020

    • Added reset buttons to all graphics

      -
    • -
    • Updated to preface to correctly describe the on-page maths

      -
    • -
    • Fixed the Catmull-Rom section because it had glaring maths errors

      -
    -

    August 2019

    • Added a section on (plain) rational Bezier curves

      -
    • -
    • Improved the Graphic component to allow for sliders

      -
    -

    December 2018

    • Added a section on curvature and calculating kappa.

      -
    • -
    • Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this site!

      -
    -

    August 2018

    • Added a section on finding a curve's y, if all you have is the x coordinate.

      -
    -

    July 2018

    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

      -
    • -
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

      -
    • -
    • (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      -
    -

    June 2018

    • Added a section on direct curve fitting.

      -
    • -
    • Added source links for all graphics.

      -
    • -
    • Added this "What's new?" section.

      -
    -

    April 2017

    • Added a section on 3d normals.

      -
    • -
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

      -
    -

    February 2017

    • Finished rewriting the entire codebase for localization.

      -
    -

    January 2016

    • Added a section to explain the Bezier interval.

      -
    • -
    • Rewrote the Primer as a React application.

      -
    -

    December 2015

    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

      -
    • -
    • Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics.

      -
    -

    May 2015

    • Switched over to pure JS rather than Processing-through-Processing.js

      -
    • -
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

      -
    -

    April 2015

    • Added a section on arc length approximations.

      -
    -

    February 2015

    • Added a section on the canonical cubic Bezier form.

      -
    -

    November 2014

    • Switched to HTTPS.

      -
    -

    July 2014

    • Added the section on arc approximation.

      -
    -

    April 2014

    • Added the section on Catmull-Rom fitting.

      -
    -

    November 2013

    • Added the section on Catmull-Rom / Bezier conversion.

      -
    • -
    • Added the section on Bezier cuves as matrices.

      -
    -

    April 2013

    • Added a section on poly-Beziers.

      -
    • -
    • Added a section on boolean shape operations.

      -
    -

    March 2013

    • First drastic rewrite.

      -
    • -
    • Added sections on circle approximations.

      -
    • -
    • Added a section on projecting a point onto a curve.

      -
    • -
    • Added a section on tangents and normals.

      -
    • -
    • Added Legendre-Gauss numerical data tables.

      -
    -

    October 2011

    • First commit for the bezierinfo site, based on the pre-Primer webpage that covered the basics of Bezier curves in HTML with Processing.js examples.

      -
    -
    -
    -

    バッとした導入

    -

    まずは良い例から始めましょう。ベジエ曲線というのは、下の図に表示されているもののことです。ベジエ曲線はある始点からある終点へと延びており、その曲率は1個以上の「中間」制御点に左右されています。さて、このページの図はどれもインタラクティブになっていますので、ここで曲線をちょっと操作してみましょう。点をドラッグしたとき、曲線の形がそれに応じてどう変化するのか、確かめてみてください。

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    +

    本書はオープンソースです。

    +

    + この本はオープンソースソフトウェアのプロジェクトで、2つのGitHubリポジトリ上に存在しています。1つ目のhttps://github.com/pomax/bezierinfoは表示のためだけに用意されたバージョンで、あなたが今読んでいるものです。もう一方のhttps://github.com/pomax/BezierInfo-2は開発版で、すべてのHTML・JavaScript・CSSが含まれています。どちらのリポジトリもフォークすることができますし、あなたの好きなように使ってかまいません。ただし、これを自分の成果だと騙って売ることはもちろん除きます。=) +

    +

    数学はどこまで難しくなりますか?

    +

    + この入門に出てくる数学は、大半が高校初年度程度です。基本的な計算を理解していて、英語の読み方が分かっていれば、こなすことができるはずです。時にはずっと難しい数学も出てきますが、もし理解できないように感じたら、そこは読み飛ばしても大丈夫です。節の中の「詳細欄」を飛ばしてもいいですし、厄介そうな数学があれば節の最後まで読み飛ばしてもかまいません。各節の最後にはたいてい結論を並べてありますので、これを直に利用することもできます。 +

    +

    質問・コメント:

    +

    + 新しい節の提案があれば、GitHubのissueトラッカーをクリックしてください(右上にあるリポジトリのリンクからでもたどり着けます)。改訂中のため、現在はこのページにはコメント欄がありませんが、内容に関する質問がある場合にもissueトラッカーを使ってかまいません。改訂が完了したら、総合的なコメント欄を復活させる予定です。あるいは、「質問対象の節を選択して『質問』ボタンをクリック」するような項目別のシステムになるかもしれません。いずれわかります。 +

    +

    コーヒーをおごってくれませんか?

    +

    + この本が気に入った場合や、取り組んでいたことに役に立つと思った場合、あるいは、この本への感謝をわたしに伝えるにはどうすればいいのかわからない場合。そのような場合には、わたしにコーヒーをおごってください。あなたが住んでいるところのコーヒー1杯の値段でかまいません。この本は小さな入門からはじまり、印刷で70ページほどに相当するようなベジエ曲線の読み物へと、年々成長してきています。そして、多くのコーヒーが執筆に費やされてきました。わたしは執筆に使った時間が惜しいとは思いませんが、もう少しコーヒーがあれば、ずっと書き続けることができるのです! +

    + +
    +
    +

    What's new?

    +

    + This primer is a living document, and so depending on when you last look at it, there may be new content. Click the following link to expand + this section to have a look at what got added, when, or click through to the News posts for more detailed updates. (RSS feed + available) +

    + + + +
    +

    November 2020

    +
      +
    • Added a section on finding curve/circle intersections

    • +
    +

    October 2020

    +
      +
    • Added the Ukranian locale! Help out in getting its localization to 100%!

    • +
    +

    August-September 2020

    +
      +
    • +

      + Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS + turned on. +

      +
    • +
    +

    June 2020

    +
      +
    • Added automatic CI/CD using Github Actions

    • +
    +

    January 2020

    +
      +
    • Added reset buttons to all graphics

    • +
    • Updated to preface to correctly describe the on-page maths

    • +
    • Fixed the Catmull-Rom section because it had glaring maths errors

    • +
    +

    August 2019

    +
      +
    • Added a section on (plain) rational Bezier curves

    • +
    • Improved the Graphic component to allow for sliders

    • +
    +

    December 2018

    +
      +
    • Added a section on curvature and calculating kappa.

    • +
    • +

      + Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this + site! +

      +
    • +
    +

    August 2018

    +
      +
    • Added a section on finding a curve's y, if all you have is the x coordinate.

    • +
    +

    July 2018

    +
      +
    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

    • +
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

    • +
    • +

      (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      +
    • +
    +

    June 2018

    +
      +
    • Added a section on direct curve fitting.

    • +
    • Added source links for all graphics.

    • +
    • Added this "What's new?" section.

    • +
    +

    April 2017

    +
      +
    • Added a section on 3d normals.

    • +
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

    • +
    +

    February 2017

    +
      +
    • Finished rewriting the entire codebase for localization.

    • +
    +

    January 2016

    +
      +
    • Added a section to explain the Bezier interval.

    • +
    • Rewrote the Primer as a React application.

    • +
    +

    December 2015

    +
      +
    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

    • +
    • +

      + Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics. +

      +
    • +
    +

    May 2015

    +
      +
    • Switched over to pure JS rather than Processing-through-Processing.js

    • +
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

    • +
    +

    April 2015

    +
      +
    • Added a section on arc length approximations.

    • +
    +

    February 2015

    +
      +
    • Added a section on the canonical cubic Bezier form.

    • +
    +

    November 2014

    +
      +
    • Switched to HTTPS.

    • +
    +

    July 2014

    +
      +
    • Added the section on arc approximation.

    • +
    +

    April 2014

    +
      +
    • Added the section on Catmull-Rom fitting.

    • +
    +

    November 2013

    +
      +
    • Added the section on Catmull-Rom / Bezier conversion.

    • +
    • Added the section on Bezier cuves as matrices.

    • +
    +

    April 2013

    +
      +
    • Added a section on poly-Beziers.

    • +
    • Added a section on boolean shape operations.

    • +
    +

    March 2013

    +
      +
    • First drastic rewrite.

    • +
    • Added sections on circle approximations.

    • +
    • Added a section on projecting a point onto a curve.

    • +
    • Added a section on tangents and normals.

    • +
    • Added Legendre-Gauss numerical data tables.

    • +
    +

    October 2011

    +
      +
    • +

      + First commit for the bezierinfo site, based on the pre-Primer webpage that covered + the basics of Bezier curves in HTML with Processing.js examples. +

      +
    • +
    +
    +
    +
    +
    +

    + + バッとした導入 +

    +

    + まずは良い例から始めましょう。ベジエ曲線というのは、下の図に表示されているもののことです。ベジエ曲線はある始点からある終点へと延びており、その曲率は1個以上の「中間」制御点に左右されています。さて、このページの図はどれもインタラクティブになっていますので、ここで曲線をちょっと操作してみましょう。点をドラッグしたとき、曲線の形がそれに応じてどう変化するのか、確かめてみてください。 +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -

    ベジエ曲線は、CAD(computer aided designやCAM(computer aided manufacturing)のアプリケーションで多用されています。もちろん、Adobe Illustrator・Photoshop・Inkscape・Gimp などのグラフィックデザインアプリケーションや、SVG(scalable vector graphics)・OpenTypeフォント(otf/ttf)のようなグラフィック技術でも利用されています。ベジエ曲線はたくさんのものに使われていますので、これについてもっと詳しく学びたいのであれば……さあ、準備しましょう!

    - -
    -
    -

    ではベジエ曲線はどうやってできるのでしょう?

    -

    ベジエ曲線がどのように動くのか、点を触ってみて感覚が摑めたかもしれません。では、実際のところベジエ曲線とはいったい何でしょうか?これを説明する方法は2通りありますが、どちらの説明でも行き着く先はまったく同じです。一方は複雑な数学を使うのに対し、もう一方はとても簡単です。というわけで……簡単な説明の方から始めましょう。

    -

    ベジエ曲線は、線形補間の結果です。というと難しそうに聞こえますが、誰でも幼い頃から線形補間をやってきています。例えば、何か2つのものの間を指し示すときには、いつも線形補間を行っています。線形補間とは、単純に「2点の間から点を得る」ことなのです。

    -

    例えば、2点間の距離がわかっているとして、一方の点から距離の20%だけ離れた(すなわち、もう一方の点から80%離れた)新しい点を求めたい場合、次のようにとても簡単に計算できます。

    - - -

    では、実際に見てみましょう。下の図はインタラクティブになっています。上下キーで補間の比率が増減しますので、どうなるか確かめてみましょう。最初に3点があり、それを結んで2本の直線が引かれています。この直線の上でそれぞれ線形補間を行うと、2つの点が得られます。この2点の間でさらに線形補間を行うと、1つの点を得ることができます。そして、あらゆる比率に対して同様に点を求め、それをすべて集めると、このようにベジエ曲線ができるのです。

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    また、これが複雑な方の数学につながっていきます。微積分です。

    -

    いま上で行ったものとは似つかないように思えますが、実はあれは2次曲線を描いていたのです。ただし一発で描きあげるのではなく、手順を追って描いていきました。ベジエ曲線は多項式関数で表現できますが、その一方で、とても単純な補間の補間の補間の……というふうにも説明できます。これがベジエ曲線のおもしろいところです。これはまた、ベジェ曲線は「本当の数学」で見る(関数を調べたり微分を調べたり、あらゆる方法で)ことも可能ですし、「機械的」な組み立て操作として見る(例えば、ベジエ曲線は組み立てに使う点の間からは決してはみ出ないということがわかります)ことも可能だということを意味しています。

    -

    それでは、もう少し詳しくベジエ曲線を見ていきましょう。数学的な表現やそこから導かれる性質、さらには、ベジエ曲線に対して/ベジエ曲線を使ってできるさまざまな内容についてです。

    - -
    -
    -

    ベジエ曲線の数学

    -

    ベジエ曲線は「パラメトリック」関数の一種です。数学的に言えば、パラメトリック関数というのはインチキです。というのも、「関数」はきっちり定義された用語であり、いくつかの入力を1つの出力に対応させる写像を表すものだからです。いくつかの数値を入れると、1つの数値が出てきます。入れる数値が変わっても、出てくる数値はやはり1つだけです。パラメトリック関数はインチキです。基本的には「じゃあわかった、値を複数個出したいから、関数を複数個使うことにするよ」ということです。例として、ある値xに何らかの操作を行い、別の値へと写す関数があるとします。

    - - -

    f(x)という記法は、これが関数(1つしかない場合は慣習的にfと呼びます)であり、その出力が1つの変数(この場合はxです)に応じて変化する、ということを示す標準的な方法です。xを変化させると、f(x)の出力が変化します。

    -

    ここまでは順調です。では、パラメトリック関数について、これがどうインチキなのかを見てみましょう。以下の2つの関数を考えます。

    - - -

    注目すべき箇所は特に何もありません。ただの正弦関数と余弦関数です。ただし、入力が別々の名前になっていることに気づくでしょう。仮にaの値を変えたとしても、f(b)の出力の値は変わらないはずです。なぜなら、こちらの関数にはaは使われていないからです。パラメトリック関数は、これを変えてしまうのでインチキなのです。パラメトリック関数においては、どの関数も変数を共有しています。例えば、

    - - -

    複数の関数がありますが、変数は1つだけです。tの値を変えた場合、fa(t)fb(t)の両方の出力が変わります。これがどのように役に立つのか、疑問に思うかもしれません。しかし、実際には答えは至ってシンプルです。fa(t)fb(t)のラベルを、パラメトリック曲線の表示によく使われているもので置き換えてやれば、ぐっとはっきりするかと思います。

    - - -

    きました。x/y座標です。謎の値tを通して繫がっています。

    -

    というわけで、普通の関数ではy座標をx座標によって定義しますが、パラメトリック曲線ではそうではなく、座標の値を「制御」変数と結びつけます。tの値を変化させるたびに2つの値が変化するので、これをグラフ上の座標 (x,y)として使うことができます。例えば、先ほどの関数の組は円周上の点を生成します。負の無限大から正の無限大へとtを動かすと、得られる座標(x,y)は常に中心(0,0)・半径1の円の上に乗ります。tを0から5まで変化させてプロットした場合は、このようになります。

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    ベジエ曲線はパラメトリック関数の一種であり、どの次元に対しても同じ基底関数を使うという点で特徴づけられます。先ほどの例では、xの値とyの値とで異なる関数(正弦関数と余弦関数)を使っていましたが、ベジエ曲線ではxyの両方で「二項係数多項式」を使います。では、二項係数多項式とは何でしょう?

    -

    高校で習った、こんな形の多項式を思い出すかもしれません。

    - - -

    最高次の項がであれば3次多項式、であれば2次多項式と呼び、xだけの場合は1次多項式――ただの直線です。(そしてxの入った項が何もなければ、多項式ではありません!)

    -

    ベジエ曲線はxの多項式ではなく、tの多項式です。tの値は0から1までの間に制限され、その係数abなどは「二項係数」の形をとります。というと複雑そうに聞こえますが、実際には値を組み合わせて、とてもシンプルに記述できます。

    - - -

    「そこまでシンプルには見えないよ」と思っていることでしょう。しかし仮に、tを取り去って係数に1を掛けることにしてしまえば、急激に簡単になります。これが二項係数部分の項です。

    - - -

    2は1+1に等しく、3は2+1や1+2に等しく、6は3+3に等しく、……ということに注目してください。見てわかるように、先頭と末尾は単に1になっていますが、中間はどれも次数が増えるたびに「上の2つの数を足し合わせた」ものになっています。これなら覚えやいですね。

    -

    多項式部分の項がどうなっているのか、同じぐらい簡単な方法で考えることができます。仮に、(1-t)aに、tbに書き換え、さらに重みを一旦削除してしまえば、このようになります。

    - - -

    これは要するに、「abのすべての組み合わせ」の単なる和です。プラスが出てくるたびに、abへと1つずつ置き換えていけばよいのです。こちらも本当に単純です。さて、これで「二項係数多項式」がわかりました。完璧を期するため、この関数の一般の形を示しておきます。

    - - -

    そして、これがベジエ曲線の完全な表現です。この関数中のΣは、加算の繰り返し(Σの下にある変数を使って、...=<値>から始めてΣの下にある値まで)を表します。

    -
    + +

    + そして、これがベジエ曲線の完全な表現です。この関数中のΣは、加算の繰り返し(Σの下にある変数を使って、...=<値>から始めてΣの下にある値まで)を表します。 +

    +
    +

    基底関数の実装方法

    +

    上で説明した関数を使えば、数学的な組み立て方で、基底関数をナイーブに実装することもできます。

    -

    基底関数の実装方法

    -

    上で説明した関数を使えば、数学的な組み立て方で、基底関数をナイーブに実装することもできます。

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    「こともできる」と書いたのは、この方法では実装しない方が良いからです。階乗はとてつもなく重い計算なのです。また、先ほどの説明からわかるように、実際は階乗を使わなくても、かなり簡単にパスカルの三角形を作ることができます。[1]から始めて[1,1]、[1,2,1]、[1,3,3,1]、……としていくだけです。下の段は上の段よりも1つ要素が増え、各段の先頭と末尾は1になります。中間の数はどれも、左右斜め上にある両要素の和になります。

    -

    このパスカルの三角形は、「リストのリスト」として瞬時に生成できます。そして、これをルックアップテーブルとして利用すれば、二項係数を計算する必要はまったくなくなります。

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    - -

    これはどのように動くのでしょう?最初に、十分に大きなサイズのルックアップテーブルを宣言します。次に、求めたい値を得るための関数を定義します。この関数は、求めたい値のn/kのペアがテーブル中にまだ存在しない場合、先にテーブルを拡張するようになっています。さて、これで基底関数は次のようになりました。

    + return lut[n][k] + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + - - - - -
    1 - -
    2
    3
    4
    5
    - -

    完璧です。もちろん、さらなる最適化を施すこともできます。コンピュータグラフィクス用途ではたいてい、任意の次数の曲線が必要になるわけではありません。2次と3次の曲線だけが必要であれば、以下のようにコードを劇的に単純化することができます(実際、この入門では任意の次数までは扱いませんので、これに似たようなコードが出てきます)。

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    これで基底関数をプログラムする方法がわかりました。すばらしい。

    -
    + return mt3 + 3*mt2*t + 3*mt*t2 + t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    というわけで、基底関数がどのようなものか理解できました。今度はベジエ曲線を特別にする魔法――制御点を導入する時間です。

    +

    これで基底関数をプログラムする方法がわかりました。すばらしい。

    +
    -
    -
    -

    ベジエ曲線の曲率の制御

    -

    ベジエ曲線は(すべての「スプライン」と同様に)補間関数です。これは点の集合を受け取って、それらの点のどこか「内側」の値を生成するということです。(このことから、制御点同士を結んで輪郭をつくったとき、その外側に位置する点は決して生成されないことがわかります。なお、この輪郭を曲線の「包」と呼びます。お役立ち情報でした!)実際に、補間関数によって生成された値に対する、各点の寄与の大きさを可視化することができますが、これを見れば、ベジエ曲線のどの場所でどの点が重要になるのかがわかります。

    -

    下のグラフは、2次ベジエ曲線や3次ベジエ曲線の補間関数を表しています。ここでSは、ベジエ関数全体に対しての、その点の寄与の大きさを示します。あるtにおいて、ベジエ曲線を定義する各点の補間率がどのようになっているのか、クリックドラッグをして確かめてみてください。

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
    +

    というわけで、基底関数がどのようなものか理解できました。今度はベジエ曲線を特別にする魔法――制御点を導入する時間です。

    +
    +
    +

    + + ベジエ曲線の曲率の制御 +

    +

    + ベジエ曲線は(すべての「スプライン」と同様に)補間関数です。これは点の集合を受け取って、それらの点のどこか「内側」の値を生成するということです。(このことから、制御点同士を結んで輪郭をつくったとき、その外側に位置する点は決して生成されないことがわかります。なお、この輪郭を曲線の「包」と呼びます。お役立ち情報でした!)実際に、補間関数によって生成された値に対する、各点の寄与の大きさを可視化することができますが、これを見れば、ベジエ曲線のどの場所でどの点が重要になるのかがわかります。 +

    +

    + 下のグラフは、2次ベジエ曲線や3次ベジエ曲線の補間関数を表しています。ここでSは、ベジエ関数全体に対しての、その点の寄与の大きさを示します。あるtにおいて、ベジエ曲線を定義する各点の補間率がどのようになっているのか、クリックドラッグをして確かめてみてください。 +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
    -

    あわせて、15次ベジエ関数における補間関数も示しています。始点と終点は他の制御点と比較して、曲線の形に対してかなり大きな影響を与えていることがわかります。

    -

    曲線を変更したい場合は、各点の重みを変える(実質的には補間率を変える)必要があります。これはとても単純で、寄与の大きさを変えるための値を、各点にただ掛ければいいのです。この値は「重み」と呼ばれていますが、これを元のベジエ関数に組み込めば、次のようになります。

    - - -

    複雑そうに見えますが、運がいいことに「重み」というのは実はただの座標値です。というのはn次の曲線の場合、w0が始点の座標、wnが終点の座標となり、その間はどれも制御点の座標になります。例えば、始点が(120,160)、制御点が(35,200)と(220,260)、終点が(220,40)となる3次ベジエ曲線は、次のようになります。

    - - -

    この式からは、記事の冒頭に出てきた曲線が得られます。

    - + +

    この式からは、記事の冒頭に出てきた曲線が得られます。

    + -

    ベジエ曲線で、他にはどんなことができるでしょうか?実は、非常にたくさんのことが可能です。この記事の残りの部分では、実現可能な各種操作や適用可能なアルゴリズム、そしてこれによって達成できるタスクについて扱います。

    -
    +

    + ベジエ曲線で、他にはどんなことができるでしょうか?実は、非常にたくさんのことが可能です。この記事の残りの部分では、実現可能な各種操作や適用可能なアルゴリズム、そしてこれによって達成できるタスクについて扱います。 +

    +
    +

    重みつき基底関数の実装方法

    +

    基底関数の実装方法はすでに知っていますし、これに制御点を組み込むのは非常に簡単です。

    -

    重みつき基底関数の実装方法

    -

    基底関数の実装方法はすでに知っていますし、これに制御点を組み込むのは非常に簡単です。

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    そして、最適化を行ったバージョンは以下のようになります。

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    これで、重みつき基底関数をプログラムする方法がわかりました。

    -
    + return w[0]*mt3 + 3*w[1]*mt2*t + 3*w[2]*mt*t2 + w[3]*t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -
    -
    -

    Controlling Bézier curvatures, part 2: Rational Béziers

    -

    We can further control Bézier curves by "rationalising" them: that is, adding a "ratio" value in addition to the weight value discussed in the previous section, thereby gaining control over "how strongly" each coordinate influences the curve.

    -

    Adding these ratio values to the regular Bézier curve function is fairly easy. Where the regular function is the following:

    - - -

    The function for rational Bézier curves has two more terms:

    - - -

    In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just using different weight. So far, nothing too special.

    -

    However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights.

    -

    This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly describing circles (which we'll see in a later section is literally impossible using standard Bézier curves).

    -

    But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects what gets drawn:

    - - - Scripts are disabled. Showing fallback image. - - - - - - - - -

    You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence relative to all other points.

    -
    + +

    + In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] + then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just + using different weight. So far, nothing too special. +

    +

    + However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a + fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the + "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights. +

    +

    + This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve + that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly + describing circles (which we'll see in a later section is literally impossible using standard Bézier curves). +

    +

    + But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an + interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with + ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate + exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects + what gets drawn: +

    + + + Scripts are disabled. Showing fallback image. + + + + + + + + +

    + You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will + want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like + with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence + relative to all other points. +

    +
    +

    How to implement rational curves

    +

    Extending the code of the previous section to include ratios is almost trivial:

    -

    How to implement rational curves

    -

    Extending the code of the previous section to include ratios is almost trivial:

    - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    - -

    And that's all we have to do.

    -
    + return (f[0] * w[0] + f[1] * w[1] + f[2] * w[2] + f[3] * w[3])/basis + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + -
    -
    -

    ベジエ曲線の区間 [0,1]

    -

    ここまでの説明で、ベジエ曲線の裏側にある数学がわかりました。しかし、気づいているかもしれませんが、ひとつ引っかかる点があります。ベジエ曲線はいつもt=0からt=1の間を動いていますが、どうしてこの特定の区間なのでしょう?

    -

    このことは、曲線の「始点」から曲線の「終点」までどうやって動かすか、ということにすべて関係しています。2つの値を混ぜ合わせて1つの値をつくる場合、一般の式は次のようになります。

    - - -

    明らかに、始点ではa=1, b=0とする必要があります。こうすれば、値1が100%、値2が0%で混ぜ合わさるからです。また、終点ではa=0, b=1とする必要があります。こうすれば、値1が0%、値2が100%で混ぜ合わさります。これに加えて、abを独立にしておきたくはありません。独立になっている場合、何でも好きな値にすることできますが、こうすると例えば「値1が100%かつ値2が100%」のようなことが可能になってしまいます。これはこれで原則としてはかまいませんが、ベジエ曲線の場合は混ぜ合わさった値が常に始点と終点のになってほしいのです。というわけで、混ぜ合わせの和が100%を決して超えないように、abの値を設定する必要があります。これは次のようにすれば簡単です。

    - - -

    こうすれば、和が100%を超えることはないと保証できます。aの値を区間[0,1]に制限してしまえば、混ぜ合わさった値は常に2つの値の間のどこか(両端を含む)になり、また和は常に100%になります。

    -

    しかし……この式を0と1の間の値だけで使うのではなく、もし仮にこの区間の外の値で使うとしたら、どうなるのでしょう?めちゃくちゃになってしまうのでしょうか?……実はそうではありません。「その先」が見えるのです。

    -

    ベジエ曲線の場合、区間を広げると曲線は単純に「そのまま延びて」いきます。ベジエ曲線はある多項式曲線の一部分にすぎませんので、単純に区間を広くとればとるほど、曲線のより多くの部分が現れるようになります。では、どのように見えるのでしょうか?

    -

    下の2つの図は「いつもの方法」で描いたベジエ曲線ですが、これと一緒に、tの値をずっと先まで広げた場合の「延びた」ベジエ曲線も表示しています。見てわかるように、曲線の残りの部分には多くの「かたち」が隠れています。そして曲線の点を動かせば、その部分の形状も変わります。

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    + +

    + こうすれば、和が100%を超えることはないと保証できます。aの値を区間[0,1]に制限してしまえば、混ぜ合わさった値は常に2つの値の間のどこか(両端を含む)になり、また和は常に100%になります。 +

    +

    + しかし……この式を0と1の間の値だけで使うのではなく、もし仮にこの区間の外の値で使うとしたら、どうなるのでしょう?めちゃくちゃになってしまうのでしょうか?……実はそうではありません。「その先」が見えるのです。 +

    +

    + ベジエ曲線の場合、区間を広げると曲線は単純に「そのまま延びて」いきます。ベジエ曲線はある多項式曲線の一部分にすぎませんので、単純に区間を広くとればとるほど、曲線のより多くの部分が現れるようになります。では、どのように見えるのでしょうか? +

    +

    + 下の2つの図は「いつもの方法」で描いたベジエ曲線ですが、これと一緒に、tの値をずっと先まで広げた場合の「延びた」ベジエ曲線も表示しています。見てわかるように、曲線の残りの部分には多くの「かたち」が隠れています。そして曲線の点を動かせば、その部分の形状も変わります。 +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -

    実際に、グラフィックデザインやコンピュータモデリングで使われている曲線の中には、座標が固定されていて、区間は自由に動かせるような曲線があります。これは、区間が固定されていて、座標を自由に動かすことのできるベジエ曲線とは反対になっています。すばらしい例が「Spiro」曲線で、これはオイラー螺旋とも呼ばれるクロソイド曲線の一部分に基づいた曲線です。非常に美しく心地よい曲線で、FontForgeInkscapeなど多くのグラフィックアプリに実装されており、フォントデザインにも利用されています(Inconsolataフォントなど)。

    - -
    -
    -

    行列演算としてのベジエ曲線の曲率

    -

    ベジエ曲線は、行列演算の形でも表現することができます。ベジエ曲線の式を多項式基底と係数行列で表し、実際の座標も行列として表現するのです。これがどういうことを意味しているのか、3次ベジエ曲線について見てみましょう。

    - - -

    実際の座標を一旦無視すると、次のようになります。

    - - -

    これは、4つの項の和になっています。

    - - -

    それぞれの項を展開します。

    - - -

    その上で、係数の0や1もすべて明示的に書けば、このようになります。

    - - -

    さらに、これは4つの行列演算の和として見ることができます。

    - - -

    これを1つの行列演算にまとめると、以下のようになります。

    - - -

    多項式基底をこのような形で表現する場合、通常はその基底を昇冪の順に並べます。したがって、tの行列を左右反転させ、大きな「混合」行列は上下に反転させる必要があります。

    - - -

    そして最後に、もともとあった座標を3番目の行列として付け加えます。

    - - -

    2次ベジエ曲線の場合も同様に変形することができ、最終的には以下のようになります。

    - - -

    tの値を代入して行列の乗算を行えば、もともとの多項式関数から計算したときの値や、線形補間によって順次求めたときの値と、まったく同じものが得られます。

    -

    では、なぜわざわざ行列を使うのでしょう? 行列表現を使うことによって、他の表現ではわからなかった関数の性質を発見できるようになります。ベジエ曲線は三角行列の形になり、その行列式は実際の座標の積に等しくなることがわかります。また、この行列には逆行列が存在しますが、これはさまざまな性質が満たされることを意味します。もっとも、疑問の中心は「それでなぜこれが役に立つのか?」という点でしょうが、「ただちには役に立たない」というのが回答です。しかしながら、この先に出てくる曲線のプロパティの中には、関数を操作して求めることも、また行列をうまいこと利用して求めることも、どちらでも可能な例があります。そしてときには、行列による手法の方が(劇的に)速くなる場合があるのです。

    -

    というわけで、現時点では「ベジエ曲線は行列で表現可能」ということだけを覚えて、次に進みましょう。

    + +

    + tの値を代入して行列の乗算を行えば、もともとの多項式関数から計算したときの値や、線形補間によって順次求めたときの値と、まったく同じものが得られます。 +

    +

    + では、なぜわざわざ行列を使うのでしょう? + 行列表現を使うことによって、他の表現ではわからなかった関数の性質を発見できるようになります。ベジエ曲線は三角行列の形になり、その行列式は実際の座標の積に等しくなることがわかります。また、この行列には逆行列が存在しますが、これはさまざまな性質が満たされることを意味します。もっとも、疑問の中心は「それでなぜこれが役に立つのか?」という点でしょうが、「ただちには役に立たない」というのが回答です。しかしながら、この先に出てくる曲線のプロパティの中には、関数を操作して求めることも、また行列をうまいこと利用して求めることも、どちらでも可能な例があります。そしてときには、行列による手法の方が(劇的に)速くなる場合があるのです。 +

    +

    というわけで、現時点では「ベジエ曲線は行列で表現可能」ということだけを覚えて、次に進みましょう。

    +
    +
    +

    + + ド・カステリョのアルゴリズム +

    +

    + ベジエ曲線を描く場合は、tの値を0から1まで動かしながら重みつき基底関数を計算し、プロットに必要なx/yの値を求めます。しかし、曲線が複雑になればなるほど、計算コストがかかるようになってしまいます。そこでその代わりに、「ド・カステリョのアルゴリズム」を使って曲線を描くこともできます。こちらは幾何学的に曲線を描く方法で、実装も非常に簡単です。実際、鉛筆と定規を使って手描きすることもできるほど、とても簡単な方法なのです。 +

    +

    x/yの値を求めるために関数を使うのではなく、次のようにします。

    +
      +
    • tを(そのまま)比率として考えます。t=0は直線上の0%の位置、t=1は100%の位置です。
    • +
    • 曲線を定める点同士を結ぶように、それぞれ直線を引きます。n次の曲線であれば、n本の直線を引きます。
    • +
    • + 各直線上において、距離がtとなる点にそれぞれ印をつけます。例えばtが0.2なら、始点から20%、終点から80%の位置になります。 +
    • +
    • 今度は、それらの点同士を直線で結びます。n-1本の直線が得られます。
    • +
    • 各直線上において、距離がtとなる点にそれぞれ印をつけます。
    • +
    • それらの点同士を直線で結びます。n-2本の直線が得られます。
    • +
    • 印をつけ、直線で結び、印をつけ、……
    • +
    • 1本の直線になるまで繰り返します。その直線上のtの点は、元の曲線上でtとなる点に一致しています。
    • +
    +

    + 下の図にマウスを乗せると、この様子を実際に見ることができます。ド・カステリョのアルゴリズムによって曲線上の点を明示的に計算していますが、マウスを動かすと求める点が変わります。マウスカーソルを左から右へ(もちろん、右から左へでも)動かせば、このアルゴリズムによって曲線が生成される様子がわかります。 +

    + + + Scripts are disabled. Showing fallback image. + + + + + +
    +

    ド・カステリョのアルゴリズムの実装方法

    +

    いま説明したアルゴリズムを実装すると、以下のようになります。

    -
    -
    -

    ド・カステリョのアルゴリズム

    -

    ベジエ曲線を描く場合は、tの値を0から1まで動かしながら重みつき基底関数を計算し、プロットに必要なx/yの値を求めます。しかし、曲線が複雑になればなるほど、計算コストがかかるようになってしまいます。そこでその代わりに、「ド・カステリョのアルゴリズム」を使って曲線を描くこともできます。こちらは幾何学的に曲線を描く方法で、実装も非常に簡単です。実際、鉛筆と定規を使って手描きすることもできるほど、とても簡単な方法なのです。

    -

    x/yの値を求めるために関数を使うのではなく、次のようにします。

    -
      -
    • tを(そのまま)比率として考えます。t=0は直線上の0%の位置、t=1は100%の位置です。
    • -
    • 曲線を定める点同士を結ぶように、それぞれ直線を引きます。n次の曲線であれば、n本の直線を引きます。
    • -
    • 各直線上において、距離がtとなる点にそれぞれ印をつけます。例えばtが0.2なら、始点から20%、終点から80%の位置になります。
    • -
    • 今度は、それらの点同士を直線で結びます。n-1本の直線が得られます。
    • -
    • 各直線上において、距離がtとなる点にそれぞれ印をつけます。
    • -
    • それらの点同士を直線で結びます。n-2本の直線が得られます。
    • -
    • 印をつけ、直線で結び、印をつけ、……
    • -
    • 1本の直線になるまで繰り返します。その直線上のtの点は、元の曲線上でtとなる点に一致しています。
    • -
    -

    下の図にマウスを乗せると、この様子を実際に見ることができます。ド・カステリョのアルゴリズムによって曲線上の点を明示的に計算していますが、マウスを動かすと求める点が変わります。マウスカーソルを左から右へ(もちろん、右から左へでも)動かせば、このアルゴリズムによって曲線が生成される様子がわかります。

    - - - Scripts are disabled. Showing fallback image. - - - - - -
    - -

    ド・カステリョのアルゴリズムの実装方法

    -

    いま説明したアルゴリズムを実装すると、以下のようになります。

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    これで実装完了です。ただし、+演算子のオーバーロードなどという贅沢品はたいてい無いでしょうから、xyの値を直接扱う場合のコードも示しておきます。

    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    さて、これは何をしているのでしょう?関数に渡す点のリストが長さ1であれば、点を1つ描きます。それ以外であれば、比率tの位置の点(すなわち、さきほどの説明に出てきた「印」)のリストを作り、そしてこの新しいリストを引数にして関数を呼び出します。

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    簡略化した描画

    -

    曲線を複数点で「サンプリング」し、さらにそれを直線で繫げてしまえば、描画の手順を簡略化することができます。単なる一連の直線、つまり「平坦」な線へと曲線を単純化するので、この処理は「平坦化」という名前で知られています。

    -

    例えば「X個の線分がほしい」場合には、分割数がそうなるようにサンプリング間隔を選び、曲線をサンプリングします。この方法の利点は速さです。曲線の座標を100個だの1000個だの計算するのではなく、ずっと少ない回数のサンプリングでも、十分きれいに見えるような曲線を作ることができるのです。欠点はもちろん、「本物の曲線」に比べて精度が損なわれてしまうことです。したがって、交点の検出や曲線の位置揃えを正しく行いたい場合には、平坦化した曲線は普通利用できません。

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
    +

    + さて、これは何をしているのでしょう?関数に渡す点のリストが長さ1であれば、点を1つ描きます。それ以外であれば、比率tの位置の点(すなわち、さきほどの説明に出てきた「印」)のリストを作り、そしてこの新しいリストを引数にして関数を呼び出します。 +

    + +
    +
    +

    + + 簡略化した描画 +

    +

    + 曲線を複数点で「サンプリング」し、さらにそれを直線で繫げてしまえば、描画の手順を簡略化することができます。単なる一連の直線、つまり「平坦」な線へと曲線を単純化するので、この処理は「平坦化」という名前で知られています。 +

    +

    + 例えば「X個の線分がほしい」場合には、分割数がそうなるようにサンプリング間隔を選び、曲線をサンプリングします。この方法の利点は速さです。曲線の座標を100個だの1000個だの計算するのではなく、ずっと少ない回数のサンプリングでも、十分きれいに見えるような曲線を作ることができるのです。欠点はもちろん、「本物の曲線」に比べて精度が損なわれてしまうことです。したがって、交点の検出や曲線の位置揃えを正しく行いたい場合には、平坦化した曲線は普通利用できません。 +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
    -

    2次ベジエ曲線も3次ベジエ曲線も、図をクリックして上下キーを押すと曲線の分割数が増減しますので、試してみてください。ある曲線では分割数が少なくてもうまくいきますが、曲線が複雑になればなるほど、曲率の変化を正確に捉えるためにはより多くの分割数が必要になることがわかります(3次ベジエ曲線で試してみてください)。

    -
    +

    + 2次ベジエ曲線も3次ベジエ曲線も、図をクリックして上下キーを押すと曲線の分割数が増減しますので、試してみてください。ある曲線では分割数が少なくてもうまくいきますが、曲線が複雑になればなるほど、曲率の変化を正確に捉えるためにはより多くの分割数が必要になることがわかります(3次ベジエ曲線で試してみてください)。 +

    +
    +

    曲線平坦化の実装方法

    +

    上でいま解説したアルゴリズムを使って実装するだけです。

    -

    曲線平坦化の実装方法

    -

    上でいま解説したアルゴリズムを使って実装するだけです。

    - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    これで完了です。実装できました。あとは、一連の直線で結果の「曲線」を描画するだけです。

    + return coordinates; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    先頭の座標を参照点にしてスタートし、あとはそれぞれの点からその次の点へと、直線を引いていくだけです。

    -
    + coord = _coord + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -
    -
    -

    曲線の分割

    -

    ベジエ曲線を分割して、繫ぎ合わせたときに元に戻るような小さい2曲線にしたい場合にも、ド・カステリョのアルゴリズムを使えば、これに必要な点をすべて求めることができます。ある値tに対してド・カステリョの骨格を組み立てると、そのtで曲線を分割する際に必要になる点がすべて得られます。骨格内部の点のうち、曲線上の点から見て手前側にある点によって一方の曲線が定義され、向こう側にある点によってもう一方の曲線が定義されます。

    - - - Scripts are disabled. Showing fallback image. - - - - - -
    +

    先頭の座標を参照点にしてスタートし、あとはそれぞれの点からその次の点へと、直線を引いていくだけです。

    +
    +
    +
    +

    + + 曲線の分割 +

    +

    + ベジエ曲線を分割して、繫ぎ合わせたときに元に戻るような小さい2曲線にしたい場合にも、ド・カステリョのアルゴリズムを使えば、これに必要な点をすべて求めることができます。ある値tに対してド・カステリョの骨格を組み立てると、そのtで曲線を分割する際に必要になる点がすべて得られます。骨格内部の点のうち、曲線上の点から見て手前側にある点によって一方の曲線が定義され、向こう側にある点によってもう一方の曲線が定義されます。 +

    + + + Scripts are disabled. Showing fallback image. + + + + + +
    +

    曲線分割の実装方法

    +

    ド・カステリョの関数の中に記録する処理を追加すれば、曲線の分割が実装できます。

    -

    曲線分割の実装方法

    -

    ド・カステリョの関数の中に記録する処理を追加すれば、曲線の分割が実装できます。

    - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    - -

    ある値tに対してこの関数を実行すると、leftrightに新しい2曲線の座標が入ります。一方はtの「左」側、もう一方は「右」側の曲線です。この2曲線は元の曲線と同じ次数になり、また元の曲線とぴったり重なります。

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + -
    -
    -

    行列による曲線の分割

    -

    曲線分割には、ベジエ曲線の行列表現を利用する方法もあります。行列についての節では、行列の乗算で曲線が表現できることを確認しました。特に2次・3次のベジエ曲線に関しては、それぞれ以下のような形になりました(読みやすさのため、ベジエの係数ベクトルを反転させています)。

    - - -

    ならびに

    - - -

    曲線をある点t = zで分割し、新しく2つの(自明ですが、より短い)ベジエ曲線を作ることを考えましょう。曲線の行列表現と線形代数を利用すると、この2つのベジエ曲線の座標を求めることができます。まず、実際の「曲線上の点」の情報を分解し、新しい行列の積のかたちにします。

    - - -

    ならびに

    - - -

    これらの行列をまとめて、仮に**[tの値たち] · [ベジエ行列] · [列ベクトル]**の形式にできたとしましょう。ただし、先頭2つの行列は変わらずそのままだとします。このとき、右端の列ベクトルは、前半部分すなわちt = 0からt = zを表す、新しいベジエ曲線の座標となります。結論からいうと、線形代数の簡単な規則を使えば、この変形は非常に容易です(そして、導出過程を気にしないのであれば、囲みの末尾まで飛ばして結果に行ってもかまいません!)。

    -
    - -

    新しい凸包の座標の導出

    -

    曲線を分割して2つの部分を得るためには、いくつかの段階を経る必要があります。曲線の次数が高くなるほど手間がかかるようになりますので、まずは2次の曲線で見てみましょう。

    - - - - - - - - -

    [M · M-1]は単位行列なので、このような操作ができるのです(これは微積分でいえば、なにかにx/xを掛けるようなものです。関数自体はなにも変わりませんが、解きやすいかたちに変形したり、別のかたちに分解したりといったことが可能になります)。この行列を掛けると、式全体としてはなにも変わりませんが、[なにか · M]という行列の並びを[M · なにか]という並びに変えることができます。そして、これが大きな違いを生み出します。[M-1 · Z · M]の値が分かれば、それを座標に掛け合わせることによって、2次ベジエ曲線の正しい行列表現(すなわち[T · M · P])と、t = 0からt = zまでの曲線を表す座標の組とが得られます。では、計算してみましょう。

    - - -

    いいですね!これで、新しい2次ベジエ曲線が得られます。

    - - - - - - -

    すばらしいt = 0からt = zの部分曲線を求める場合、始点の座標はそのままになります(もっともです)。制御点は、元々の制御点と始点を、比率zで混ぜ合わせたものになります。そして不思議なことに、新たな終点は2次のベルンシュタイン多項式に似た混ぜ合わせになります。これらの新しい座標は、とても簡単に直接計算ができるのです!

    -

    もちろん、これは2曲線のうちの片方にすぎません。t = zからt = 1の部分を得るためには、同様の計算をする必要があります。まず、今さっき行ったのは、一般の区間[0,z]についての計算でした。これは0があるので簡単な形になっていましたが、実際には、次の式を計算していたということがわかります。

    - - - - -

    区間[z,1]を求めたい場合は、かわりに次のような計算になります。

    - - - - -

    先ほどと同じ手法を使い、[なにか · M]を[M · なにか]に変えます。

    - - -

    よって、後半部分の曲線は結局のところ以下のようになります。

    - - - - - - -

    おみごと。先ほどと同じようになっていることがわかります。終点の座標はそのままで(もっともです)、制御点は、元々の制御点と終点を比率zで混ぜ合わせたものになります。そして不思議なことに、新たな始点は2次のベルンシュタイン多項式に似た混ぜ合わせになります。ただし、(1-z)の代わりに(z-1)になっています。これらの新しい座標、とても簡単に直接計算ができるのです!

    -
    + +

    + おみごと。先ほどと同じようになっていることがわかります。終点の座標はそのままで(もっともです)、制御点は、元々の制御点と終点を比率zで混ぜ合わせたものになります。そして不思議なことに、新たな始点は2次のベルンシュタイン多項式に似た混ぜ合わせになります。ただし、(1-z)の代わりに(z-1)になっています。これらの新しい座標、とても簡単に直接計算ができるのです! +

    + -

    というわけで、ド・カステリョのアルゴリズムではなく線形代数の方を使うと、どのような2次ベジエ曲線でもある値t = zで分割すれば2つのベジエ曲線となり、しかもその座標は簡単に求められる、ということがわかりました。

    - - -

    および

    - - -

    3次の曲線についても同様です。ただし、実際の導出はあなたにとっておきますので(自力で書き下してみてください)、新しい座標の組の結果を示すだけにします。

    - - -

    および

    - - -

    さて、これらの行列を見るに、後半部分の曲線の行列は本当に計算する必要があったのでしょうか?いえ、ありませんでした。片方の行列が得られれば、実はもう一方の行列も暗に得られたことになります。まず、行列 Q の各行の値を右側に寄せ、右側にあった0を左側に押しのけます。次に行列を上下に反転させます。これでたちまち Q' が「計算」できるのです。

    -

    この方法で曲線の分割を実装すれば、再帰が少なくて済みます。また、数値のキャッシュを利用した単純な演算になるので、再帰の計算コストが大きいシステムにおいては、コストが抑えられるかもしれません。行列の乗算に適したデバイスで計算を行えば、ド・カステリョのアルゴリズムに比べてかなり速くなるでしょう。

    - -
    -
    -

    Lowering and elevating curve order

    -

    One interesting property of Bézier curves is that an nth order curve can always be perfectly represented by an (n+1)th order curve, by giving the higher-order curve specific control points.

    -

    If we have a curve with three points, then we can create a curve with four points that exactly reproduces the original curve. First, we give it the same start and end points, and for its two control points we pick "1/3rd start + 2/3rd control" and "2/3rd control + 1/3rd end". Now we have exactly the same curve as before, except represented as a cubic curve rather than a quadratic curve.

    -

    The general rule for raising an nth order curve to an (n+1)th order curve is as follows (observing that the start and end weights are the same as the start and end weights for the old curve):

    - - -

    However, this rule also has as direct consequence that you cannot generally safely lower a curve from nth order to (n-1)th order, because the control points cannot be "pulled apart" cleanly. We can try to, but the resulting curve will not be identical to the original, and may in fact look completely different.

    -

    However, there is a surprisingly good way to ensure that a lower order curve looks "as close as reasonably possible" to the original curve: we can optimise the "least-squares distance" between the original curve and the lower order curve, in a single operation (also explained over on Sirver's Castle). However, to use it, we'll need to do some calculus work and then switch over to linear algebra. As mentioned in the section on matrix representations, some things can be done much more easily with matrices than with calculus functions, and this is one of those things. So... let's go!

    -

    We start by taking the standard Bézier function, and condensing it a little:

    - - -

    Then, we apply one of those silly (actually, super useful) calculus tricks: since our t value is always between zero and one (inclusive), we know that (1-t) plus t always sums to 1. As such, we can express any value as a sum of t and 1-t:

    - - -

    So, with that seemingly trivial observation, we rewrite that Bézier function by splitting it up into a sum of a (1-t) and t component:

    - - -

    So far so good. Now, to see why we did this, let's write out the (1-t) and t parts, and see what that gives us. I promise, it's about to make sense. We start with (1-t):

    - - -

    So by using this seemingly silly trick, we can suddenly express part of our nth order Bézier function in terms of an (n+1)th order Bézier function. And that sounds a lot like raising the curve order! Of course we need to be able to repeat that trick for the t part, but that's not a problem:

    - - -

    So, with both of those changed from an order n expression to an order (n+1) expression, we can put them back together again. Now, where the order n function had a summation from 0 to n, the order n+1 function uses a summation from 0 to n+1, but this shouldn't be a problem as long as we can add some new terms that "contribute nothing". In the next section on derivatives, there is a discussion about why "higher terms than there is a binomial for" and "lower than zero terms" both "contribute nothing". So as long as we can add terms that have the same form as the terms we need, we can just include them in the summation, they'll sit there and do nothing, and the resulting function stays identical to the lower order curve.

    -

    Let's do this:

    - - -

    And this is where we switch over from calculus to linear algebra, and matrices: we can now express this relation between Bézier(n,t) and Bézier(n+1,t) as a very simple matrix multiplication:

    - - -

    where the matrix M is an n+1 by n matrix, and looks like:

    - - -

    That might look unwieldy, but it's really just a mostly-zeroes matrix, with a very simply fraction on the diagonal, and an even simpler fraction to the left of it. Multiplying a list of coordinates with this matrix means we can plug the resulting transformed coordinates into the one-order-higher function and get an identical looking curve.

    -

    Not too bad!

    -

    Equally interesting, though, is that with this matrix operation established, we can now use an incredibly powerful and ridiculously simple way to find out a "best fit" way to reverse the operation, called the normal equation. What it does is minimize the sum of the square differences between one set of values and another set of values. Specifically, if we can express that as some function A x = b, we can use it. And as it so happens, that's exactly what we're dealing with, so:

    - - -

    The steps taken here are:

    -
      -
    1. We have a function in a form that the normal equation can be used with, so
    2. -
    3. apply the normal equation!
    4. -
    5. Then, we want to end up with just Bn on the left, so we start by left-multiply both sides such that we'll end up with lots of stuff on the left that simplified to "a factor 1", which in matrix maths is the identity matrix.
    6. -
    7. In fact, by left-multiplying with the inverse of what was already there, we've effectively "nullified" (but really, one-inified) that big, unwieldy block into the identity matrix I. So we substitute the mess with I, and then
    8. -
    9. because multiplication with the identity matrix does nothing (like multiplying by 1 does nothing in regular algebra), we just drop it.
    10. -
    -

    And we're done: we now have an expression that lets us approximate an n+1th order curve with a lower nth order curve. It won't be an exact fit, but it's definitely a best approximation. So, let's implement these rules for raising and lowering curve order to a (semi) random curve, using the following graphic. Select the sketch, which has movable control points, and press your up and down arrow keys to raise or lower the curve order.

    - - - Scripts are disabled. Showing fallback image. - - - - - - - -
    -
    -

    Derivatives

    -

    There's a number of useful things that you can do with Bézier curves based on their derivative, and one of the more amusing observations about Bézier curves is that their derivatives are, in fact, also Bézier curves. In fact, the differentiation of a Bézier curve is relatively straightforward, although we do need a bit of math.

    -

    First, let's look at the derivative rule for Bézier curves, which is:

    - - -

    which we can also write (observing that b in this formula is the same as our w weights, and that n times a summation is the same as a summation where each term is multiplied by n) as:

    - - -

    Or, in plain text: the derivative of an nth degree Bézier curve is an (n-1)th degree Bézier curve, with one fewer term, and new weights w'0...w'n-1 derived from the original weights as n(wi+1 - wi). So for a 3rd degree curve, with four weights, the derivative has three new weights: w'0 = 3(w1-w0), w'1 = 3(w2-w1) and w'2 = 3(w3-w2).

    -
    - -

    "Slow down, why is that true?"

    -

    Sometimes just being told "this is the derivative" is nice, but you might want to see why this is indeed the case. As such, let's have a look at the proof for this derivative. First off, the weights are independent of the full Bézier function, so the derivative involves only the derivative of the polynomial basis function. So, let's find that:

    - - -

    Applying the product and chain rules gives us:

    - - -

    Which is hard to work with, so let's expand that properly:

    - - -

    Now, the trick is to turn this expression into something that has binomial coefficients again, so we want to end up with things that look like "x! over y!(x-y)!". If we can do that in a way that involves terms of n-1 and k-1, we'll be on the right track.

    - - -

    And that's the first part done: the two components inside the parentheses are actually regular, lower-order Bézier expressions:

    - - -

    Now to apply this to our weighted Bézier curves. We'll write out the plain curve formula that we saw earlier, and then work our way through to its derivative:

    - - -

    If we expand this (with some color to show how terms line up), and reorder the terms by increasing values for k we see the following:

    - - -

    Two of these terms fall way: the first term falls away because there is no -1st term in a summation. As such, it always contributes "nothing", so we can safely completely ignore it for the purpose of finding the derivative function. The other term is the very last term in this expansion: one involving Bn-1,n. This term would have a binomial coefficient of [i choose i+1], which is a non-existent binomial coefficient. Again, this term would contribute "nothing", so we can ignore it, too. This means we're left with:

    - - -

    And that's just a summation of lower order curves:

    - - -

    We can rewrite this as a normal summation, and we're done:

    - - -
    + + -

    Let's rewrite that in a form similar to our original formula, so we can see the difference. We will first list our original formula for Bézier curves, and then the derivative:

    - - - - -

    What are the differences? In terms of the actual Bézier curve, virtually nothing! We lowered the order (rather than n, it's now n-1), but it's still the same Bézier function. The only real difference is in how the weights change when we derive the curve's function. If we have four points A, B, C, and D, then the derivative will have three points, the second derivative two, and the third derivative one:

    - - -

    We can keep performing this trick for as long as we have more than one weight. Once we have one weight left, the next step will see k = 0, and the result of our "Bézier function" summation is zero, because we're not adding anything at all. As such, a quadratic curve has no second derivative, a cubic curve has no third derivative, and generalized: an nth order curve has n-1 (meaningful) derivatives, with any further derivative being zero.

    - -
    -
    -

    Tangents and normals

    -

    If you want to move objects along a curve, or "away from" a curve, the two vectors you're most interested in are the tangent vector and normal vector for curve points. These are actually really easy to find. For moving and orienting along a curve, we use the tangent, which indicates the direction of travel at specific points, and is literally just the first derivative of our curve:

    - - -

    This gives us the directional vector we want. We can normalize it to give us uniform directional vectors (having a length of 1.0) at each point, and then do whatever it is we want to do based on those directions:

    - - -

    The tangent is very useful for moving along a line, but what if we want to move away from the curve instead, perpendicular to the curve at some point t? In that case we want the normal vector. This vector runs at a right angle to the direction of the curve, and is typically of length 1.0, so all we have to do is rotate the normalized directional vector and we're done:

    - - -
    - -

    Rotating coordinates is actually very easy, if you know the rule for it. You might find it explained as "applying a rotation matrix, which is what we'll look at here, too. Essentially, the idea is to take the circles over which we can rotate, and simply "sliding the coordinates" over these circles by the desired -angle. If we want a quarter circle turn, we take the coordinate, slide it along the circle by a quarter turn, and done.

    -

    To turn any point (x,y) into a rotated point (x',y') (over 0,0) by some angle φ, we apply this nice and easy computation:

    - - -

    Which is the "long" version of the following matrix transformation:

    - - -

    And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1.

    -

    But why does this work? Why this matrix multiplication? Wikipedia (technically, Thomas Herter and Klaus Lott) tells us that a rotation matrix can be -treated as a sequence of three (elementary) shear operations. When we combine this into a single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. DataGenetics have an excellent article about this very thing: it's really quite cool, and I strongly recommend taking a quick break from this primer to read that article.

    -
    + +

    + And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even + easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1. +

    +

    + But why does this work? Why this matrix multiplication? + Wikipedia (technically, Thomas Herter and Klaus + Lott) tells us that a rotation matrix can be treated as a sequence of three (elementary) shear operations. When we combine this into a + single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. + DataGenetics have an excellent article about this very thing: it's + really quite cool, and I strongly recommend taking a quick break from this primer to read that article. +

    + -

    The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant).

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    +

    + The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the + normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant). +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    +
    +
    +

    + + Working with 3D normals +

    +

    + Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things + this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but + for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, + it's not "super hard", but there are more steps involved and we should have a look at those. +

    +

    + Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. + However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, + not a single vector, so we basically need to define what "the" normal is in the 3D case. +

    +

    + The "naïve" approach is to construct what is known as the + Frenet normal, where we follow a simple recipe that works in + many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are + perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each + point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the + local tangent, as well as the tangents "right next to it". +

    +

    + Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the + vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know + lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same + logic we used for the 2D case: "just rotate it 90 degrees". +

    +

    So let's do that! And in a twist surprise, we can do this in four lines:

    +
      +
    • a = normalize(B'(t))
    • +
    • b = normalize(a + B''(t))
    • +
    • r = normalize(b × a)
    • +
    • normal = normalize(r × a)
    • +
    +

    Let's unpack that a little:

    +
      +
    • + We start by taking the normalized vector for the derivative at some point on the + curve. We normalize it so the maths is less work. Less work is good. +
    • +
    • + Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and + just had the same derivative and second derivative from that point on. +
    • +
    • + This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which + means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the + cross product. (Note that while that operation uses the × operator, it's most + definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent + a quarter circle to get our normal, just like we did in the 2D case. +
    • +
    • + Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal + vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second + time, and immediately get our normal vector. +
    • +
    +

    + And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can + move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor + position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc: +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the + curve" between t=0.65 and t=0.75... Why is it doing that? +

    +

    + As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are + "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute + normals that just... look good. +

    +

    Thankfully, Frenet normals are not our only option.

    +

    + Another option is to take a slightly more algorithmic approach and compute a form of + Rotation Minimising Frame + (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational + axis, and the normal vector, centered on an on-curve point. +

    +

    + These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we + could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at + the same time that you're building lookup tables for your curve. +

    +

    + The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by + applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are: +

    +
      +
    • Take a point on the curve for which we know the RM frame already,
    • +
    • take a next point on the curve for which we don't know the RM frame yet, and
    • +
    • + reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous + points as a "mirror". +
    • +
    • + This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal + that's slightly off-kilter, so: +
    • +
    • + reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror". +
    • +
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • +
    +

    So, let's write some code for that!

    +
    +

    Implementing Rotation Minimising Frames

    +

    + We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it + yields a frame with properties: +

    -
    -
    -

    Working with 3D normals

    -

    Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, it's not "super hard", but there are more steps involved and we should have a look at those.

    -

    Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, not a single vector, so we basically need to define what "the" normal is in the 3D case.

    -

    The "naïve" approach is to construct what is known as the Frenet normal, where we follow a simple recipe that works in many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the local tangent, as well as the tangents "right next to it".

    -

    Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same logic we used for the 2D case: "just rotate it 90 degrees".

    -

    So let's do that! And in a twist surprise, we can do this in four lines:

    -
      -
    • a = normalize(B'(t))
    • -
    • b = normalize(a + B''(t))
    • -
    • r = normalize(b × a)
    • -
    • normal = normalize(r × a)
    • -
    -

    Let's unpack that a little:

    -
      -
    • We start by taking the normalized vector for the derivative at some point on the curve. We normalize it so the maths is less work. Less work is good.
    • -
    • Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and just had the same derivative and second derivative from that point on.
    • -
    • This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the cross product. (Note that while that operation uses the × operator, it's most definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent a quarter circle to get our normal, just like we did in the 2D case.
    • -
    • Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second time, and immediately get our normal vector.
    • -
    -

    And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the curve" between t=0.65 and t=0.75... Why is it doing that?

    -

    As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute normals that just... look good.

    -

    Thankfully, Frenet normals are not our only option.

    -

    Another option is to take a slightly more algorithmic approach and compute a form of Rotation Minimising Frame (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational axis, and the normal vector, centered on an on-curve point.

    -

    These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at the same time that you're building lookup tables for your curve.

    -

    The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are:

    -
      -
    • Take a point on the curve for which we know the RM frame already,
    • -
    • take a next point on the curve for which we don't know the RM frame yet, and
    • -
    • reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous points as a "mirror".
    • -
    • This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal that's slightly off-kilter, so:
    • -
    • reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror".
    • -
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • -
    -

    So, let's write some code for that!

    -
    - -

    Implementing Rotation Minimising Frames

    -

    We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it yields a frame with properties:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    - -

    Then, we can write a function that generates a sequence of RM frames in the following manner:

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    - -

    Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more code to get much better looking normals.

    -
    + frames.add(x1) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + -

    Speaking of better looking, what does this actually look like? Let's revisit that earlier curve, but this time use rotation minimising frames rather than Frenet frames:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    That looks so much better!

    -

    For those reading along with the code: we don't even strictly speaking need a Frenet frame to start with: we could, for instance, treat the z-axis as our initial axis of rotation, so that our initial normal is (0,0,1) × tangent, and then take things from there, but having that initial "mathematically correct" frame so that the initial normal seems to line up based on the curve's orientation in 3D space is just nice.

    +

    + Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more + code to get much better looking normals. +

    + -
    -
    -

    Component functions

    -

    One of the first things people run into when they start using Bézier curves in their own programs is "I know how to draw the curve, but how do I determine the bounding box?". It's actually reasonably straightforward to do so, but it requires having some knowledge on exploiting math to get the values we need. For bounding boxes, we aren't actually interested in the curve itself, but only in its "extremities": the minimum and maximum values the curve has for its x- and y-axis values. If you remember your calculus (provided you ever took calculus, otherwise it's going to be hard to remember) we can determine function extremities using the first derivative of that function, but this poses a problem, since our function is parametric: every axis has its own function.

    -

    The solution: compute the derivative for each axis separately, and then fit them back together in the same way we do for the original.

    -

    Let's look at how a parametric Bézier curve "splits up" into two normal functions, one for the x-axis and one for the y-axis. Note the leftmost figure is again an interactive curve, without labeled axes (you get coordinates in the graph instead). The center and rightmost figures are the component functions for computing the x-axis value, given a value for t (between 0 and 1 inclusive), and the y-axis value, respectively.

    -

    If you move points in a curve sideways, you should only see the middle graph change; likewise, moving points vertically should only show a change in the right graph.

    - - - Scripts are disabled. Showing fallback image. - - - -

     

    - - - Scripts are disabled. Showing fallback image. - - - - -
    -
    -

    Finding extremities: root finding

    -

    Now that we understand (well, superficially anyway) the component functions, we can find the extremities of our Bézier curve by finding maxima and minima on the component functions, by solving the equation B'(t) = 0. We've already seen that the derivative of a Bézier curve is a simpler Bézier curve, but how do we solve the equality? Fairly easily, actually, until our derivatives are 4th order or higher... then things get really hard. But let's start simple:

    -

    Quadratic curves: linear derivatives.

    -

    The derivative of a quadratic Bézier curve is a linear Bézier curve, interpolating between just two terms, which means finding the solution for "where is this line 0" is effectively trivial by rewriting it to a function of t and solving. First we turn our quadratic Bézier function into a linear one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then we turn this into our solution for t using basic arithmetics:

    - - -

    Done.

    -

    Although with the caveat that if b-a is zero, there is no solution and we probably shouldn't try to perform that division.

    -

    Cubic curves: the quadratic formula.

    -

    The derivative of a cubic Bézier curve is a quadratic Bézier curve, and finding the roots for a quadratic polynomial means we can apply the Quadratic formula. If you've seen it before, you'll remember it, and if you haven't, it looks like this:

    - - -

    So, if we can rewrite the Bézier component function as a plain polynomial, we're done: we just plug in the values into the quadratic formula, check if that square root is negative or not (if it is, there are no roots) and then just compute the two values that come out (because of that plus/minus sign we get two). Any value between 0 and 1 is a root that matters for Bézier curves, anything below or above that is irrelevant (because Bézier curves are only defined over the interval [0,1]). So, how do we convert?

    -

    First we turn our cubic Bézier function into a quadratic one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then, using these v values, we can find out what our a, b, and c should be:

    - - -

    This gives us three coefficients {a, b, c} that are expressed in terms of v values, where the v values are expressions of our original coordinate values, so we can do some substitution to get:

    - - -

    Easy-peasy. We can now almost trivially find the roots by plugging those values into the quadratic formula.

    -

    And as a cubic curve, there is also a meaningful second derivative, which we can compute by simple taking the derivative of the derivative.

    -

    Quartic curves: Cardano's algorithm.

    -

    We haven't really looked at them before now, but the next step up would be a Quartic curve, a fourth degree Bézier curve. As expected, these have a derivative that is a cubic function, and now things get much harder. Cubic functions don't have a "simple" rule to find their roots, like the quadratic formula, and instead require quite a bit of rewriting to a form that we can even start to try to solve.

    -

    Back in the 16th century, before Bézier curves were a thing, and even before calculus itself was a thing, Gerolamo Cardano figured out that even if the general cubic function is really hard to solve, it can be rewritten to a form for which finding the roots is "easier" (even if not "easy"):

    - - -

    We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather than three: this makes things considerably easier to solve because it lets us use regular calculus to find the values that satisfy the equation.

    -

    Now, there is one small hitch: as a cubic function, the solutions may be complex numbers rather than plain numbers... And Cardano realised this, centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number.

    -

    So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at Ken J. Ward's page for solving the cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a read-through.

    -
    + +

    + We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather + than three: this makes things considerably easier to solve because it lets us use + regular calculus to find the values that satisfy the equation. +

    +

    + Now, there is one small hitch: as a cubic function, the solutions may be + complex numbers rather than plain numbers... And Cardano realised this, + centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these + numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we + have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're + going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number. +

    +

    + So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at + Ken J. Ward's page for solving the + cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the + complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a + read-through. +

    +
    +

    Implementing Cardano's algorithm for finding all real roots

    +

    + The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" + number space (denoted with ℝ), this is perfectly fine in the + "complex" number space (denoted with ℂ). And, as it so happens, Cardano is + also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm! +

    -

    Implementing Cardano's algorithm for finding all real roots

    -

    The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" number space (denoted with ℝ), this is perfectly fine in the "complex" number space (denoted with ℂ). And, as it so happens, Cardano is also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm!

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    37
    38
    39
    40
    41
    42
    43
    44
    45
    46
    47
    48
    49
    50
    51
    52
    53
    54
    55
    56
    57
    58
    59
    60
    61
    62
    63
    64
    65
    66
    67
    68
    69
    70
    71
    72
    73
    74
    75
    76
    77
    78
    79
    80
    81
    - -
    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + + 37 + + + 38 + + + 39 + + + 40 + + + 41 + + + 42 + + + 43 + + + 44 + + + 45 + + + 46 + + + 47 + + + 48 + + + 49 + + + 50 + + + 51 + + + 52 + + + 53 + + + 54 + + + 55 + + + 56 + + + 57 + + + 58 + + + 59 + + + 60 + + + 61 + + + 62 + + + 63 + + + 64 + + + 65 + + + 66 + + + 67 + + + 68 + + + 69 + + + 70 + + + 71 + + + 72 + + + 73 + + + 74 + + + 75 + + + 76 + + + 77 + + + 78 + + + 79 + + + 80 + + + 81 + + +
    -

    And that's it. The maths is complicated, but the code is pretty much just "follow the maths, while caching as many values as we can to prevent recomputing things as much as possible" and now we have a way to find all roots for a cubic function and can just move on with using that to find extremities of our curves.

    -

    And of course, as a quartic curve also has meaningful second and third derivatives, we can quite easily compute those by using the derivative of the derivative (of the derivative), just as for cubic curves.

    -

    Quintic and higher order curves: finding numerical solutions

    -

    And this is where thing stop, because we cannot find the roots for polynomials of degree 5 or higher using algebra (a fact known as the Abel–Ruffini theorem). Instead, for occasions like these, where algebra simply cannot yield an answer, we turn to numerical analysis.

    -

    That's a fancy term for saying "rather than trying to find exact answers by manipulating symbols, find approximate answers by describing the underlying process as a combination of steps, each of which can be assigned a number via symbolic manipulation". For example, trying to mathematically compute how much water fits in a completely crazy three dimensional shape is very hard, even if it got you the perfect, precise answer. A much easier approach, which would be less perfect but still entirely useful, would be to just grab a buck and start filling the shape until it was full: just count the number of buckets of water you used. And if we want a more precise answer, we can use smaller buckets.

    -

    So that's what we're going to do here, too: we're going to treat the problem as a sequence of steps, and the smaller we can make each step, the closer we'll get to that "perfect, precise" answer. And as it turns out, there is a really nice numerical root-finding algorithm, called the Newton-Raphson root finding method (yes, after that Newton), which we can make use of. The Newton-Raphson approach consists of taking our impossible-to-solve function f(x), picking some initial value x (literally any value will do), and calculating f(x). We can think of that value as the "height" of the function at x. If that height is zero, we're done, we have found a root. If it isn't, we calculate the tangent line at f(x) and calculate at which x value its height is zero (which we've already seen is very easy). That will give us a new x and we repeat the process until we find a root.

    -

    Mathematically, this means that for some x, at step n=1, we perform the following calculation until fy(x) is zero, so that the next t is the same as the one we already have:

    - - -

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    -

    Now, this works well only if we can pick good starting points, and our curve is continuously differentiable and doesn't have oscillations. Glossing over the exact meaning of those terms, the curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: which starting points do we pick?

    -

    As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as high as you'll ever need to go.

    -

    In conclusion:

    -

    So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red:

    - - - Scripts are disabled. Showing fallback image. - - - -

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    - - - Scripts are disabled. Showing fallback image. - - - + +

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    +

    + Now, this works well only if we can pick good starting points, and our curve is + continuously differentiable and doesn't have + oscillations. Glossing over the exact meaning of those terms, the + curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: + which starting points do we pick? +

    +

    + As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from + t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may + pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: + there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order + curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as + high as you'll ever need to go. +

    +

    In conclusion:

    +

    + So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those + roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red: +

    + + + Scripts are disabled. Showing fallback image. + + +

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    + + + Scripts are disabled. Showing fallback image. + + +
    +
    +

    + + Bounding boxes +

    +

    + If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four + values we need to box in our curve: +

    +

    Computing the bounding box for a Bézier curve:

    +
      +
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. +
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. +
    5. + Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original + functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct. +
    6. +
    +

    + Applying this approach to our previous root finding, we get the following + axis-aligned bounding boxes (with all curve extremity points + shown on the curve): +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -
    -
    -

    Bounding boxes

    -

    If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four values we need to box in our curve:

    -

    Computing the bounding box for a Bézier curve:

    -
      -
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. -
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. -
    5. Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct.
    6. -
    -

    Applying this approach to our previous root finding, we get the following axis-aligned bounding boxes (with all curve extremity points shown on the curve):

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    - -

    We can construct even nicer boxes by aligning them along our curve, rather than along the x- and y-axis, but in order to do so we first need to look at how aligning works.

    - -
    -
    -

    Aligning curves

    -

    While there are an incredible number of curves we can define by varying the x- and y-coordinates for the control points, not all curves are actually distinct. For instance, if we define a curve, and then rotate it 90 degrees, it's still the same curve, and we'll find its extremities in the same spots, just at different draw coordinates. As such, one way to make sure we're working with a "unique" curve is to "axis-align" it.

    -

    Aligning also simplifies a curve's functions. We can translate (move) the curve so that the first point lies on (0,0), which turns our n term polynomial functions into n-1 term functions. The order stays the same, but we have less terms. Then, we can rotate the curves so that the last point always lies on the x-axis, too, making its coordinate (...,0). This further simplifies the function for the y-component to an n-2 term function. For instance, if we have a cubic curve such as this:

    - - -

    Then translating it so that the first coordinate lies on (0,0), moving all x coordinates by -120, and all y coordinates by -160, gives us:

    - - -

    If we then rotate the curve so that its end point lies on the x-axis, the coordinates (integer-rounded for illustrative purposes here) become:

    - - -

    If we drop all the zero-terms, this gives us:

    - - -

    We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae:

    - - - Scripts are disabled. Showing fallback image. - - - -

     

    - - - Scripts are disabled. Showing fallback image. - - - + +

    + We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning + our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae: +

    + + + Scripts are disabled. Showing fallback image. + + +

     

    + + + Scripts are disabled. Showing fallback image. + + +
    +
    +

    + + Tight bounding boxes +

    +

    + With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, + recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding + box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T. +

    +

    We now have nice tight bounding boxes for our curves:

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -
    -
    -

    Tight bounding boxes

    -

    With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T.

    -

    We now have nice tight bounding boxes for our curves:

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    - -

    These are, strictly speaking, not necessarily the tightest possible bounding boxes. It is possible to compute the optimal bounding box by determining which spanning lines we need to effect a minimal box area, but because of the parametric nature of Bézier curves this is actually a rather costly operation, and the gain in bounding precision is often not worth it.

    - -
    -
    -

    Curve inflections

    -

    Now that we know how to align a curve, there's one more thing we can calculate: inflection points. Imagine we have a variable size circle that we can slide up against our curve. We place it against the curve and adjust its radius so that where it touches the curve, the curvatures of the curve and the circle are the same, and then we start to slide the circle along the curve - for quadratic curves, we can always do this without the circle behaving oddly: we might have to change the radius of the circle as we slide it along, but it'll always sit against the same side of the curve.

    -

    But what happens with cubic curves? Imagine we have an S curve and we place our circle at the start of the curve, and start sliding it along. For a while we can simply adjust the radius and things will be fine, but once we get to the midpoint of that S, something odd happens: the circle "flips" from one side of the curve to the other side, in order for the curvatures to keep matching. This is called an inflection, and we can find out where those happen relatively easily.

    -

    What we need to do is solve a simple equation:

    - - -

    What we're saying here is that given the curvature function C(t), we want to know for which values of t this function is zero, meaning there is no "curvature", which will be exactly at the point between our circle being on one side of the curve, and our circle being on the other side of the curve. So what does C(t) look like? Actually something that seems not too hard:

    - - -

    The function C(t) is the cross product between the first and second derivative functions for the parametric dimensions of our curve. And, as already shown, derivatives of Bézier curves are just simpler Bézier curves, with very easy to compute new coefficients, so this should be pretty easy.

    -

    However as we've seen in the section on aligning, aligning lets us simplify things a lot, by completely removing the contributions of the first coordinate from most mathematical evaluations, and removing the last y coordinate as well by virtue of the last point lying on the x-axis. So, while we can evaluate C(t) = 0 for our curve, it'll be much easier to first axis-align the curve and then evaluating the curvature function.

    -
    - -

    Let's derive the full formula anyway

    -

    Of course, before we do our aligned check, let's see what happens if we compute the curvature function without axis-aligning. We start with the first and second derivatives, given our basis functions:

    - - -

    And of course the same functions for y:

    - - -

    Asking a computer to now compose the C(t) function for us (and to expand it to a readable form of simple terms) gives us this rather overly complicated set of arithmetic expressions:

    - - -

    That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all disappear if we axis-align our curve, which is why aligning is a great idea.

    -
    + +

    + That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all + disappear if we axis-align our curve, which is why aligning is a great idea. +

    + -

    Aligning our curve so that three of the eight coefficients become zero, and observing that scale does not affect finding t values, we end up with the following simple term function for C(t):

    - - -

    That's a lot easier to work with: we see a fair number of terms that we can compute and then cache, giving us the following simplification:

    - - -

    This is a plain quadratic curve, and we know how to solve C(t) = 0; we use the quadratic formula:

    - - -

    We can easily compute this value if the discriminator isn't a negative number (because we only want real roots, not complex roots), and if x is not zero, because divisions by zero are rather useless.

    -

    Taking that into account, we compute t, we disregard any t value that isn't in the Bézier interval [0,1], and we now know at which t value(s) our curve will inflect.

    - - - Scripts are disabled. Showing fallback image. - - - - -
    -
    -

    The canonical form (for cubic curves)

    -

    While quadratic curves are relatively simple curves to analyze, the same cannot be said of the cubic curve. As a curvature is controlled by more than one control point, it exhibits all kinds of features like loops, cusps, odd colinear features, and as many as two inflection points because the curvature can change direction up to three times. Now, knowing what kind of curve we're dealing with means that some algorithms can be run more efficiently than if we have to implement them as generic solvers, so is there a way to determine the curve type without lots of work?

    -

    As it so happens, the answer is yes, and the solution we're going to look at was presented by Maureen C. Stone from Xerox PARC and Tony D. deRose from the University of Washington in their joint paper "A Geometric Characterization of Parametric Cubic curves". It was published in 1989, and defines curves as having a "canonical" form (i.e. a form that all curves can be reduced to) from which we can immediately tell what features a curve will have. So how does it work?

    -

    The first observation that makes things work is that if we have a cubic curve with four points, we can apply a linear transformation to these points such that three of the points end up on (0,0), (0,1) and (1,1), with the last point then being "somewhere". After applying that transformation, the location of that last point can then tell us what kind of curve we're dealing with. Specifically, we see the following breakdown:

    - - - Scripts are disabled. Showing fallback image. - - - -

    This is a fairly funky image, so let's see what the various parts of it mean...

    -

    We see the three fixed points at (0,0), (0,1) and (1,1). The various regions and boundaries indicate what property the original curve will have, if the fourth point is in/on that region or boundary. Specifically, if the fourth point is...

    -
      -
    1. ...anywhere inside the red zone, but not on its boundaries, the curve will be self-intersecting (yielding a loop). We won't know where it self-intersects (in terms of t values), but we are guaranteed that it does.

      -
    2. -
    3. ...on the left (red) edge of the red zone, the curve will have a cusp. We again don't know where, but we know there is one. This edge is described by the function:

      - - -
    4. -
    5. ...on the almost circular, lower right (pink) edge, the curve's end point touches the curve, forming a loop. This edge is described by the function:

      - - -
    6. -
    7. ...on the top (blue) edge, the curve's start point touches the curve, forming a loop. This edge is described by the function:

      - - -
    8. -
    9. ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

      -
    10. -
    11. ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two inflections (switching from concave to convex and then back again, or from convex to concave and then back again).

      -
    12. -
    13. ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

      -
    14. -
    -

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    -
    + + +
  • +

    ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    +
  • +
  • +

    + ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two + inflections (switching from concave to convex and then back again, or from convex to concave and then back again). +

    +
  • +
  • ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

  • + +

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    +
    +

    Wait, where do those lines come from?

    +

    + Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, + and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield + formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic + expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the + properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval. +

    +

    + The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general + cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to + it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general + curve does over the interval t=0 to t=1. +

    +

    + For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you + can probably follow along with this paper, although you might have to read it a few times before all the bits "click". +

    +
    -

    Wait, where do those lines come from?

    -

    Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval.

    -

    The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general curve does over the interval t=0 to t=1.

    -

    For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you can probably follow along with this paper, although you might have to read it a few times before all the bits "click".

    -
    - -

    So now the question becomes: how do we manipulate our curve so that it fits this canonical form, with three fixed points, and one "free" point? Enter linear algebra. Don't worry, I'll be doing all the math for you, as well as show you what the effect is on our curves, but basically we're going to be using linear algebra, rather than calculus, because "it's way easier". Sometimes a calculus approach is very hard to work with, when the equivalent geometrical solution is super obvious.

    -

    The approach is going to start with a curve that doesn't have all-colinear points (so we need to make sure the points don't all fall on a straight line), and then applying three graphics operations that you will probably have heard of: translation (moving all points by some fixed x- and y-distance), scaling (multiplying all points by some x and y scale factor), and shearing (an operation that turns rectangles into parallelograms).

    -

    Step 1: we translate any curve by -p1.x and -p1.y, so that the curve starts at (0,0). We're going to make use of an interesting trick here, by pretending our 2D coordinates are 3D, with the z coordinate simply always being 1. This is an old trick in graphics to overcome the limitations of 2D transformations: without it, we can only turn (x,y) coordinates into new coordinates of the form (ax + by, cx + dy), which means we can't do translation, since that requires we end up with some kind of (x + a, y + b). If we add a bogus z coordinate that is always 1, then we can suddenly add arbitrary values. For example:

    - - -

    Sweet! z stays 1, so we can effectively ignore it entirely, but we added some plain values to our x and y coordinates. So, if we want to subtract p1.x and p1.y, we use:

    - - -

    Running all our coordinates through this transformation gives a new set of coordinates, let's call those U, where the first coordinate lies on (0,0), and the rest is still somewhat free. Our next job is to make sure point 2 ends up lying on the x=0 line, so what we want is a transformation matrix that, when we run it, subtracts x from whatever x we currently have. This is called shearing, and the typical x-shear matrix and its transformation looks like this:

    - - -

    So we want some shearing value that, when multiplied by y, yields -x, so our x coordinate becomes zero. That value is simply -x/y, because *-x/y * y = -x*. Done:

    - - -

    Now, running this on all our points generates a new set of coordinates, let's call those V, which now have point 1 on (0,0) and point 2 on (0, some-value), and we wanted it at (0,1), so we need to do some scaling to make sure it ends up at (0,1). Additionally, we want point 3 to end up on (1,1), so we can also scale x to make sure its x-coordinate will be 1 after we run the transform. That means we'll be x-scaling by 1/point3x, and y-scaling by point2y. This is really easy:

    - - -

    Then, finally, this generates a new set of coordinates, let's call those W, of which point 1 lies on (0,0), point 2 lies on (0,1), and point three lies on (1, ...) so all that's left is to make sure point 3 ends up at (1,1) - but we can't scale! Point 2 is already in the right place, and y-scaling would move it out of (0,1) again, so our only option is to y-shear point three, just like how we x-sheared point 2 earlier. In this case, we do the same trick, but with y/x rather than x/y because we're not x-shearing but y-shearing. Additionally, we don't actually want to end up at zero (which is what we did before) so we need to shear towards an offset, in this case 1:

    - - -

    And this generates our final set of four coordinates. Of these, we already know that points 1 through 3 are (0,0), (0,1) and (1,1), and only the last coordinate is "free". In fact, given any four starting coordinates, the resulting "transformation mapped" coordinate will be:

    - - -

    Okay, well, that looks plain ridiculous, but: notice that every coordinate value is being offset by the initial translation, and also notice that a lot of terms in that expression are repeated. Even though the maths looks crazy as a single expression, we can just pull this apart a little and end up with an easy-to-calculate bit of code!

    -

    First, let's just do that translation step as a "preprocessing" operation so we don't have to subtract the values all the time. What does that leave?

    - - -

    Suddenly things look a lot simpler: the mapped x is fairly straight forward to compute, and we see that the mapped y actually contains the mapped x in its entirety, so we'll have that part already available when we need to evaluate it. In fact, let's pull out all those common factors to see just how simple this is:

    - - -

    That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it look!

    -
    + +

    + That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it + look! +

    +
    +

    How do you track all that?

    +

    + Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply + use Mathematica. Tracking all this math by hand is insane, and we invented computers, + literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the + program do the math for me. And real math, too, with symbols, not with numbers. In fact, + here's the Mathematica notebook if you want to see how + this works for yourself. +

    +

    + Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's + $344 for home use, up from $295 when I original wrote this, but it's + also free when you buy a $35 raspberry pi. Obviously, I bought a + raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do + it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for. +

    +
    -

    How do you track all that?

    -

    Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply use Mathematica. Tracking all this math by hand is insane, and we invented computers, literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the program do the math for me. And real math, too, with symbols, not with numbers. In fact, here's the Mathematica notebook if you want to see how this works for yourself.

    -

    Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's $344 for home use, up from $295 when I original wrote this, but it's also free when you buy a $35 raspberry pi. Obviously, I bought a raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for.

    -
    - -

    So, let's write up a sketch that'll show us the canonical form for any curve drawn in blue, overlaid on our canonical map, so that we can immediately tell which features our curve must have, based on where the fourth coordinate is located on the map:

    - - - Scripts are disabled. Showing fallback image. - - - - -
    -
    -

    Finding Y, given X

    -

    One common task that pops up in things like CSS work, or parametric equalizers, or image leveling, or any other number of applications where Bézier curves are used as control curves in a way that there is really only ever one "y" value associated with one "x" value, you might want to cut out the middle man, as it were, and compute "y" directly based on "x". After all, the function looks simple enough, finding the "y" value should be simple too, right? Unfortunately, not really. However, it is possible and as long as you have some code in place to help, it's not a lot of a work either.

    -

    We'll be tackling this problem in two stages: the first, which is the hard part, is figuring out which "t" value belongs to any given "x" value. For instance, have a look at the following graphic. On the left we have a Bézier curve that looks for all intents and purposes like it fits our criteria: every "x" has one and only one associated "y" value. On the right we see the function for just the "x" values: that's a cubic curve, but not a really crazy cubic curve. If you move the graphic's slider, you will see a red line drawn that corresponds to the x coordinate: this is a vertical line in the left graphic, and a horizontal line on the right.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Now, if you look more closely at that right graphic, you'll notice something interesting: if we treat the red line as "the x axis", then the point where the function crosses our line is really just a root for the cubic function x(t) through a shifted "x-axis"... and we've already seen how to calculate roots, so let's just run cubic root finding - and not even the complicated cubic case either: because of the kind of curve we're starting with, we know there is only root, simplifying the code we need!

    -

    First, let's look at the function for x(t):

    - - -

    We can rewrite this to a plain polynomial form, by just fully writing out the expansion and then collecting the polynomial factors, as:

    - - -

    Nothing special here: that's a standard cubic polynomial in "power" form (i.e. all the terms are ordered by their power of t). So, given that a, b, c, d, and x(t) are all known constants, we can trivially rewrite this (by moving the x(t) across the equal sign) as:

    - - -

    You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using Cardano's algorithm, and we're left with some rather short code:

    + +

    + You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they + all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using + Cardano's algorithm, and we're left with some rather short code: +

    - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So the procedure is fairly straight forward: pick an x, find the associated t value, evaluate our curve for that t value, which gives us the curve's {x,y} coordinate, which means we know y for this x. Move the slider for the following graphic to see this in action:

    - - - Scripts are disabled. Showing fallback image. - - - - - +y = curve.get(t).y + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Arc length

    -

    How long is a Bézier curve? As it turns out, that's not actually an easy question, because the answer requires maths that —much like root finding— cannot generally be solved the traditional way. If we have a parametric curve with fx(t) and fy(t), then the length of the curve, measured from start point to some point t = z, is computed using the following seemingly straight forward (if a bit overwhelming) formula:

    - - -

    or, more commonly written using Leibnitz notation as:

    - - -

    This formula says that the length of a parametric curve is in fact equal to the area underneath a function that looks a remarkable amount like Pythagoras' rule for computing the diagonal of a straight angled triangle. This sounds pretty simple, right? Sadly, it's far from simple... cutting straight to after the chase is over: for quadratic curves, this formula generates an unwieldy computation, and we're simply not going to implement things that way. For cubic Bézier curves, things get even more fun, because there is no "closed form" solution, meaning that due to the way calculus works, there is no generic formula that allows you to calculate the arc length. Let me just repeat this, because it's fairly crucial: for cubic and higher Bézier curves, there is no way to solve this function if you want to use it "for all possible coordinates".

    -

    Seriously: It cannot be done.

    -

    So we turn to numerical approaches again. The method we'll look at here is the Gauss quadrature. This approximation is a really neat trick, because for any nth degree polynomial it finds approximated values for an integral really efficiently. Explaining this procedure in length is way beyond the scope of this page, so if you're interested in finding out why it works, I can recommend the University of South Florida video lecture on the procedure, linked in this very paragraph. The general solution we're looking for is the following:

    - - -

    In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the better the approximation:

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    + +

    + In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips + sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The + more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the + better the approximation: +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -

    Now, infinitely many terms to sum and infinitely thin rectangles are not something that computers can work with, so instead we're going to approximate the infinite summation by using a sum of a finite number of "just thin" rectangular strips. As long as we use a high enough number of thin enough rectangular strips, this will give us an approximation that is pretty close to what the real value is.

    -

    So, the trick is to come up with useful rectangular strips. A naive way is to simply create n strips, all with the same width, but there is a far better way using special values for C and f(t) depending on the value of n, which indicates how many strips we'll use, and it's called the Legendre-Gauss quadrature.

    -

    This approach uses strips that are not spaced evenly, but instead spaces them in a special way based on describing the function as a polynomial (the more strips, the more accurate the polynomial), and then computing the exact integral for that polynomial. We're essentially performing arc length computation on a flattened curve, but flattening it based on the intervals dictated by the Legendre-Gauss solution.

    -
    - -

    Note that one requirement for the approach we'll use is that the integral must run from -1 to 1. That's no good, because we're dealing with Bézier curves, and the length of a section of curve applies to values which run from 0 to "some value smaller than or equal to 1" (let's call that value z). Thankfully, we can quite easily transform any integral interval to any other integral interval, by shifting and scaling the inputs. Doing so, we get the following:

    - - -

    That may look a bit more complicated, but the fraction involving z is a fixed number, so the summation, and the evaluation of the f(t) values are still pretty simple.

    -

    So, what do we need to perform this calculation? For one, we'll need an explicit formula for f(t), because that derivative notation is handy on paper, but not when we have to implement it. We'll also need to know what these Ci and ti values should be. Luckily, that's less work because there are actually many tables available that give these values, for any n, so if we want to approximate our integral with only two terms (which is a bit low, really) then these tables would tell us that for n=2 we must use the following values:

    - - -

    Which means that in order for us to approximate the integral, we must plug these values into the approximate function, which gives us:

    - - -

    We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the Bézier curve functions Bx(t) and By(t).

    -
    + +

    + We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the + Bézier curve functions Bx(t) and By(t). +

    + -

    If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the start/end points on the inside?

    - - - Scripts are disabled. Showing fallback image. - - - +

    + If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), + we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, + with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make + a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the + start/end points on the inside? +

    + + + Scripts are disabled. Showing fallback image. + + + +
    +
    +

    + + Approximated arc length +

    +

    + Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length + instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This + will come with an error, but this can be made arbitrarily small by increasing the segment count. +

    +

    + If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal + effort: +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
    -
    -
    -

    Approximated arc length

    -

    Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This will come with an error, but this can be made arbitrarily small by increasing the segment count.

    -

    If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal effort:

    -
    - - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
    - -

    You may notice that even though the error in length is actually pretty significant in absolute terms, even at a low number of segments we get a length that agrees with the true length when it comes to just the integer part of the arc length. Quite often, approximations can drastically speed things up!

    - -
    -
    -

    Curvature of a curve

    -

    If we have two curves, and we want to line them in up in a way that "looks right", what would we use as metric to let a computer decide what "looks right" means?

    -

    For instance, we can start by ensuring that the two curves share an end coordinate, so that there is no "gap" between the end of one and the start of the next curve, but that won't guarantee that things look right: both curves can be going in wildly different directions, and the resulting joined geometry will have a corner in it, rather than a smooth transition from one curve to the next.

    -

    What we want is to ensure that the curvature at the transition from one curve to the next "looks good". So, we start with a shared coordinate, and then also require that derivatives for both curves match at that coordinate. That way, we're assured that their tangents line up, which must mean the curve transition is perfectly smooth. We can even make the second, third, etc. derivatives match up for better and better transitions.

    -

    Problem solved!

    -

    However, there's a problem with this approach: if we think about this a little more, we realise that "what a curve looks like" and its derivative values are pretty much entirely unrelated. After all, the section on reordering curves showed us that the same looking curve can have an infinite number of curve expressions of arbitrarily high Bézier degree, and each of those will have wildly different derivative values.

    -

    So what we really want is some kind of expression that's not based on any particular expression of t, but is based on something that is invariant to the kind of function(s) we use to draw our curve. And the prime candidate for this is our curve expression, reparameterised for distance: no matter what order of Bézier curve we use, if we were able to rewrite it as a function of distance-along-the-curve, all those different degree Bézier functions would end up being the same function for "coordinate at some distance D along the curve".

    -

    We've seen this before... that's the arc length function.

    -

    So you might think that in order to find the curvature of a curve, we now need to solve the arc length function itself, and that this would be quite a problem because we just saw that there is no way to actually do that. Thankfully, we don't. We only need to know the form of the arc length function, which we saw above and is fairly simple, rather than needing to solve the arc length function. If we start with the arc length expression and the run through the steps necessary to determine its derivative (with an alternative, shorter demonstration of how to do this found over on Stackexchange), then the integral that was giving us so much problems in solving the arc length function disappears entirely (because of the fundamental theorem of calculus), and what we're left with us some surprisingly simple maths that relates curvature (denoted as κ, "kappa") to—and this is the truly surprising bit—a specific combination of derivatives of our original function.

    -

    Let me highlight what just happened, because it's pretty special:

    -
      -
    1. we wanted to make curves line up, and initially thought to match the curves' derivatives, but
    2. -
    3. that turned out to be a really bad choice, so instead
    4. -
    5. we picked a function that is basically impossible to work with, and then worked with that, which
    6. -
    7. gives us a simple formula that is and expression using the curves' derivatives.
    8. -
    -

    That's crazy!

    -

    But that's also one of the things that makes maths so powerful: even if your initial ideas are off the mark, you might be much closer than you thought you were, and the journey from "thinking we're completely wrong" to "actually being remarkably close to being right" is where we can find a lot of insight.

    -

    So, what does the function look like? This:

    - - -

    Which is really just a "short form" that glosses over the fact that we're dealing with functions of t, so let's expand that a tiny bit:

    - - -

    And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so evaluating this function for any t value is just a matter of basic arithematics.

    -

    In fact, let's just implement it right now:

    + +

    + And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any + (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that + looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier + curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so + evaluating this function for any t value is just a matter of basic arithematics. +

    +

    In fact, let's just implement it right now:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    That was easy! (Well okay, that "not a number" value will need to be taken into account by downstream code, but that's a reality of programming anyway)

    -

    With all of that covered, let's line up some curves! The following graphic gives you two curves that look identical, but use quadratic and cubic functions, respectively. As you can see, despite their derivatives being necessarily different, their curvature (thanks to being derived based on maths that "ignores" specific function derivative, and instead gives a formula that smooths out any differences) is exactly the same. And because of that, we can put them together such that the point where they overlap has the same curvature for both curves, giving us the smoothest transition.

    - - - Scripts are disabled. Showing fallback image. - - - -

    One thing you may have noticed in this sketch is that sometimes the curvature looks fine, but seems to be pointing in the wrong direction, making it hard to line up the curves properly. A way around that, of course, is to show the curvature on both sides of the curve, so let's just do that. But let's take it one step further: we can also compute the associated "radius of curvature", which gives us the implicit circle that "fits" the curve's curvature at any point, using what is possibly the simplest bit of maths found in this entire primer:

    - - -

    So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" our curve at some point that we can control by using a slider:

    - - - Scripts are disabled. Showing fallback image. - - - - - + +

    + So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" + our curve at some point that we can control by using a slider: +

    + + + Scripts are disabled. Showing fallback image. + + + + + +
    +
    +

    + + Tracing a curve at fixed distance intervals +

    +

    + Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance + intervals over time, like a train along a track, and you want to use Bézier curves. +

    +

    Now you have a problem.

    +

    + The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a + track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick + t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In + fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another. +

    +

    + The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be + straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To + make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for + this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length + function, which can have more inflection points. +

    + + + Scripts are disabled. Showing fallback image. + + +

    + So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through + the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and + then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of + points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two + points, but if we have a high enough number of samples, we don't even need to bother. +

    +

    + So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t + curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks + like, by coloring each section of curve between two distance markers differently: +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    +

    + However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to + constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT). +

    +
    +
    +

    + + Intersections +

    +

    + Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to + work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to + perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box + overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the + actual curve. +

    +

    + We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by + implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both + lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution. +

    +

    Line-line intersections

    +

    + If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are + an intervals on by linear algebra, using the procedure outlined in this + top coder + article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line + segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments. +

    +

    + The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on + (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection + point). +

    + + + Scripts are disabled. Showing fallback image. + + + +
    +

    Implementing line-line intersections

    +

    + Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, + but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe + you're using point structs for the line. Let's get coding: +

    -
    -
    -

    Tracing a curve at fixed distance intervals

    -

    Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance intervals over time, like a train along a track, and you want to use Bézier curves.

    -

    Now you have a problem.

    -

    The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another.

    -

    The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length function, which can have more inflection points.

    - - - Scripts are disabled. Showing fallback image. - - - -

    So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two points, but if we have a high enough number of samples, we don't even need to bother.

    -

    So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks like, by coloring each section of curve between two distance markers differently:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    -

    However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT).

    - -
    -
    -

    Intersections

    -

    Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the actual curve.

    -

    We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution.

    -

    Line-line intersections

    -

    If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are an intervals on by linear algebra, using the procedure outlined in this top coder article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments.

    -

    The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection point).

    - - - Scripts are disabled. Showing fallback image. - - - -
    - -

    Implementing line-line intersections

    -

    Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe you're using point structs for the line. Let's get coding:

    - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    - -
    + return lli4(line1.p1, line1.p2, line2.p1, line2.p2) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + -

    What about curve-line intersections?

    -

    Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the section on finding extremities.

    -
    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -
    +

    What about curve-line intersections?

    +

    + Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we + translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in + a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a + curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the + section on finding extremities. +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + Scripts are disabled. Showing fallback image. + + + +
    -

    Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and curve splitting.

    +

    + Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the + curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and + curve splitting. +

    +
    +
    +

    + + Curve/curve intersection +

    +

    + Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" + technique: +

    +
      +
    1. + Take two curves C1 and C2, and treat them as a pair. +
    2. +
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. +
    5. + With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2). +
    6. +
    7. + For each pair, check whether their bounding boxes overlap. +
        +
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. +
      3. If there is overlap, rerun all steps for this pair.
      4. +
      +
    8. +
    9. + Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we + might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value + (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative). +
    10. +
    +

    + This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then + taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections. +

    +

    + The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click + the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value + that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the + algorithm, so you can try this with lots of different curves. +

    +

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    + + + Scripts are disabled. Showing fallback image. + + + + + + +

    + Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into + two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at + t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each + iteration, and each successive step homing in on the curve's self-intersection points. +

    +
    +
    +

    + + The projection identity +

    +

    + De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to + draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. + Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point + around to change the curve's shape. +

    +

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    +

    + In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want + to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know + has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for + reasons that will become obvious. +

    +

    + So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following + graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which + taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what + happens to that value? +

    +
    + + + Scripts are disabled. Showing fallback image. + + + + + + + + Scripts are disabled. Showing fallback image. + + + + + +
    -
    -
    -

    Curve/curve intersection

    -

    Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" technique:

    -
      -
    1. Take two curves C1 and C2, and treat them as a pair.
    2. -
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. -
    5. With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2).
    6. -
    7. For each pair, check whether their bounding boxes overlap.
        -
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. -
      3. If there is overlap, rerun all steps for this pair.
      4. -
      -
    8. -
    9. Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative).
    10. -
    -

    This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections.

    -

    The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the algorithm, so you can try this with lots of different curves.

    -

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    - - - Scripts are disabled. Showing fallback image. - - - - - - -

    Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each iteration, and each successive step homing in on the curve's self-intersection points.

    - -
    -
    -

    The projection identity

    -

    De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point around to change the curve's shape.

    -

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    -

    In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for reasons that will become obvious.

    -

    So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what happens to that value?

    -
    - - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -
    - -

    So these graphics show us several things:

    -
      -
    1. a point at the tip of the curve construction's "hat": let's call that A, as well as
    2. -
    3. our on-curve point give our chosen t value: let's call that B, and finally,
    4. -
    5. a point that we get by projecting A, through B, onto the line between the curve's start and end points: let's call that C.
    6. -
    7. for both quadratic and cubic curves, two points e1 and e2, which represent the single-to-last step in de Casteljau's algorithm: in the last step, we find B at (1-t) * e1 + t * e2.
    8. -
    9. for cubic curves, also the points v1 and v2, which together with A represent the first step in de Casteljau's algorithm: in the next step, we find e1 and e2.
    10. -
    -

    These three values A, B, and C allow us to derive an important identity formula for quadratic and cubic Bézier curves: for any point on the curve with some t value, the ratio of distances from A to B and B to C is fixed: if some t value sets up a C that is 20% away from the start and 80% away from the end, then it doesn't matter where the start, end, or control points are; for that t value, C will always lie at 20% from the start and 80% from the end point. Go ahead, pick an on-curve point in either graphic and then move all the other points around: if you only move the control points, start and end won't move, and so neither will C, and if you move either start or end point, C will move but its relative position will not change.

    -

    So, how can we compute C? We start with our observation that C always lies somewhere between the start and end points, so logically C will have a function that interpolates between those two coordinates:

    - - -

    If we can figure out what the function u(t) looks like, we'll be done. Although we do need to remember that this u(t) will have a different form depending on whether we're working with quadratic or cubic curves. Running through the maths (with thanks to Boris Zbarsky) shows us the following two formulae:

    - - -

    And

    - - -

    So, if we know the start and end coordinates and the t value, we know C without having to calculate the A or even B coordinates. In fact, we can do the same for the ratio function. As another function of t, we technically don't need to know what A or B or C are. It, too, can be expressed as a pure function of t.

    -

    We start by observing that, given A, B, and C, the following always holds:

    - - -

    Working out the maths for this, we see the following two formulae for quadratic and cubic curves:

    - - -

    And

    - - -

    Which now leaves us with some powerful tools: given three points (start, end, and "some point on the curve"), as well as a t value, we can construct curves. We can compute C using the start and end points and our u(t) function, and once we have C, we can use our on-curve point (B) and the ratio(t) function to find A:

    - - -

    With A found, finding e1 and e2 for quadratic curves is a matter of running the linear interpolation with t between start and A to yield e1, and between A and end to yield e2. For cubic curves, there is no single pair of points that can act as e1 and e2 (there are infinitely many, because the tangent at B is a free parameter for cubic curves) so as long as the distance ratio between e1 to B and B to e2 is the Bézier ratio (1-t):t, we are free to pick any pair, after which we can reverse engineer v1 and v2:

    - - -

    And then reverse engineer the curve's control points:

    - - -

    So: if we have a curve's start and end points, as well as some third point B that we want the curve to pass through, then for any t value we implicitly know all the ABC values, which (combined with an educated guess on appropriate e1 and e2 coordinates for cubic curves) gives us the necessary information to reconstruct a curve's "de Casteljau skeleton". Which means that we can now do several things: we can "fit" curves using only three points, which means we can also "mold" curves by moving an on-curve point but leaving its start and end points, and then reconstruct the curve based on where we moved the on-curve point to. These are very useful things, and we'll look at both in the next few sections.

    - -
    -
    -

    Creating a curve from three points

    -

    Given the preceding section, you might be wondering if we can use that knowledge to just "create" curves by placing some points and having the computer do the rest, to which the answer is: that's exactly what we can now do!

    -

    For quadratic curves, things are pretty easy. Technically, we'll need a t value in order to compute the ratio function used in computing the ABC coordinates, but we can just as easily approximate one by treating the distance between the start and B point, and B and end point as a ratio, using

    - - -

    With this code in place, creating a quadratic curve from three points is literally just computing the ABC values, and using A as our curve's control point:

    - - - Scripts are disabled. Showing fallback image. - - - -

    For cubic curves we need to do a little more work, but really only just a little. We're first going to assume that a decent curve through the three points should approximate a circular arc, which first requires knowing how to fit a circle to three points. You may remember (if you ever learned it!) that a line between two points on a circle is called a chord, and that one property of chords is that the line from the center of any chord, perpendicular to that chord, passes through the center of the circle.

    -

    That means that if we have three points on a circle, we have three (different) chords, and consequently, three (different) lines that go from those chords through the center of the circle: if we find two of those lines, then their intersection will be our circle's center, and the circle's radius will—by definition!—be the distance from the center to any of our three points:

    - - - Scripts are disabled. Showing fallback image. - - - -

    With that covered, we now also know the tangent line to our point B, because the tangent to any point on the circle is a line through that point, perpendicular to the line from that point to the center. That just leaves marking appropriate points e1 and e2 on that tangent, so that we can construct a new cubic curve hull. We use the approach as we did for quadratic curves to automatically determine a reasonable t value, and then our e1 and e2 coordinates must obey the standard de Casteljau rule for linear interpolation:

    - - -

    Where d is the total length of the line segment from e1 to e2. So how long do we make that? There are again all kinds of approaches we can take, and a simple-but-effective one is to set the length of that segment to "one third the length of the baseline". This forces e1 and e2 to always be the "linear curve" distance apart, which means if we place our three points on a line, it will actually look like a line. Nice! The last thing we'll need to do is make sure to flip the sign of d depending on which side of the baseline our B is located, so we don't end up creating a funky curve with a loop in it. To do this, we can use the atan2 function:

    - - -

    This angle φ will be between 0 and π if B is "above" the baseline (rotating all three points so that the start is on the left and the end is the right), so we can use a relatively straight forward check to make sure we're using the correct sign for our value d:

    - - -

    The result of this approach looks as follows:

    - - - Scripts are disabled. Showing fallback image. - - - -

    It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; more on that later), we're only trying to come up with some reasonable e1 and e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives us:

    - - - Scripts are disabled. Showing fallback image. - - - -

    That looks perfectly serviceable!

    -

    Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at implementing curve molding in the section after that, so read on!

    + +

    The result of this approach looks as follows:

    + + + Scripts are disabled. Showing fallback image. + + + +

    + It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, + we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; + more on that later), we're only trying to come up with some reasonable e1 and + e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives + us: +

    + + + Scripts are disabled. Showing fallback image. + + + +

    That looks perfectly serviceable!

    +

    + Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" + curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding + the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at + implementing curve molding in the section after that, so read on! +

    +
    +
    +

    + + Projecting a point onto a Bézier curve +

    +

    + Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're + trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the + curve our cursor will be closest to. So, how do we project points onto a curve? +

    +

    + If the Bézier curve is of low enough order, we might be able to + work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a + numerical approach again. We'll be finding our ideal t value using a + binary search. First, we do a coarse distance-check based on + t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast: +

    -
    -
    -

    Projecting a point onto a Bézier curve

    -

    Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the curve our cursor will be closest to. So, how do we project points onto a curve?

    -

    If the Bézier curve is of low enough order, we might be able to work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a numerical approach again. We'll be finding our ideal t value using a binary search. First, we do a coarse distance-check based on t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    After this runs, we know that LUT[i] is the coordinate on the curve in our LUT that is closest to the point we want to project, so that's a pretty good initial guess as to what the best projection onto our curve is. To refine it, we note that LUT[i] is a better guess than both LUT[i-1] and LUT[i+1], but there might be an even better projection somewhere else between those two values, so that's what we're going to be testing for, using a variation of the binary search.

    -
      -
    1. we start with our point p, and the t values t1=LUT[i-1].t and t2=LUT[i+1].t, which span an interval v = t2-t1.
    2. -
    3. we test this interval in five spots: the start, middle, and end (which we already have), and the two points in between the middle and start/end points
    4. -
    5. we then check which of these five points is the closest to our original point p, and then repeat step 1 with the points before and after the closest point we just found.
    6. -
    -

    This makes the interval we check smaller and smaller at each iteration, and we can keep running the three steps until the interval becomes so small as to lead to distances that are, for all intents and purposes, the same for all points.

    -

    So, let's see that in action: in this case, I'm going to arbitrarily say that if we're going to run the loop until the interval is smaller than 0.001, and show you what that means for projecting your mouse cursor or finger tip onto a rather complex Bézier curve (which, of course, you can reshape as you like). Also shown are the original three points that our coarse check finds.

    - - - Scripts are disabled. Showing fallback image. - - - + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + -
    -
    -

    Intersections with a circle

    -

    It might seem odd to cover this subject so much later than the line/line, line/curve, and curve/curve intersection topics from several sections earlier, but the reason we can't cover circle/curve intersections is that we can't really discuss circle/curve intersection until we've covered the kind of lookup table (LUT) walking that the section on projecting a point onto a curve uses. To see why, let's look at what we would have to do if we wanted to find the intersections between a curve and a circle using calculus.

    -

    First, we observe that "finding intersections" in this case means that, given a circle defined by a center point c = (x,y) and a radius r, we want to find all points on the Bezier curve for which the distance to the circle's center point is equal to the circle radius, which by definition means those points lie on the circle, and so count as intersections. In maths, that means we're trying to solve:

    - - -

    Which seems simple enough. Unfortunately, when we expand that dist function, things get a lot more problematic:

    - - -

    And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the Abel-Ruffini theorem that we saw before, we can't solve this by just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left with a monstrous function to solve.

    -

    So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a bit more code to make sure we find all of them, but let's look at the steps involved:

    + +

    + And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the + Abel-Ruffini theorem that we saw before, we can't solve this by + just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to + actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and + all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left + with a monstrous function to solve. +

    +

    + So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the + on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance + r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a + bit more code to make sure we find all of them, but let's look at the steps involved: +

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    - -

    This is very similar to the code in the previous section, with an extra input r for the circle radius, and a minor change in the "distance for this coordinate": rather than just distance(coordinate, p) we want to know the difference between that distance and the circle radius. After all, if that difference is zero, then the distance from the coordinate to the circle center is exactly the radius, so the coordinate lies on both the curve and the circle.

    -

    So far so good.

    -

    However, we also want to make sure we find all the points, not just a single one, so we need a little more code for that:

    + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    After running this code, values will be the list of all LUT coordinates that are closest to the distance r: we can use those values to run the same kind of refinement lookup we used for point projection (with the caveat that we're now not checking for smallest distance, but for "distance closest to r"), and we'll have all our intersection points. Of course, that does require explaining what findClosest does: rather than looking for a global minimum, we're now interested in finding a local minimum, so instead of checking a single point and looking at its distance value, we check three points ("current", "previous" and "before previous") and then check whether they form a local minimum:

    + start = i + 2; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    In words: given a start index, the circle center and radius, and our LUT, we check where (closest to our start index) we can find a local minimum for the difference between "the distance from the curve to the circle center", and the circle's radius. We track this by looking at three values (associated with the indices index-2, index-1, and index), and we know we've found a local minimum if the three values show that the middle value (pd1) is less than either value beside it. When we do, we can set our "best guess, relative to start" as index-1. Of course, since we're now checking values relative to some start value, we might not find another candidate value at all, in which case we return start - 1, so that a simple "is the result less than start?" lets us determine that there are no more intersections to find.

    -

    Finally, while not necessary for point projection, there is one more step we need to perform when we run the binary refinement function on our candidate LUT indices, because we've so far only been testing using distances "closest to the radius of the circle", and that's actually not good enough... we need distances that are the radius of the circle. So, after running the refinement for each of these indices, we need to discard any final value that isn't the circle radius. And because we're working with floating point numbers, what this really means is that we need to discard any value that's a pixel or more "off". Or, if we want to get really fancy, "some small epsilon value".

    -

    Based on all of that, the following graphic shows this off for the standard cubic curve (which you can move the coordinates around for, of course) and a circle with a controllable radius centered on the graphic's center, using the code approach described above.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    And of course, for the full details, click that "view source" link.

    + return start + i + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + -
    -
    -

    Molding a curve

    -

    Armed with knowledge of the "ABC" relation, point-on-curve projection, and guestimating reasonable looking helper values for cubic curve construction, we can finally cover curve molding: updating a curve's shape interactively, by dragging points on the curve around.

    -

    For quadratic curve, this is a really simple trick: we project our cursor onto the curve, which gives us a t value and initial B coordinate. We don't even need the latter: with our t value and "wherever the cursor is" as target B, we can compute the associated C:

    - - -

    And then the associated A:

    - - -

    And we're done, because that's our new quadratic control point!

    - - - Scripts are disabled. Showing fallback image. - - - -

    As before, cubic curves are a bit more work, because while it's easy to find our initial t value and ABC values, getting those all-important e1 and e2 coordinates is going to pose a bit of a problem... in the section on curve creation, we were free to pick an appropriate t value ourselves, which allowed us to find appropriate e1 and e2 coordinates. That's great, but when we're curve molding we don't have that luxury: whatever point we decide to start moving around already has its own t value, and its own e1 and e2 values, and those may not make sense for the rest of the curve.

    -

    For example, let's see what happens if we just "go with what we get" when we pick a point and start moving it around, preserving its t value and e1/e2 coordinates:

    - - - Scripts are disabled. Showing fallback image. - - - -

    That looks reasonable, close to the original point, but the further we drag our point, the less "useful" things become. Especially if we drag our point across the baseline, rather than turning into a nice curve.

    -

    One way to combat this might be to combine the above approach with the approach from the creating curves section: generate both the "unchanged t/e1/e2" curve, as well as the "idealized" curve through the start/cursor/end points, with idealized t value, and then interpolating between those two curves:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    The slide controls the "falloff distance" relative to where the original point on the curve is, so that as we drag our point around, it interpolates with a bias towards "preserving t/e1/e2" closer to the original point, and bias towards "idealized" form the further away we move our point, with anything that's further than our falloff distance simply being the idealized curve. We don't even try to interpolate at that point.

    -

    A more advanced way to try to smooth things out is to implement continuous molding, where we constantly update the curve as we move around, and constantly change what our B point is, based on constantly projecting the cursor on the curve as we're updating it - this is, you won't be surprised to learn, tricky, and beyond the scope of this section: interpolation (with a reasonable distance) will do for now!

    - -
    -
    -

    Curve fitting

    -

    Given the previous section, one question you might have is "what if I don't want to guess t values?". After all, plenty of graphics packages do automated curve fitting, so how can we implement that in a way that just finds us reasonable t values all on its own?

    -

    And really this is just a variation on the question "how do I get the curve through these X points?", so let's look at that. Specifically, let's look at the answer: "curve fitting". This is in fact a rather rich field in geometry, applying to anything from data modelling to path abstraction to "drawing", so there's a fair number of ways to do curve fitting, but we'll look at one of the most common approaches: something called a least squares polynomial regression. In this approach, we look at the number of points we have in our data set, roughly determine what would be an appropriate order for a curve that would fit these points, and then tackle the question "given that we want an nth order curve, what are the coordinates we can find such that our curve is "off" by the least amount?".

    -

    Now, there are many ways to determine how "off" points are from the curve, which is where that "least squares" term comes in. The most common tool in the toolbox is to minimise the squared distance between each point we have, and the corresponding point on the curve we end up "inventing". A curve with a snug fit will have zero distance between those two, and a bad fit will have non-zero distances between every such pair. It's a workable metric. You might wonder why we'd need to square, rather than just ensure that distance is a positive value (so that the total error is easy to compute by just summing distances) and the answer really is "because it tends to be a little better". There's lots of literature on the web if you want to deep-dive the specific merits of least squared error metrics versus least absolute error metrics, but those are well beyond the scope of this material.

    -

    So let's look at what we end up with in terms of curve fitting if we start with the idea of performing least squares Bézier fitting. We're going to follow a procedure similar to the one described by Jim Herold over on his "Least Squares Bézier Fit" article, and end with some nice interactive graphics for doing some curve fitting.

    -

    Before we begin, we're going to use the curve in matrix form. In the section on matrices, I mentioned that some things are easier if we use the matrix representation of a Bézier curve rather than its calculus form, and this is one of those things.

    -

    As such, the first step in the process is expressing our Bézier curve as powers/coefficients/coordinate matrix T x M x C, by expanding the Bézier functions.

    -
    - -

    Revisiting the matrix representation

    -

    Rewriting Bézier functions to matrix form is fairly easy, if you first expand the function, and then arrange them into a multiple line form, where each line corresponds to a power of t, and each column is for a specific coefficient. First, we expand the function:

    - - -

    And then we (trivially) rearrange the terms across multiple lines:

    - - -

    This rearrangement has "factors of t" at each row (the first row is t⁰, i.e. "1", the second row is t¹, i.e. "t", the third row is t²) and "coefficient" at each column (the first column is all terms involving "a", the second all terms involving "b", the third all terms involving "c").

    -

    With that arrangement, we can easily decompose this as a matrix multiplication:

    - - -

    We can do the same for the cubic curve, of course. We know the base function for cubics:

    - - -

    So we write out the expansion and rearrange:

    - - -

    Which we can then decompose:

    - - -

    And, of course, we can do this for quartic curves too (skipping the expansion step):

    - - -

    And so and on so on. Now, let's see how to use these T, M, and C, to do some curve fitting.

    -
    + +

    + And so and on so on. Now, let's see how to use these T, M, and C, to do some curve + fitting. +

    + -

    Let's get started: we're going to assume we picked the right order curve: for n points we're fitting an n-1th order curve, so we "start" with a vector P that represents the coordinates we already know, and for which we want to do curve fitting:

    - - -

    Next, we need to figure out appropriate t values for each point in the curve, because we need something that lets us tie "the actual coordinate" to "some point on the curve". There's a fair number of different ways to do this (and a large part of optimizing "the perfect fit" is about picking appropriate t values), but in this case let's look at two "obvious" choices:

    -
      -
    1. equally spaced t values, and
    2. -
    3. t values that align with distance along the polygon.
    4. -
    -

    The first one is really simple: if we have n points, then we'll just assign each point i a t value of (i-1)/(n-1). So if we have four points, the first point will have t=(1-1)/(4-1)=0/3, the second point will have t=(2-1)/(4-1)=1/3, the third point will have t=2/3, and the last point will be t=1. We're just straight up spacing the t values to match the number of points we have.

    -

    The second one is a little more interesting: since we're doing polynomial regression, we might as well exploit the fact that our base coordinates just constitute a collection of line segments. At the first point, we're fixing t=0, and the last point, we want t=1, and anywhere in between we're simply going to say that t is equal to the distance along the polygon, scaled to the [0,1] domain.

    -

    To get these values, we first compute the general "distance along the polygon" matrix:

    - - -

    Where length() is literally just that: the length of the line segment between the point we're looking at, and the previous point. This isn't quite enough, of course: we still need to make sure that all the values between i=1 and i=n fall in the [0,1] interval, so we need to scale all values down by whatever the total length of the polygon is:

    - - -

    And now we can move on to the actual "curve fitting" part: what we want is a function that lets us compute "ideal" control point values such that if we build a Bézier curve with them, that curve passes through all our original points. Or, failing that, have an overall error distance that is as close to zero as we can get it. So, let's write out what the error distance looks like.

    -

    As mentioned before, this function is really just "the distance between the actual coordinate, and the coordinate that the curve evaluates to for the associated t value", which we'll square to get rid of any pesky negative signs:

    - - -

    Since this function only deals with individual coordinates, we'll need to sum over all coordinates in order to get the full error function. So, we literally just do that; the total error function is simply the sum of all these individual errors:

    - - -

    And here's the trick that justifies using matrices: while we can work with individual values using calculus, with matrices we can compute as many values as we make our matrices big, all at the "same time", We can replace the individual terms pi with the full P coordinate matrix, and we can replace Bézier(si) with the matrix representation T x M x C we talked about before, which gives us:

    - - -

    In which we can replace the rather cumbersome "squaring" operation with a more conventional matrix equivalent:

    - - -

    Here, the letter T is used instead of the number 2, to represent the matrix transpose; each row in the original matrix becomes a column in the transposed matrix instead (row one becomes column one, row two becomes column two, and so on).

    -

    This leaves one problem: T isn't actually the matrix we want: we don't want symbolic t values, we want the actual numerical values that we computed for S, so we need to form a new matrix, which we'll call 𝕋, that makes use of those, and then use that 𝕋 instead of T in our error function:

    - - -

    Which, because of the first and last values in S, means:

    - - -

    Now we can properly write out the error function as matrix operations:

    - - -

    So, we have our error function: we now need to figure out the expression for where that function has minimal value, e.g. where the error between the true coordinates and the coordinates generated by the curve fitting is smallest. Like in standard calculus, this requires taking the derivative, and determining where that derivative is zero:

    - - -
    + +
    +

    Where did this derivative come from?

    +

    + That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I + straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know. +

    +

    + So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific + case, I + posted a question + to Math.stackexchange, and received a answer that goes into way more detail than I had + hoped to receive. +

    +

    + Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean + maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that + true?". There are answers. They might just require some time to come to understand. +

    +
    -

    Where did this derivative come from?

    -

    That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know.

    -

    So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific case, I posted a question to Math.stackexchange, and received a answer that goes into way more detail than I had hoped to receive.

    -

    Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that true?". There are answers. They might just require some time to come to understand.

    -
    - -

    Now, given the above derivative, we can rearrange the terms (following the rules of matrix algebra) so that we end up with an expression for C:

    - - -

    Here, the "to the power negative one" is the notation for the matrix inverse. But that's all we have to do: we're done. Starting with P and inventing some t values based on the polygon the coordinates in P define, we can compute the corresponding Bézier coordinates C that specify a curve that goes through our points. Or, if it can't go through them exactly, as near as possible.

    -

    So before we try that out, how much code is involved in implementing this? Honestly, that answer depends on how much you're going to be writing yourself. If you already have a matrix maths library available, then really not that much code at all. On the other hand, if you are writing this from scratch, you're going to have to write some utility functions for doing your matrix work for you, so it's really anywhere from 50 lines of code to maybe 200 lines of code. Not a bad price to pay for being able to fit curves to pre-specified coordinates.

    -

    So let's try it out! The following graphic lets you place points, and will start computing exact-fit curves once you've placed at least three. You can click for more points, and the code will simply try to compute an exact fit using a Bézier curve of the appropriate order. Four points? Cubic Bézier. Five points? Quartic. And so on. Of course, this does break down at some point: depending on where you place your points, it might become mighty hard for the fitter to find an exact fit, and things might actually start looking horribly off once there's enough points for compound floating point rounding errors to start making a difference (which is around 10~11 points).

    - - - Scripts are disabled. Showing fallback image. - - - - - - -

    You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be able to freely manipulate them and see what the effect on your curve is.

    - -
    -
    -

    Bézier curves and Catmull-Rom curves

    -

    Taking an excursion to different splines, the other common design curve is the Catmull-Rom spline, which unlike Bézier curves pass through each control point, so they offer a kind of "built-in" curve fitting.

    -

    In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what looks like a single Catmull-Rom curve is actually a spline: a single curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is defined by the point's coordinates, and the tangent for those points, the latter of which can trivially be derived from knowing the previous and next point:

    - + +

    + You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio + along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get + direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be + able to freely manipulate them and see what the effect on your curve is. +

    +
    +
    +

    + + Bézier curves and Catmull-Rom curves +

    +

    + Taking an excursion to different splines, the other common design curve is the + Catmull-Rom spline, which unlike Bézier curves + pass through each control point, so they offer a kind of "built-in" curve fitting. +

    +

    + In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets + you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the + tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what + looks like a single Catmull-Rom curve is actually a spline: a single + curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, + and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is + defined by the point's coordinates, and the tangent for those points, the latter of which + can trivially be derived from knowing the + previous and next point: +

    + - -

    One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on that in the next section.

    -

    In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of variations that make things considerable more complex):

    + +

    + One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up + with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent + should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on + that in the next section. +

    +

    + In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of + variations that make things + considerable more + complex): +

    - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Now, since a Catmull-Rom curve is a form of cubic Hermite spline, and as cubic Bézier curves are also a form of cubic Hermite spline, we run into an interesting bit of maths programming: we can convert one to the other and back, and the maths for doing so is surprisingly simple!

    -

    The main difference between Catmull-Rom curves and Bézier curves is "what the points mean":

    -
      -
    • A cubic Bézier curve is defined by a start point, a control point that implies the tangent at the start, a control point that implies the tangent at the end, and an end point, plus a characterizing matrix that we can multiply by that point vector to get on-curve coordinates.
    • -
    • A Catmull-Rom curve is defined by a start point, a tangent that for that starting point, an end point, and a tangent for that end point, plus a characteristic matrix that we can multiple by the point vector to get on-curve coordinates.
    • -
    -

    Those are very similar, so let's see exactly how similar they are. We've already see the matrix form for Bézier curves, so how different is the matrix form for Catmull-Rom curves?:

    - - -

    That's pretty dang similar. So the question is: how can we convert that expression with Catmull-Rom matrix and vector into an expression of the Bézier matrix and vector? The short answer is of course "by using linear algebra", but the longer answer is the rest of this section, and involves some maths that you may not even care for: if you just want to know the (incredibly simple) conversions between the two curve forms, feel free to skip to the end of the following explanation, but if you want to how we can get one from the other... let's get mathing!

    -
    - -

    Deriving the conversion formulae

    -

    In order to convert between Catmull-Rom curves and Bézier curves, we need to know two things. Firstly, how to express the Catmull-Rom curve using a "set of four coordinates", rather than a mix of coordinates and tangents, and secondly, how to convert those Catmull-Rom coordinates to and from Bézier form.

    -

    We start with the first part, to figure out how we can go from Catmull-Rom V coordinates to Bézier P coordinates, by applying "some matrix T". We don't know what that T is yet, but we'll get to that:

    - - -

    So, this mapping says that in order to map a Catmull-Rom "point + tangent" vector to something based on an "all coordinates" vector, we need to determine the mapping matrix such that applying T yields P2 as start point, P3 as end point, and two tangents based on the lines between P1 and P3, and P2 nd P4, respectively.

    -

    Computing T is really more "arranging the numbers":

    - - -

    Thus:

    - - -

    However, we're not quite done, because Catmull-Rom curves have that "tension" parameter, written as τ (a lowercase"tau"), which is a scaling factor for the tangent vectors: the bigger the tension, the smaller the tangents, and the smaller the tension, the bigger the tangents. As such, the tension factor goes in the denominator for the tangents, and before we continue, let's add that tension factor into both our coordinate vector representation, and mapping matrix T:

    - - -

    With the mapping matrix properly done, let's rewrite the "point + tangent" Catmull-Rom matrix form to a matrix form in terms of four coordinates, and see what we end up with:

    - - -

    Replace point/tangent vector with the expression for all-coordinates:

    - - -

    and merge the matrices:

    - - -

    This looks a lot like the Bézier matrix form, which as we saw in the chapter on Bézier curves, should look like this:

    - - -

    So, if we want to express a Catmull-Rom curve using a Bézier curve, we'll need to turn this Catmull-Rom bit:

    - - -

    Into something that looks like this:

    - - -

    And the way we do that is with a fairly straight forward bit of matrix rewriting. We start with the equality we need to ensure:

    - - -

    Then we remove the coordinate vector from both sides without affecting the equality:

    - - -

    Then we can "get rid of" the Bézier matrix on the right by left-multiply both with the inverse of the Bézier matrix:

    - - -

    A matrix times its inverse is the matrix equivalent of 1, and because "something times 1" is the same as "something", so we can just outright remove any matrix/inverse pair:

    - - -

    And now we're basically done. We just multiply those two matrices and we know what V is:

    - - -

    We now have the final piece of our function puzzle. Let's run through each step.

    -
      -
    1. Start with the Catmull-Rom function:
    2. -
    - - -
      -
    1. rewrite to pure coordinate form:
    2. -
    - - -
      -
    1. rewrite for "normal" coordinate vector:
    2. -
    - - -
      -
    1. merge the inner matrices:
    2. -
    - - -
      -
    1. rewrite for Bézier matrix form:
    2. -
    - - -
      -
    1. and transform the coordinates so we have a "pure" Bézier expression:
    2. -
    - - -

    And we're done: we finally know how to convert these two curves!

    -
    + +

    And we're done: we finally know how to convert these two curves!

    + -

    If we have a Catmull-Rom curve defined by four coordinates P1 through P4, then we can draw that curve using a Bézier curve that has the vector:

    - - -

    Similarly, if we have a Bézier curve defined by four coordinates P1 through P4, we can draw that using a standard tension Catmull-Rom curve with the following coordinate values:

    - - -

    Or, if your API allows you to specify Catmull-Rom curves using plain coordinates:

    - - - -
    -
    -

    Creating a Catmull-Rom curve from three points

    -

    Much shorter than the previous section: we saw that Catmull-Rom curves need at least 4 points to draw anything sensible, so how do we create a Catmull-Rom curve from three points?

    -

    Short and sweet: we don't.

    -

    We run through the maths that lets us create a cubic Bézier curve, and then convert its coordinates to Catmull-Rom form using the conversion formulae we saw above.

    - -
    -
    -

    Forming poly-Bézier curves

    -

    Much like lines can be chained together to form polygons, Bézier curves can be chained together to form poly-Béziers, and the only trick required is to make sure that:

    -
      -
    1. the end point of each section is the starting point of the following section, and
    2. -
    3. the derivatives across that dual point line up.
    4. -
    -

    Unless you want sharp corners, of course. Then you don't even need 2.

    -

    We'll cover three forms of poly-Bézier curves in this section. First, we'll look at the kind that just follows point 1. where the end point of a segment is the same point as the start point of the next segment. This leads to poly-Béziers that are pretty hard to work with, but they're the easiest to implement:

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    Dragging the control points around only affects the curve segments that the control point belongs to, and moving an on-curve point leaves the control points where they are, which is not the most useful for practical modelling purposes. So, let's add in the logic we need to make things a little better. We'll start by linking up control points by ensuring that the "incoming" derivative at an on-curve point is the same as it's "outgoing" derivative:

    - - -

    We can effect this quite easily, because we know that the vector from a curve's last control point to its last on-curve point is equal to the derivative vector. If we want to ensure that the first control point of the next curve matches that, all we have to do is mirror that last control point through the last on-curve point. And mirroring any point A through any point B is really simple:

    - - -

    So let's implement that and see what it gets us. The following two graphics show a quadratic and a cubic poly-Bézier curve again, but this time moving the control points around moves others, too. However, you might see something unexpected going on for quadratic curves...

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    As you can see, quadratic curves are particularly ill-suited for poly-Bézier curves, as all the control points are effectively linked. Move one of them, and you move all of them. Not only that, but if we move the on-curve points, it's possible to get a situation where a control point cannot satisfy the constraint that it's the reflection of its two neighbouring control points... This means that we cannot use quadratic poly-Béziers for anything other than really, really simple shapes. And even then, they're probably the wrong choice. Cubic curves are pretty decent, but the fact that the derivatives are linked means we can't manipulate curves as well as we might if we relaxed the constraints a little.

    -

    So: let's relax the requirement a little.

    -

    We can change the constraint so that we still preserve the angle of the derivatives across sections (so transitions from one section to the next will still look natural), but give up the requirement that they should also have the same vector length. Doing so will give us a much more useful kind of poly-Bézier curve:

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    Cubic curves are now better behaved when it comes to dragging control points around, but the quadratic poly-Bézier still has the problem that moving one control points will move the control points and may ending up defining "the next" control point in a way that doesn't work. Quadratic curves really aren't very useful to work with...

    -

    Finally, we also want to make sure that moving the on-curve coordinates preserves the relative positions of the associated control points. With that, we get to the kind of curve control that you might be familiar with from applications like Photoshop, Inkscape, Blender, etc.

    - - - Scripts are disabled. Showing fallback image. - - - - - - Scripts are disabled. Showing fallback image. - - - -

    Again, we see that cubic curves are now rather nice to work with, but quadratic curves have a new, very serious problem: we can move an on-curve point in such a way that we can't compute what needs to "happen next". Move the top point down, below the left and right points, for instance. There is no way to preserve correct control points without a kink at the bottom point. Quadratic curves: just not that good...

    -

    A final improvement is to offer fine-level control over which points behave which, so that you can have "kinks" or individually controlled segments when you need them, with nicely well-behaved curves for the rest of the path. Implementing that, is left as an exercise for the reader.

    - -
    -
    -

    Curve offsetting

    -

    Perhaps you're like me, and you've been writing various small programs that use Bézier curves in some way or another, and at some point you make the step to implementing path extrusion. But you don't want to do it pixel based; you want to stay in the vector world. You find that extruding lines is relatively easy, and tracing outlines is coming along nicely (although junction caps and fillets are a bit of a hassle), and then you decide to do things properly and add Bézier curves to the mix. Now you have a problem.

    -

    Unlike lines, you can't simply extrude a Bézier curve by taking a copy and moving it around, because of the curvatures; rather than a uniform thickness, you get an extrusion that looks too thin in places, if you're lucky, but more likely will self-intersect. The trick, then, is to scale the curve, rather than simply copying it. But how do you scale a Bézier curve?

    -

    Bottom line: you can't. So you cheat. We're not going to do true curve scaling, or rather curve offsetting, because that's impossible. Instead we're going to try to generate 'looks good enough' offset curves.

    -
    - -

    "What do you mean, you can't? Prove it."

    -

    First off, when I say "you can't," what I really mean is "you can't offset a Bézier curve with another Bézier curve", not even by using a really high order curve. You can find the function that describes the offset curve, but it won't be a polynomial, and as such it cannot be represented as a Bézier curve, which has to be a polynomial. Let's look at why this is:

    -

    From a mathematical point of view, an offset curve O(t) is a curve such that, given our original curve B(t), any point on O(t) is a fixed distance d away from coordinate B(t). So let's math that:

    - - -

    However, we're working in 2D, and d is a single value, so we want to turn it into a vector. If we want a point distance d "away" from the curve B(t) then what we really mean is that we want a point at d times the "normal vector" from point B(t), where the "normal" is a vector that runs perpendicular ("at a right angle") to the tangent at B(t). Easy enough:

    - - -

    Now this still isn't very useful unless we know what the formula for N(t) is, so let's find out. N(t) runs perpendicular to the original curve tangent, and we know that the tangent is simply B'(t), so we could just rotate that 90 degrees and be done with it. However, we need to ensure that N(t) has the same magnitude for every t, or the offset curve won't be at a uniform distance, thus not being an offset curve at all. The easiest way to guarantee this is to make sure N(t) always has length 1, which we can achieve by dividing B'(t) by its magnitude:

    - - -

    Determining the length requires computing an arc length, and this is where things get Tricky with a capital T. First off, to compute arc length from some start a to end b, we must use the formula we saw earlier. Noting that "length" is usually denoted with double vertical bars:

    - - -

    So if we want the length of the tangent, we plug in B'(t), with t = 0 as start and -t = 1 as end:

    - - -

    And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is screwing everything up, because it turns our nice polynomials into things that are no longer polynomials.

    -

    There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine because they do, but they don't; remember that only the base function has binomial coefficients. That's before we factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call them circles, and they have much simpler functions than Bézier curves.

    -

    So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve.

    -

    And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the real offset curve would look like, if we could compute it.

    -
    + +

    + And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is + screwing everything up, because it turns our nice polynomials into things that are no longer polynomials. +

    +

    + There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with + unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine + because they do, but they don't; remember that only the base function has binomial coefficients. That's before we + factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make + all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call + them circles, and they have much simpler functions than Bézier curves. +

    +

    + So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means + N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means + that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for + O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve. +

    +

    + And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier + curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, + much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the + real offset curve would look like, if we could compute it. +

    + -

    So, you cannot offset a Bézier curve perfectly with another Bézier curve, no matter how high-order you make that other Bézier curve. However, we can chop up a curve into "safe" sub-curves (where "safe" means that all the control points are always on a single side of the baseline, and the midpoint of the curve at t=0.5 is roughly in the center of the polygon defined by the curve coordinates) and then point-scale each sub-curve with respect to its scaling origin (which is the intersection of the point normals at the start and end points).

    -

    A good way to do this reduction is to first find the curve's extreme points, as explained in the earlier section on curve extremities, and use these as initial splitting points. After this initial split, we can check each individual segment to see if it's "safe enough" based on where the center of the curve is. If the on-curve point for t=0.5 is too far off from the center, we simply split the segment down the middle. Generally this is more than enough to end up with safe segments.

    -

    The following graphics show off curve offsetting, and you can use the slider to control the distance at which the curve gets offset. The curve first gets reduced to safe segments, each of which is then offset at the desired distance. Especially for simple curves, particularly easily set up for quadratic curves, no reduction is necessary, but the more twisty the curve gets, the more the curve needs to be reduced in order to get segments that can safely be scaled.

    - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - -

    You may notice that this may still lead to small 'jumps' in the sub-curves when moving the curve around. This is caused by the fact that we're still performing a naive form of offsetting, moving the control points the same distance as the start and end points. If the curve is large enough, this may still lead to incorrect offsets.

    - -
    -
    -

    Graduated curve offsetting

    -

    What if we want to do graduated offsetting, starting at some distance s but ending at some other distance e? Well, if we can compute the length of a curve (which we can if we use the Legendre-Gauss quadrature approach) then we can also determine how far "along the line" any point on the curve is. With that knowledge, we can offset a curve so that its offset curve is not uniformly wide, but graduated between with two different offset widths at the start and end.

    -

    Like normal offsetting we cut up our curve in sub-curves, and then check at which distance along the original curve each sub-curve starts and ends, as well as to which point on the curve each of the control points map. This gives us the distance-along-the-curve for each interesting point in the sub-curve. If we call the total length of all sub-curves seen prior to seeing "the current" sub-curve S (and if the current sub-curve is the first one, S is zero), and we call the full length of our original curve L, then we get the following graduation values:

    -
      -
    • start: map S from interval (0,L) to interval (s,e)
    • -
    • c1: map(<strong>S+d1</strong>, 0,L, s,e), d1 = distance along curve to projection of c1
    • -
    • c2: map(<strong>S+d2</strong>, 0,L, s,e), d2 = distance along curve to projection of c2
    • -
    • ...
    • -
    • end: map(<strong>S+length(subcurve)</strong>, 0,L, s,e)
    • -
    -

    At each of the relevant points (start, end, and the projections of the control points onto the curve) we know the curve's normal, so offsetting is simply a matter of taking our original point, and moving it along the normal vector by the offset distance for each point. Doing so will give us the following result (these have with a starting width of 0, and an end width of 40 pixels, but can be controlled with your up and down arrow keys):

    - - - Scripts are disabled. Showing fallback image. - - - - - - - - Scripts are disabled. Showing fallback image. - - - - - - -
    -
    -

    Circles and quadratic Bézier curves

    -

    Circles and Bézier curves are very different beasts, and circles are infinitely easier to work with than Bézier curves. Their formula is much simpler, and they can be drawn more efficiently. But, sometimes you don't have the luxury of using circles, or ellipses, or arcs. Sometimes, all you have are Bézier curves. For instance, if you're doing font design, fonts have no concept of geometric shapes, they only know straight lines, and Bézier curves. OpenType fonts with TrueType outlines only know quadratic Bézier curves, and OpenType fonts with Type 2 outlines only know cubic Bézier curves. So how do you draw a circle, or an ellipse, or an arc?

    -

    You approximate.

    -

    We already know that Bézier curves cannot model all curves that we can think of, and this includes perfect circles, as well as ellipses, and their arc counterparts. However, we can certainly approximate them to a degree that is visually acceptable. Quadratic and cubic curves offer us different curvature control, so in order to approximate a circle we will first need to figure out what the error is if we try to approximate arcs of increasing degree with quadratic and cubic curves, and where the coordinates even lie.

    -

    Since arcs are mid-point-symmetrical, we need the control points to set up a symmetrical curve. For quadratic curves this means that the control point will be somewhere on a line that intersects the baseline at a right angle. And we don't get any choice on where that will be, since the derivatives at the start and end point have to line up, so our control point will lie at the intersection of the tangents at the start and end point.

    -

    First, let's try to fit the quadratic curve onto a circular arc. In the following sketch you can move the mouse around over a unit circle, to see how well, or poorly, a quadratic curve can approximate the arc from (1,0) to where your mouse cursor is:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    As you can see, things go horribly wrong quite quickly; even trying to approximate a quarter circle using a quadratic curve is a bad idea. An eighth of a turns might look okay, but how okay is okay? Let's apply some maths and find out. What we're interested in is how far off our on-curve coordinates are with respect to a circular arc, given a specific start and end angle. We'll be looking at how much space there is between the circular arc, and the quadratic curve's midpoint.

    -

    We start out with our start and end point, and for convenience we will place them on a unit circle (a circle around 0,0 with radius 1), at some angle φ:

    - - -

    What we want to find is the intersection of the tangents, so we want a point C such that:

    - - -

    i.e. we want a point that lies on the vertical line through S (at some distance a from S) and also lies on the tangent line through E (at some distance b from E). Solving this gives us:

    - - -

    First we solve for b:

    - - -

    which yields:

    - - -

    which we can then substitute in the expression for a:

    - - -

    A quick check shows that plugging these values for a and b into the expressions for Cx and Cy give the same x/y coordinates for both "a away from A" and "b away from B", so let's continue: now that we know the coordinate values for C, we know where our on-curve point T for t=0.5 (or angle φ/2) is, because we can just evaluate the Bézier polynomial, and we know where the circle arc's actual point P is for angle φ/2:

    - - -

    We compute T, observing that if t=0.5, the polynomial values (1-t)², 2(1-t)t, and t² are 0.25, 0.5, and 0.25 respectively:

    - - -

    Which, worked out for the x and y components, gives:

    - - -

    And the distance between these two is the standard Euclidean distance:

    - - -

    So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter circle and eighth circle?

    -
    - - plotted for 0 ≤ φ ≤ π: - - - plotted for 0 ≤ φ ≤ ½π: - - - - - plotted for 0 ≤ φ ≤ ¼π: -
    + +

    + So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter + circle and eighth circle? +

    + + + + + + + + +
    + + plotted for 0 ≤ φ ≤ π: + + + plotted for 0 ≤ φ ≤ ½π: + + + + + plotted for 0 ≤ φ ≤ ¼π: +
    -

    We now see why the eighth circle arc looks decent, but the quarter circle arc doesn't: an error of roughly 0.06 at t=0.5 means we're 6% off the mark... we will already be off by one pixel on a circle with pixel radius 17. Any decent sized quarter circle arc, say with radius 100px, will be way off if approximated by a quadratic curve! For the eighth circle arc, however, the error is only roughly 0.003, or 0.3%, which explains why it looks so close to the actual eighth circle arc. In fact, if we want a truly tiny error, like 0.001, we'll have to contend with an angle of (rounded) 0.593667, which equates to roughly 34 degrees. We'd need 11 quadratic curves to form a full circle with that precision! (technically, 10 and ten seventeenth, but we can't do partial curves, so we have to round up). That's a whole lot of curves just to get a shape that can be drawn using a simple function!

    -

    In fact, let's flip the function around, so that if we plug in the precision error, labelled ε, we get back the maximum angle for that precision:

    - - -

    And frankly, things are starting to look a bit ridiculous at this point, we're doing way more maths than we've ever done, but thankfully this is as far as we need the maths to take us: If we plug in the precisions 0.1, 0.01, 0.001 and 0.0001 we get the radians values 1.748, 1.038, 0.594 and 0.3356; in degrees, that means we can cover roughly 100 degrees (requiring four curves), 59.5 degrees (requiring six curves), 34 degrees (requiring 11 curves), and 19.2 degrees (requiring a whopping nineteen curves).

    -

    The bottom line? Quadratic curves are kind of lousy if you want circular (or elliptical, which are circles that have been squashed in one dimension) curves. We can do better, even if it's just by raising the order of our curve once. So let's try the same thing for cubic curves.

    - -
    -
    -

    Circular arcs and cubic Béziers

    -

    Let's look at approximating circles and circular arcs using cubic Béziers. How much better is that?

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    At cursory glance, a fair bit better, but let's find out how much better by looking at how to construct the Bézier curve.

    -

    A construction diagram for a cubic approximation of a circular arc

    -

    The start and end points are trivial, but the mid point requires a bit of work, but it's mostly basic trigonometry once we know the angle θ for our circular arc: if we scale our circular arc to a unit circle, we can always start our arc, with radius 1, at (1,0) and then given our arc angle θ, we also know that the circular arc has length θ (because unit circles are nice that way). We also know our end point, because that's just (cos(θ), sin(θ)), and so the challenge is to figure out what control points we need in order for the curve at t=0.5 to exactly touch the circular arc at the angle θ/2:

    -

    So let's again formally describe this:

    - - -

    Only P3 isn't quite straight-forward here, and its description is based on the fact that the triangle (origin, P4, P3) is a right angled triangle, with the distance between the origin and P4 being 1 (because we're working with a unit circle), and the distance between P4 and P3 being k, so that we can represent P3 as "The point P4 plus the vector from the origin to P4 but then rotated a quarter circle, counter-clockwise, and scaled by k".

    -

    With that, we can determine the y-coordinates for A, B, e1, and e2, after which we have all the information we need to determine what the value of k is. We can find these values by using (no surprise here) linear interpolation between known points, as A is midway between P2 and P3, e1 is between A and "midway between P1 and P2" (which is "half height" P2), and so forth:

    - - -

    Which now gives us two identities for B, because in addition to determining B through linear interpolation, we also know that B's y coordinate is just sin(θ/2): we started this exercise by saying we were going to approximate the circular arc using a Bézier curve that had its midpoint, which is point B, touching the unit circle at the arc's half-angle, by definition making B the point at (cos(θ/2), sin(θ/2)).

    -

    This means we can equate the two identities we now have for By and solve for k.

    -
    - -

    Deriving k

    -

    Solving for k is fairly straight forward, but it's a fair few steps, and if you just the immediate result: using a tool like Wolfram Alpha is definitely the way to go. That said, let's get going:

    - - -

    And finally, we can take further advantage of several trigonometric identities to drastically simplify our formula for k:

    - - -

    And we're done.

    -
    + +

    And we're done.

    + -

    So, the distance of our control points to the start/end points can be expressed as a number that we get from an almost trivial expression involving the circular arc's angle:

    - - -

    Which means that for any circular arc with angle θ and radius r, our Bézier approximation based on three points of incidence is:

    - - -

    Which also gives us the commonly found value of 0.55228 for quarter circles, based on them having an angle of half π:

    - - -

    And thus giving us the following Bézier coordinates for a quarter circle of radius r:

    - - -
    + +
    +

    So, how accurate is this?

    +

    + Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points + that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum + deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but + rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we + get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff. +

    +

    + So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, + finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around + that t value, repeating as many times as necessary to get us an arbitrarily precise value of t: +

    -

    So, how accurate is this?

    -

    Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff.

    -

    So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around that t value, repeating as many times as necessary to get us an arbitrarily precise value of t:

    - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    Plus, how often do you get to write a function with that name?

    -

    Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity first, and then coming back down as we approach 2π)

    - - -
    - - - - - -
    - error plotted for 0 ≤ φ ≤ 2π - - error plotted for 0 ≤ φ ≤ π - - error plotted for 0 ≤ φ ≤ ½π -
    + return getMostWrongT(radius, bezier, worst_t - stepsize, worst_t + stepsize) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again:

    -

    +

    Plus, how often do you get to write a function with that name?

    +

    + Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see + what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity + first, and then coming back down as we approach 2π) +

    + + + + + + + + + + + + + +
    + + + + + +
    + error plotted for 0 ≤ φ ≤ 2π + + error plotted for 0 ≤ φ ≤ π + + error plotted for 0 ≤ φ ≤ ½π +
    -

    Yeah... the error of a cubic approximation for a quarter circle turns out to be two orders of magnitude better. At approximately 0.00027 (or: just shy of being 2.7 pixels off for a circle with a radius of 10,000 pixels) the increase in precision over quadratic curves is quite spectacular - certainly good enough that no one in their right mind should ever use quadratic curves.

    -
    +

    + That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we + can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again: +

    +

    -

    So that's it, kappa is 4/3 · tan(θ/4) , we're done! ...or are we?

    -

    Can we do better?

    -

    Technically: yes, we can. But I'm going to prefix this section with "we can, and we should investigate that possibility, but let me warn you up front that the result is only better if we're going to hard-code the values". We're about to get into the weeds and the standard three-points-of-incidence value is so good already that for most applications, trying to do better won't make any sense at all.

    -

    So with that said: what we calculated above is an upper bound for a best fit Bézier curve for a circular arc: anywhere we don't touch the circular arc in our approximation, we've "overshot" the arc. What if we dropped our value for k just a little, so that the curve starts out as an over-estimation, but then crosses the circular arc, yielding an region of underestimation, and then crosses the circular arc again, with another region of overestimation. This might give us a lower overall error, so let's see what we can do.

    -

    First, let's express the total error (given circular arc angle θ, and some k) using standard calculus notation:

    - - -

    This says that the error function for a given angle and value of k is equal to the "infinite" sum of differences between our curve and the circular arc, as we run t from 0 to 1, using an infinitely small step size. between subsequent t values.

    -

    Now, since we want to find the minimal error, that means we want to know where along this function things go from "error is getting progressively less" to "error is increasing again", which means we want to know where its derivative is zero, which as mathematical expression looks like:

    - - -

    And here we have the most direct application of the Fundamental Theorem of Calculus: the derivative and integral are each other's inverse operations, so they cancel out, leaving us with our original function:

    - - -

    And now we just solve for that... oh wait. We've seen this before. In order to solve this, we'd end up needing to solve this:

    - - -

    And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section on arc lengths—there is no symbolic solution for this equasion. Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer!

    -
    + +

    + And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section + on arc lengths—there is no symbolic solution for this equasion. + Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer! +

    +
    +

    Iterating on a solution

    +

    + By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the + value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's + center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just + binary search our way to the most accurate value for c that gets us that middle case. +

    +

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    -

    Iterating on a solution

    -

    By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just binary search our way to the most accurate value for c that gets us that middle case.

    -

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    - - - - -
    1 - -
    2
    3
    4
    - -

    And then the binary search algorithm, which can be found in pretty much any CS textbook, as well as more online articles, tutorials, and blog posts than you can ever read in a life time:

    + return binarySearch(radius, angle, points, lowerBound, upperBound) + + + + 2 + + + 3 + + + 4 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Using the following radialError function, which samples the curve's approximation of the circular arc over several points (although the first and last point will never contribute anything, so we skip them):

    + return binarySearch(radius, angle, points, value, upperBound) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a vector, we can get its length to the origin using a magnitude call.

    -

    Examining the result

    -

    Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, for each angle, plot what the difference between the circular arc and the Bézier approximation looks like:

    -

    image-20210419085430711

    -

    Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from 0 to θ:

    -

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    - - -
    - - - - - -
    - max deflection using unit scale - - max deflection at 10x scale - - max deflection at 100x scale -
    + return err + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    -

    As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference.

    -
    +

    + In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a + vector, we can get its length to the origin using a magnitude call. +

    +

    Examining the result

    +

    + Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, + for each angle, plot what the difference between the circular arc and the Bézier approximation looks like: +

    +

    image-20210419085430711

    +

    + Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the + plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from + 0 to θ: +

    +

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    + + + + + + + + + + + + + +
    + + + + + +
    + max deflection using unit scale + + max deflection at 10x scale + + max deflection at 100x scale +
    -

    At this point it should be clear that while, yes, there are improvement to be had, they're essentially insignificant while also being much more computationally expensive.

    -

    TL;DR: just tell me which value I should be using

    -

    It depends on what we need to do. If we just want the best value for quarter circles, and we're going to hard code the value for k, then there is no reason to hard-code the constant k=4/3*tan(pi/8) when you can just as easily hard-code the constant as k=0.551784777779014 instead.

    -

    If you need "the" value for quarter circles, use 0.551785 instead of 0.55228

    -

    However, for dynamic arc approximation, in code that tries to fit circular paths using Bézier paths instead, it should be fairly obvious that the simple function involving a tangent computation, two divisions, and one multiplication, is vastly more performant than running all the code we ended writing just to get a 25% lower error value, and most certainly worth preferring over getting the "more accurate" value.

    -

    If you need to fit Béziers to circular arcs on the fly, use 4/3 * tan(θ/4)

    -

    However, always remember that if you're writing for humans, you can typically use the best of both worlds: as the user interacts with their curves, you should draw their curves instead of drawing approximations of them. If they need to draw circles or circular arcs, draw those, and only approximate them with a Bézier curve when the data needs to be exported to a format that doesn't support those. Ideally with a preview mechanism that highlights where the errors will be, and how large they will be.

    -

    If you're writing code for graphics design by humans, use circular arcs for circular arcs

    -

    And that's it. We have pretty well exhausted this subject. There are different metrics we could use to find "different best k values", like trying to match arc length (e.g. when we're optimizing for material cost), or minimizing the area between the circular arc and the Bézier curve (e.g. when we're optimizing for inking), or minimizing the rate of change of the Bézier's curvature (e.g. when we're optimizing for curve traversal) and they all yield values that are so similar that it's almost certainly not worth it. (For instance, for quarter circle approximations those values are 0.551777, 0.5533344, and 0.552184 respectively. Much like the 0.551785 we get from minimizing the maximum deflection, none of these values are significantly better enough to prefer them over the upper bound value).

    +

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    +

    + As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by + 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an + almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference. +

    +
    -
    -
    -

    Approximating Bézier curves with circular arcs

    -

    Let's look at doing the exact opposite of the previous section: rather than approximating circular arc using Bézier curves, let's approximate Bézier curves using circular arcs.

    -

    We already saw in the section on circle approximation that this will never yield a perfect equivalent, but sometimes you need circular arcs, such as when you're working with fabrication machinery, or simple vector languages that understand lines and circles, but not much else.

    -

    The approach is fairly simple: pick a starting point on the curve, and pick two points that are further along the curve. Determine the circle that goes through those three points, and see if it fits the part of the curve we're trying to approximate. Decent fit? Try spacing the points further apart. Bad fit? Try spacing the points closer together. Keep doing this until you've found the "good approximation/bad approximation" boundary, record the "good" arc, and then move the starting point up to overlap the end point we previously found. Rinse and repeat until we've covered the entire curve.

    -

    We already saw how to fit a circle through three points in the section on creating a curve from three points, and finding the arc through those points is straight-forward: pick one of the three points as start point, pick another as an end point, and the arc has to necessarily go from the start point, to the end point, over the remaining point.

    -

    So, how can we convert a Bézier curve into a (sequence of) circular arc(s)?

    -
      -
    • Start at t=0
    • -
    • Pick two points further down the curve at some value m = t + n and e = t + 2n
    • -
    • Find the arc that these points define
    • -
    • Determine how close the found arc is to the curve:
        -
      • Pick two additional points e1 = t + n/2 and e2 = t + n + n/2.
      • -
      • These points, if the arc is a good approximation of the curve interval chosen, should - lie on the circle, so their distance to the center of the circle should be the - same as the distance from any of the three other points to the center.
      • -
      • For point points, determine the (absolute) error between the radius of the circle, and the -actual distance from the center of the circle to the point on the curve.
      • -
      • If this error is too high, we consider the arc bad, and try a smaller interval.
      • -
      -
    • -
    -

    The result of this is shown in the next graphic: we start at a guaranteed failure: s=0, e=1. That's the entire curve. The midpoint is simply at t=0.5, and then we start performing a binary search.

    -
      -
    1. We start with low=0, mid=0.5 and high=1
    2. -
    3. That'll fail, so we retry with the interval halved: {0, 0.25, 0.5}
        -
      • If that arc's good, we move back up by half distance: {0, 0.375, 0.75}.
      • -
      • However, if the arc was still bad, we move down by half the distance: {0, 0.125, 0.25}.
      • -
      -
    4. -
    5. We keep doing this over and over until we have two arcs, in sequence, of which the first arc is good, and the second arc is bad. When we find that pair, we've found the boundary between a good approximation and a bad approximation, and we pick the good arc.
    6. -
    -

    The following graphic shows the result of this approach, with a default error threshold of 0.5, meaning that if an arc is off by a combined half pixel over both verification points, then we treat the arc as bad. This is an extremely simple error policy, but already works really well. Note that the graphic is still interactive, and you can use your up and down arrow keys keys to increase or decrease the error threshold, to see what the effect of a smaller or larger error threshold is.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    With that in place, all that's left now is to "restart" the procedure by treating the found arc's end point as the new to-be-determined arc's starting point, and using points further down the curve. We keep trying this until the found end point is for t=1, at which point we are done. Again, the following graphic allows for up and down arrow key input to increase or decrease the error threshold, so you can see how picking a different threshold changes the number of arcs that are necessary to reasonably approximate a curve:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    So... what is this good for? Obviously, if you're working with technologies that can't do curves, but can do lines and circles, then the answer is pretty straightforward, but what else? There are some reasons why you might need this technique: using circular arcs means you can determine whether a coordinate lies "on" your curve really easily (simply compute the distance to each circular arc center, and if any of those are close to the arc radii, at an angle between the arc start and end, bingo, this point can be treated as lying "on the curve"). Another benefit is that this approximation is "linear": you can almost trivially travel along the arcs at fixed speed. You can also trivially compute the arc length of the approximated curve (it's a bit like curve flattening). The only thing to bear in mind is that this is a lossy equivalence: things that you compute based on the approximation are guaranteed "off" by some small value, and depending on how much precision you need, arc approximation is either going to be super useful, or completely useless. It's up to you to decide which, based on your application!

    - -
    -
    -

    B-Splines

    -

    No discussion on Bézier curves is complete without also giving mention of that other beast in the curve design space: B-Splines. Easily confused to mean Bézier splines, that's not actually what they are; they are "basis function" splines, which makes a lot of difference, and we'll be looking at those differences in this section. We're not going to dive as deep into B-Splines as we have for Bézier curves (that would be an entire primer on its own) but we'll be looking at how B-Splines work, what kind of maths is involved in computing them, and how to draw them based on a number of parameters that you can pick for individual B-Splines.

    -

    First off: B-Splines are piecewise, polynomial interpolation curves, where the "single curve" is built by performing polynomial interpolation over a set of points, using a sliding window of a fixed number of points. For instance, a "cubic" B-Spline defined by twelve points will have its curve built by evaluating the polynomial interpolation of four points, and the curve can be treated as a lot of different sections, each controlled by four points at a time, such that the full curve consists of smoothly connected sections defined by points {1,2,3,4}, {2,3,4,5}, ..., {8,9,10,11}, and finally {9,10,11,12}, for eight sections.

    -

    What do they look like? They look like this! Tap on the graphic to add more points, and move points around to see how they map to the spline curve drawn.

    - - - Scripts are disabled. Showing fallback image. - - - -

    The important part to notice here is that we are not doing the same thing with B-Splines that we do for poly-Béziers or Catmull-Rom curves: both of the latter simply define new sections as literally "new sections based on new points", so a 12 point cubic poly-Bézier curve is actually impossible, because we start with a four point curve, and then add three more points for each section that follows, so we can only have 4, 7, 10, 13, 16, etc. point Poly-Béziers. Similarly, while Catmull-Rom curves can grow by adding single points, this addition of a single point introduces three implicit Bézier points. Cubic B-Splines, on the other hand, are smooth interpolations of each possible curve involving four consecutive points, such that at any point along the curve except for our start and end points, our on-curve coordinate is defined by four control points.

    -

    Consider the difference to be this:

    -
      -
    • for Bézier curves, the curve is defined as an interpolation of points, but:
    • -
    • for B-Splines, the curve is defined as an interpolation of curves.
    • -
    -

    In fact, let's look at that again, but this time with the base curves shown, too. Each consecutive four points define one curve:

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have a look at how things work.

    -

    How to compute a B-Spline curve: some maths

    -

    Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the end, just like for Bézier curves), by evaluating the following function:

    - + +

    + In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have + a look at how things work. +

    +

    How to compute a B-Spline curve: some maths

    +

    + Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic + B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the + end, just like for Bézier curves), by evaluating the following function: +

    + - -

    Which, honestly, doesn't tell us all that much. All we can see is that a point on a B-Spline curve is defined as "a mix of all the control points, weighted somehow", where the weighting is achieved through the N(...) function, subscripted with an obvious parameter i, which comes from our summation, and some magical parameter k. So we need to know two things: 1. what does N(t) do, and 2. what is that k? Let's cover both, in reverse order.

    -

    The parameter k represents the "knot interval" over which a section of curve is defined. As we learned earlier, a B-Spline curve is itself an interpolation of curves, and we can treat each transition where a control point starts or stops influencing the total curvature as a "knot on the curve". -Doing so for a degree d B-Spline with n control point gives us d + n + 1 knots, defining d + n intervals along the curve, and it is these intervals that the above k subscript to the N() function applies to.

    -

    Then the N() function itself. What does it look like?

    - - -

    So this is where we see the interpolation: N(t) for an (i,k) pair (that is, for a step in the above summation, on a specific knot interval) is a mix between N(t) for (i,k-1) and N(t) for (i+1,k-1), so we see that this is a recursive iteration where i goes up, and k goes down, so it seem reasonable to expect that this recursion has to stop at some point; obviously, it does, and specifically it does so for the following i/k values:

    - - -

    And this function finally has a straight up evaluation: if a t value lies within a knot-specific interval once we reach a k=1 value, it "counts", otherwise it doesn't. We did cheat a little, though, because for all these values we need to scale our t value first, so that it lies in the interval bounded by knots[d] and knots[n], which are the start point and end point where curvature is controlled by exactly order control points. For instance, for degree 3 (=order 4) and 7 control points, with knot vector [1,2,3,4,5,6,7,8,9,10,11], we map t from [the interval 0,1] to the interval [4,8], and then use that value in the functions above, instead.

    -

    Can we simplify that?

    -

    We can, yes.

    -

    People far smarter than us have looked at this work, and two in particular — Maurice Cox and Carl de Boor — came to a mathematically pleasing solution: to compute a point P(t), we can compute this point by evaluating d(t) on a curve section between knots i and i+1:

    - - -

    This is another recursive function, with k values decreasing from the curve order to 1, and the value α (alpha) defined by:

    - - -

    That looks complicated, but it's not. Computing alpha is just a fraction involving known, plain numbers. And, once we have our alpha value, we also have (1-alpha) because it's a trivial subtraction. Computing the d() function is thus mostly a matter of computing pretty simple arithmetical statements, with some caching of results so we can refer to them as we recurve. While the recursion might see computationally expensive, the total algorithm is cheap, as each step only involves very simple maths.

    -

    Of course, the recursion does need a stop condition:

    - - -

    So, we actually see two stopping conditions: either i becomes 0, in which case d() is zero, or k becomes zero, in which case we get the same "either 1 or 0" that we saw in the N() function above.

    -

    Thanks to Cox and de Boor, we can compute points on a B-Spline pretty easily using the same kind of linear interpolation we saw in de Casteljau's algorithm. For instance, if we write out d() for i=3 and k=3, we get the following recursion diagram:

    - - -

    That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. It's really quite elegant.

    -

    One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then summing back up "to the left". We can just start on the right and work our way left immediately.

    -

    Running the computation

    -

    Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on this Michigan Tech page, but an easier to read version is implemented by b-spline.js, so we'll look at its code.

    -

    Given an input value t, we first map the input to a value from the domain [0,1] to the domain [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped t value lies on:

    + +

    + That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a + mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are + themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. + It's really quite elegant. +

    +

    + One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the + d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so + in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point + vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then + summing back up "to the left". We can just start on the right and work our way left immediately. +

    +

    Running the computation

    +

    + Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, + and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on + this Michigan Tech page, but an easier to read version + is implemented by b-spline.js, so we'll look at its code. +

    +

    + Given an input value t, we first map the input to a value from the domain [0,1] to the domain + [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped + t value lies on: +

    - - -
    1 - -
    2
    3
    - -

    after running this code, s is the index for the section the point will lie on. We then run the algorithm mentioned on the MU page (updated to use this description's variable names):

    +} + + + + 2 + + + 3 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those v[i-1] don't try to use an array index that doesn't exist)

    -

    Open vs. closed paths

    -

    Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need more than just a single point.

    -

    Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than separate coordinate objects.

    -

    Manipulating the curve through the knot vector

    -

    The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at:

    -
      -
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. -
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. -
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. -
    7. we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a uniform vector in between.
    8. -
    -

    Uniform B-Splines

    -

    The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to the curvature.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve.

    -

    The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get rid of these, by being clever about how we apply the following uniformity-breaking approach instead...

    -

    Reducing local curve complexity by collapsing intervals

    -

    Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost and the curve "kinks".

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Open-Uniform B-Splines

    -

    By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the curve not starting and ending where we'd kind of like it to:

    -

    For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences that determine the curve shape.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Non-uniform B-Splines

    -

    This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than that any value knots[k+1] should be greater than or equal to knots[k].

    -

    One last thing: Rational B-Splines

    -

    While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like for rational Bézier curves.

    - - - Scripts are disabled. Showing fallback image. - - - - - -

    Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in arbitrary-precision 2D design due to the level of control a NURBS curve offers designers.

    -

    While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the last.

    -

    Extending our implementation to cover rational splines

    -

    The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the extended dimension.

    -

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    -

    We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how many dimensions it needs to interpolate.

    -

    In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight information.

    -

    Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing (x'/w', y'/w'). And that's it, we're done!

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    - +

    + (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the + interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those + v[i-1] don't try to use an array index that doesn't exist) +

    +

    Open vs. closed paths

    +

    + Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last + point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first + and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As + such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly + more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need + more than just a single point. +

    +

    + Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for + points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same + coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing + references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than + separate coordinate objects. +

    +

    Manipulating the curve through the knot vector

    +

    + The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As + mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the + values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on + intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting + things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at: +

    +
      +
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. +
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. +
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. +
    7. + we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a + uniform vector in between. +
    8. +
    +

    Uniform B-Splines

    +

    + The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire + curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or + [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines + the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to + the curvature. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every + interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve. +

    +

    + The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can + perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get + rid of these, by being clever about how we apply the following uniformity-breaking approach instead... +

    +

    Reducing local curve complexity by collapsing intervals

    +

    + Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the + sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, + and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost + and the curve "kinks". +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    Open-Uniform B-Splines

    +

    + By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the + curve not starting and ending where we'd kind of like it to: +

    +

    + For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in + which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values + N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector + [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences + that determine the curve shape. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    Non-uniform B-Splines

    +

    + This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to + pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than + that any value knots[k+1] should be greater than or equal to knots[k]. +

    +

    One last thing: Rational B-Splines

    +

    + While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's + "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a + ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a + control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like + for rational Bézier curves. +

    + + + Scripts are disabled. Showing fallback image. + + + + + +

    + Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the + NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, + the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an + important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in + arbitrary-precision 2D design due to the level of control a NURBS curve offers designers. +

    +

    + While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform + Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, + they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the + last. +

    +

    Extending our implementation to cover rational splines

    +

    + The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the + control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to + one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the + extended dimension. +

    +

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    +

    + We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate + interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how + many dimensions it needs to interpolate. +

    +

    + In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the + final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight + information. +

    +

    + Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing + (x'/w', y'/w'). And that's it, we're done! +

    +
    +
    + -

    Comments and questions

    -

    First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing.

    -

    With that said, on to the comments!

    -
    +

    + + Comments and questions +

    +

    + First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how + to let me know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing. +

    +

    With that said, on to the comments!

    +
    +
    +
    +
    +
    -
    - -
    + -
    - - - - - - - - - \ No newline at end of file + + + diff --git a/docs/ko-KR/index.html b/docs/ko-KR/index.html index e4c1a13c..decf7d13 100644 --- a/docs/ko-KR/index.html +++ b/docs/ko-KR/index.html @@ -1,164 +1,178 @@ + + + + 베지에 곡선 입문 - - - - 베지에 곡선 입문 + - + - - - - + - - - - @font-face { - font-family: "Noto Sans Korean"; - src: url(./fonts/noto-sans-korean-regular.woff) format("WOFF"); - } + - :root[lang="ko-KR"] { - font-family: "Noto Sans Korean", "맑은 고딕", Helvetica, Arial, sans-serif; - font-size: 17.2px; - line-height: 1.4; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    - + +
    +

    + 베지에 곡선 입문 +

    +

    베지에 곡선이 꼭 필요할 때 읽기 좋은 무료 온라인 서적.

    +
    + 다른 언어로 읽기: + +

    + (원하는 언어가 없거나, 100%까지 올리고 싶다면 + 내용 번역을 도와주세요!) +

    +
    -
    - -

    베지에 곡선 입문

    -

    베지에 곡선이 꼭 필요할 때 읽기 좋은 무료 온라인 서적.

    -
    - 다른 언어로 읽기: - -

    (원하는 언어가 없거나, 100%까지 올리고 싶다면 내용 번역을 도와주세요!)

    -
    - -

    - 베지에 곡선 입문서에 오신 것을 환영합니다. 이곳은 베지에 곡선의 수학과 프로그래밍을 - 모두 다루는 무료 웹사이트/eBook으로, 포토샵 패스부터 CSS easing 함수와 글꼴 테두리까지 - 어느 곳에서든 튀어나오는 바로 그 곡선을 다루고 그리는 데 대한 다양한 주제를 다룹니다. -

    -

    - 이 책은 처음인가요? 어서 오세요! 이 입문서에 없는 것 중 꼭 다루었으면 좋겠는 것이 있다면 - 이슈 트래커로 의견을 나누어 주세요! -

    - -

    기부와 후원

    +

    + 베지에 곡선 입문서에 오신 것을 환영합니다. 이곳은 베지에 곡선의 수학과 프로그래밍을 모두 다루는 무료 웹사이트/eBook으로, 포토샵 패스부터 CSS + easing 함수와 글꼴 테두리까지 어느 곳에서든 튀어나오는 바로 그 곡선을 다루고 그리는 데 대한 다양한 주제를 다룹니다. +

    +

    + 이 책은 처음인가요? 어서 오세요! 이 입문서에 없는 것 중 꼭 다루었으면 좋겠는 것이 있다면 + 이슈 트래커로 의견을 나누어 주세요! +

    -

    - 이 책을 연구나 작업 목적, 혹은 직접 소프트웨어를 작성하는 데 참고하고 계시다면, - 일회성 기부 - (소액 기부도 괜찮습니다)나 Patreon에서 후원자 등록을 고려해 주세요. - 저는 이 책을 유료료 집필하고 있지 않으니, 혹여나 이 사이트에서 도움을 받았고 오랫동안 유지되기를 원하신다면 - 지금까지 몇 년 동안 수백 잔의 커피를 마셨고 앞으로도 더 많은 커피가 필요하다는 것만 기억해 주세요. - 커피 한 잔을 사주시는 것만으로도 이 자료를 보존하는 데 도움을 주실 수 있습니다. -

    -

    - 특히 기업에서 이 책을 업무용이나 온보딩 참고서로 사용하고 계신다면, 부디 이 사이트에 후원을 부탁드립니다! - 재무부에서 후원 문의를 해주신다면 기쁘게 수락하겠습니다. -

    - +

    기부와 후원

    - -
    +
    -

    - — Pomax -

    - + +
    - -
    +
    +
    +

    머리말

    +

    + 2D에 그림을 그리려면 보통 선을 사용하고, 선이라고 하면 직선과 곡선의 두 종류로 나뉩니다. 전자는 컴퓨터에게 그리기를 시킬 줄 안다면 쉽게 그릴 + 수 있습니다. 시작점과 끝점만 입력하면 짜잔! 직선이 나왔습니다. 어렵지 않죠. +

    +

    + 하지만 곡선은 훨씬 까다롭습니다. 손그림으로는 곡선 그리기가 그렇게 쉬울 수 없었겠지만, 컴퓨터는 곡선을 어떻게 그릴지 기술하는 수학적 함수 + 없이 곡선을 그릴 수 없다는 제한이 있습니다. 실은 직선을 그릴 때조차도 함수가 필요하지만, 어차피 직선의 함수는 간단하니 컴퓨터를 사용할 때는 + 보통 이 사실을 무시합니다. 직선이든 곡선이든 모든 선은 "함수"입니다. 그러나 선 그리기에 함수가 필요하기에, 빠르게 계산할 수 있고 컴퓨터로 + 그렸을 때 보기도 좋은 함수를 만들어야 할 필요성이 생깁니다. 이런 함수는 여러 종류가 있는데, 이 글에서 다룰 것은 이 중 많은 주목을 받고 + 곡선을 그릴 수 있으면 어디서나 쓰이는 것, 바로 베지에 곡선입니다. +

    +

    + 베지에 곡선이라는 이름은 이 곡선이 설계 작업을 하기에 좋다고 세상에 알린 + 피에르 베지에의 이름을 땄지만(1962년에 르노 소속으로서 연구 결과를 발표함), + 베지에가 이 곡선을 "발명"한 최초의, 혹은 유일한 사람은 아닙니다. 베지에 곡선을 처음 발견한 사람은 1959년에 시트로엥에서 일하면서 이 곡선의 + 성질을 연구하고 이 곡선을 그리는 정말 우아한 방법을 찾아낸 폴 드 카스텔죠라고 + 하는 경우도 있습니다. 그러나 드 카스텔죠는 본인의 연구 결과를 발표하지 않았기 때문에 누가 먼저인지를 단언하기는 어렵습니다... 정말일까요? + 사실 베지에 곡선은 근본적으로 "베른시테인 다항식"으로, + 세르게이 나타노비치 베른시테인이 연구하여 자그마치 1912년에 발표한 수학적 함수의 집합입니다. +

    +

    + 아무튼 위의 내용은 잡설이었고, 독자가 관심을 가질 만한 내용은 이 곡선이 편리하다는 것입니다. 여러 개의 베지에 곡선을 연결해서 하나의 + 곡선처럼 보이게 만들 수도 있습니다. 포토샵에서 패스를 한 번이라도 그려봤거나, 플래시, 일러스트레이터, 잉크스케이프 등 벡터 드로잉 프로그램을 + 써본 적이 있다면, 이 프로그램에서 그동안 그리던 곡선이 전부 베지에 곡선입니다. +

    +

    + 그런데 이를 직접 프로그래밍하려면 어떻게 해야 할까요? 함정이 어디에 도사리고 있을까요? 곡선의 바운딩 박스는 어떻게 구하고, 교차 판정은 + 어떻게 하고, 밖으로 튀어나오게는 어떻게 하고... 간단히 말해서, 이 곡선으로 할 만한 온갖 것들을 어떻게 하는 것이 좋을까요? 그것이 바로 이 + 책의 존재 이유입니다. 수학 할 준비 되셨나요? +

    +
    +

    거의 모든 베지에 그래픽이 인터랙티브입니다.

    +

    + 이 페이지에서는 Bezier.js를 사용하여 인터랙티브 예제를 제공하며, 수학 공식은 + XeLaTeX 조판 체계를 이용해 조판하고 David Barton님의 + pdf2svg를 이용해 SVG로 변환하여 표시하고 있습니다. +

    +

    이 책은 오픈 소스입니다.

    +

    + 이 책은 오픈 소스 소프트웨어 프로젝트의 일환으로, 소스 코드가 GitHub 레포지토리 두 곳에 보관되어 있습니다. 하나는 + https://github.com/pomax/bezierinfo로, 지금 읽고 계시는 완전히 공개용인 버전입니다. 다른 + 하나는 개발 버전인 https://github.com/pomax/BezierInfo-2로 + 웹 버전으로 변환되는 코드가 보관되어 있습니다. 버그를 찾아서 제보하려고 하거나 입문서의 내용을 수정하거나 추가할 아이디어가 + 있다면 이곳으로 보내 주세요. +

    +

    이 책의 수식은 얼마나 어렵나요?

    +

    + 입문서에 등장하는 대부분의 수식은 고등학교 수준입니다. 기본적인 산수 계산을 할 수 있고 영어를 읽을 줄 안다면 대부분 이해할 수 있을 + 정도입니다. 훨씬 어려운 수식도 가끔 등장하지만, 너무 어려워 보이는 상자 안의 세부 내용을 건너뛰거나, 읽고 있는 장의 끝으로 + 건너뛰어도 좋습니다. 각 장의 끝에는 도출된 값만 활용할 수 있도록 결론을 단순 나열합니다. +

    +

    예제 코드는 무슨 언어로 되어 있나요?

    +

    + 특정한 프로그래밍 언어 하나를 선호하기에는 프로그래밍 언어가 너무 많기 때문에, 입문서에 수록된 모든 예제는 JS나 Python 등 현대적인 + 스크립트 언어와 문법이 어느 정도 비슷하지만 같지는 않은 의사코드로 작성했습니다. 그렇기 때문에 예제 코드를 생각 없이 복사해서 쓸 수 + 없지만, 이는 의도적인 결정입니다. 이 입문서를 보고 계시는 것은 웬만하면 배우기 위해서이고, 배움은 복붙으로 이루어지지 않죠. + 학습은 무언가를 직접 해 보고, 실수를 하며, 그 실수를 고치면서 이루어집니다. 독자들이 실수를 하라고 일부러 예제 코드에 오류를 + 넣는다거나 하는 것은 당연히 아니지만(그건 끔찍하겠죠!), 어떤 한 프로그래밍 언어에 편중된 코드는 의도적으로 지양하고 있습니다. + 절차적 프로그래밍 언어를 하나라도 알고 있다면 예제 코드를 읽는 데는 전혀 어려움이 없으니 걱정은 하지 말아주세요. +

    +

    질문이나 의견이 있다면

    +

    + 새로운 장을 제안하고 싶다면, GitHub 이슈 트래커를 찾아 주세요(오른쪽 위에 + 링크한 레포지토리에서도 확인할 수 있습니다). 코드 재작성이 끝날 때까지는 댓글창이 없을 예정이지만, 내용에 대한 질문이 있다면 역시 이슈 + 트래커를 이용할 수 있습니다. 재작성을 마치면 내용 전반에 대한 댓글창을 추가하고, 확실치는 않지만 아마 "이 부분을 마우스로 선택하고 질문 + 버튼을 눌러서 질문하기" 시스템을 만들 수도 있겠습니다. +

    +

    이 책을 지원해 주세요!

    +

    + 이 책을 재미있게 읽었거나 하려고 했던 작업에 도움이 되었고, 제게 고마움을 표하고 싶다면 두 가지 방법이 있습니다. 이 책의 + Patreon 페이지에서 정기 후원을 하거나, 일회성 후원을 하고 싶다면 + buy Pomax a coffee 페이지를 찾아 + 주세요. 이 책은 지난 몇 년 동안 베지에 곡선에 관한 조그마한 입문서에서 종이책 100페이지짜리 교재로 커졌고, 책을 쓰면서 수백 잔의 커피가 + 들었습니다. 저는 이 책을 쓰게 된 것이 전혀 후회되지 않지만, 약간의 커피만 더 있으면 집필을 이어나갈 수 있겠죠! +

    +
    +
    +
    +

    업데이트 내역

    +

    + 이 입문서는 "살아 숨쉬는" 문서로, 오랜만에 다시 찾아왔다면 새로운 내용이 추가되었을 수도 있습니다. 아래 링크를 눌러서 무엇이 언제 + 추가되었는지 확인할 수도 있고, 뉴스 포스트(영문, RSS 피드도 있습니다)에서 자세한 내역을 + 확인할 수도 있습니다. +

    + + + +
    +

    November 2020

    +
      +
    • Added a section on finding curve/circle intersections

    • +
    +

    October 2020

    +
      +
    • Added the Ukranian locale! Help out in getting its localization to 100%!

    • +
    +

    August-September 2020

    +
      +
    • +

      + Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS + turned on. +

      +
    • +
    +

    June 2020

    +
      +
    • Added automatic CI/CD using Github Actions

    • +
    +

    January 2020

    +
      +
    • Added reset buttons to all graphics

    • +
    • Updated to preface to correctly describe the on-page maths

    • +
    • Fixed the Catmull-Rom section because it had glaring maths errors

    • +
    +

    August 2019

    +
      +
    • Added a section on (plain) rational Bezier curves

    • +
    • Improved the Graphic component to allow for sliders

    • +
    +

    December 2018

    +
      +
    • Added a section on curvature and calculating kappa.

    • +
    • +

      + Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this + site! +

      +
    • +
    +

    August 2018

    +
      +
    • Added a section on finding a curve's y, if all you have is the x coordinate.

    • +
    +

    July 2018

    +
      +
    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

    • +
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

    • +
    • +

      (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      +
    • +
    +

    June 2018

    +
      +
    • Added a section on direct curve fitting.

    • +
    • Added source links for all graphics.

    • +
    • Added this "What's new?" section.

    • +
    +

    April 2017

    +
      +
    • Added a section on 3d normals.

    • +
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

    • +
    +

    February 2017

    +
      +
    • Finished rewriting the entire codebase for localization.

    • +
    +

    January 2016

    +
      +
    • Added a section to explain the Bezier interval.

    • +
    • Rewrote the Primer as a React application.

    • +
    +

    December 2015

    +
      +
    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

    • +
    • +

      + Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics. +

      +
    • +
    +

    May 2015

    +
      +
    • Switched over to pure JS rather than Processing-through-Processing.js

    • +
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

    • +
    +

    April 2015

    +
      +
    • Added a section on arc length approximations.

    • +
    +

    February 2015

    +
      +
    • Added a section on the canonical cubic Bezier form.

    • +
    +

    November 2014

    +
      +
    • Switched to HTTPS.

    • +
    +

    July 2014

    +
      +
    • Added the section on arc approximation.

    • +
    +

    April 2014

    +
      +
    • Added the section on Catmull-Rom fitting.

    • +
    +

    November 2013

    +
      +
    • Added the section on Catmull-Rom / Bezier conversion.

    • +
    • Added the section on Bezier cuves as matrices.

    • +
    +

    April 2013

    +
      +
    • Added a section on poly-Beziers.

    • +
    • Added a section on boolean shape operations.

    • +
    +

    March 2013

    +
      +
    • First drastic rewrite.

    • +
    • Added sections on circle approximations.

    • +
    • Added a section on projecting a point onto a curve.

    • +
    • Added a section on tangents and normals.

    • +
    • Added Legendre-Gauss numerical data tables.

    • +
    +

    October 2011

    +
      +
    • +

      + First commit for the bezierinfo site, based on the pre-Primer webpage that covered + the basics of Bezier curves in HTML with Processing.js examples. +

      +
    • +
    +
    +
    +
    +
    +

    + + 초고속 소개 +

    +

    + 간단하게 시작해 봅시다. "베지에 곡선"이라고 하면 아래 그림에서 볼 수 있는 형태의 것들을 말하는 것입니다. 어떤 시작점에서 시작해서 끝점에서 + 끝나고, 하나 이상의 "중간" 조절점이 그 사이의 모양에 관여합니다. 이 페이지에 있는 모든 그림은 인터랙티브이니 아래 곡선을 마음 가는 대로 + 바꿔 보세요! 조절점을 끌어 옮기면서 이에 따라 곡선의 모양이 어떻게 변하는지 살펴보세요. +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    -
    - -

    머리말

    -

    2D에 그림을 그리려면 보통 선을 사용하고, 선이라고 하면 직선과 곡선의 두 종류로 나뉩니다. 전자는 컴퓨터에게 그리기를 시킬 줄 안다면 쉽게 그릴 수 있습니다. 시작점과 끝점만 입력하면 짜잔! 직선이 나왔습니다. 어렵지 않죠.

    -

    하지만 곡선은 훨씬 까다롭습니다. 손그림으로는 곡선 그리기가 그렇게 쉬울 수 없었겠지만, 컴퓨터는 곡선을 어떻게 그릴지 기술하는 수학적 함수 없이 곡선을 그릴 수 없다는 제한이 있습니다. 실은 직선을 그릴 때조차도 함수가 필요하지만, 어차피 직선의 함수는 간단하니 컴퓨터를 사용할 때는 보통 이 사실을 무시합니다. 직선이든 곡선이든 모든 선은 "함수"입니다. 그러나 선 그리기에 함수가 필요하기에, 빠르게 계산할 수 있고 컴퓨터로 그렸을 때 보기도 좋은 함수를 만들어야 할 필요성이 생깁니다. 이런 함수는 여러 종류가 있는데, 이 글에서 다룰 것은 이 중 많은 주목을 받고 곡선을 그릴 수 있으면 어디서나 쓰이는 것, 바로 베지에 곡선입니다.

    -

    베지에 곡선이라는 이름은 이 곡선이 설계 작업을 하기에 좋다고 세상에 알린 피에르 베지에의 이름을 땄지만(1962년에 르노 소속으로서 연구 결과를 발표함), 베지에가 이 곡선을 "발명"한 최초의, 혹은 유일한 사람은 아닙니다. 베지에 곡선을 처음 발견한 사람은 1959년에 시트로엥에서 일하면서 이 곡선의 성질을 연구하고 이 곡선을 그리는 정말 우아한 방법을 찾아낸 폴 드 카스텔죠라고 하는 경우도 있습니다. 그러나 드 카스텔죠는 본인의 연구 결과를 발표하지 않았기 때문에 누가 먼저인지를 단언하기는 어렵습니다... 정말일까요? 사실 베지에 곡선은 근본적으로 "베른시테인 다항식"으로, 세르게이 나타노비치 베른시테인이 연구하여 자그마치 1912년에 발표한 수학적 함수의 집합입니다.

    -

    아무튼 위의 내용은 잡설이었고, 독자가 관심을 가질 만한 내용은 이 곡선이 편리하다는 것입니다. 여러 개의 베지에 곡선을 연결해서 하나의 곡선처럼 보이게 만들 수도 있습니다. 포토샵에서 패스를 한 번이라도 그려봤거나, 플래시, 일러스트레이터, 잉크스케이프 등 벡터 드로잉 프로그램을 써본 적이 있다면, 이 프로그램에서 그동안 그리던 곡선이 전부 베지에 곡선입니다.

    -

    그런데 이를 직접 프로그래밍하려면 어떻게 해야 할까요? 함정이 어디에 도사리고 있을까요? 곡선의 바운딩 박스는 어떻게 구하고, 교차 판정은 어떻게 하고, 밖으로 튀어나오게는 어떻게 하고... 간단히 말해서, 이 곡선으로 할 만한 온갖 것들을 어떻게 하는 것이 좋을까요? 그것이 바로 이 책의 존재 이유입니다. 수학 할 준비 되셨나요?

    -
    - -

    거의 모든 베지에 그래픽이 인터랙티브입니다.

    -

    이 페이지에서는 Bezier.js를 사용하여 인터랙티브 예제를 제공하며, 수학 공식은 XeLaTeX 조판 체계를 이용해 조판하고 David Barton님의 pdf2svg를 이용해 SVG로 변환하여 표시하고 있습니다.

    -

    이 책은 오픈 소스입니다.

    -

    이 책은 오픈 소스 소프트웨어 프로젝트의 일환으로, 소스 코드가 GitHub 레포지토리 두 곳에 보관되어 있습니다. 하나는 https://github.com/pomax/bezierinfo로, 지금 읽고 계시는 완전히 공개용인 버전입니다. 다른 하나는 개발 버전인 https://github.com/pomax/BezierInfo-2웹 버전으로 변환되는 코드가 보관되어 있습니다. 버그를 찾아서 제보하려고 하거나 입문서의 내용을 수정하거나 추가할 아이디어가 있다면 이곳으로 보내 주세요.

    -

    이 책의 수식은 얼마나 어렵나요?

    -

    입문서에 등장하는 대부분의 수식은 고등학교 수준입니다. 기본적인 산수 계산을 할 수 있고 영어를 읽을 줄 안다면 대부분 이해할 수 있을 정도입니다. 훨씬 어려운 수식도 가끔 등장하지만, 너무 어려워 보이는 상자 안의 세부 내용을 건너뛰거나, 읽고 있는 장의 끝으로 건너뛰어도 좋습니다. 각 장의 끝에는 도출된 값만 활용할 수 있도록 결론을 단순 나열합니다.

    -

    예제 코드는 무슨 언어로 되어 있나요?

    -

    특정한 프로그래밍 언어 하나를 선호하기에는 프로그래밍 언어가 너무 많기 때문에, 입문서에 수록된 모든 예제는 JS나 Python 등 현대적인 스크립트 언어와 문법이 어느 정도 비슷하지만 같지는 않은 의사코드로 작성했습니다. 그렇기 때문에 예제 코드를 생각 없이 복사해서 쓸 수 없지만, 이는 의도적인 결정입니다. 이 입문서를 보고 계시는 것은 웬만하면 배우기 위해서이고, 배움은 복붙으로 이루어지지 않죠. 학습은 무언가를 직접 해 보고, 실수를 하며, 그 실수를 고치면서 이루어집니다. 독자들이 실수를 하라고 일부러 예제 코드에 오류를 넣는다거나 하는 것은 당연히 아니지만(그건 끔찍하겠죠!), 어떤 한 프로그래밍 언어에 편중된 코드는 의도적으로 지양하고 있습니다. 절차적 프로그래밍 언어를 하나라도 알고 있다면 예제 코드를 읽는 데는 전혀 어려움이 없으니 걱정은 하지 말아주세요.

    -

    질문이나 의견이 있다면

    -

    새로운 장을 제안하고 싶다면, GitHub 이슈 트래커를 찾아 주세요(오른쪽 위에 링크한 레포지토리에서도 확인할 수 있습니다). 코드 재작성이 끝날 때까지는 댓글창이 없을 예정이지만, 내용에 대한 질문이 있다면 역시 이슈 트래커를 이용할 수 있습니다. 재작성을 마치면 내용 전반에 대한 댓글창을 추가하고, 확실치는 않지만 아마 "이 부분을 마우스로 선택하고 질문 버튼을 눌러서 질문하기" 시스템을 만들 수도 있겠습니다.

    -

    이 책을 지원해 주세요!

    -

    이 책을 재미있게 읽었거나 하려고 했던 작업에 도움이 되었고, 제게 고마움을 표하고 싶다면 두 가지 방법이 있습니다. 이 책의 Patreon 페이지에서 정기 후원을 하거나, 일회성 후원을 하고 싶다면 buy Pomax a coffee 페이지를 찾아 주세요. 이 책은 지난 몇 년 동안 베지에 곡선에 관한 조그마한 입문서에서 종이책 100페이지짜리 교재로 커졌고, 책을 쓰면서 수백 잔의 커피가 들었습니다. 저는 이 책을 쓰게 된 것이 전혀 후회되지 않지만, 약간의 커피만 더 있으면 집필을 이어나갈 수 있겠죠!

    -
    -
    -
    -

    업데이트 내역

    -

    이 입문서는 "살아 숨쉬는" 문서로, 오랜만에 다시 찾아왔다면 새로운 내용이 추가되었을 수도 있습니다. 아래 링크를 눌러서 무엇이 언제 추가되었는지 확인할 수도 있고, 뉴스 포스트(영문, RSS 피드도 있습니다)에서 자세한 내역을 확인할 수도 있습니다.

    - - - -

    November 2020

    • Added a section on finding curve/circle intersections

      -
    -

    October 2020

    • Added the Ukranian locale! Help out in getting its localization to 100%!

      -
    -

    August-September 2020

    • Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS turned on.

      -
    -

    June 2020

    • Added automatic CI/CD using Github Actions

      -
    -

    January 2020

    • Added reset buttons to all graphics

      -
    • -
    • Updated to preface to correctly describe the on-page maths

      -
    • -
    • Fixed the Catmull-Rom section because it had glaring maths errors

      -
    -

    August 2019

    • Added a section on (plain) rational Bezier curves

      -
    • -
    • Improved the Graphic component to allow for sliders

      -
    -

    December 2018

    • Added a section on curvature and calculating kappa.

      -
    • -
    • Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this site!

      -
    -

    August 2018

    • Added a section on finding a curve's y, if all you have is the x coordinate.

      -
    -

    July 2018

    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

      -
    • -
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

      -
    • -
    • (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      -
    -

    June 2018

    • Added a section on direct curve fitting.

      -
    • -
    • Added source links for all graphics.

      -
    • -
    • Added this "What's new?" section.

      -
    -

    April 2017

    • Added a section on 3d normals.

      -
    • -
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

      -
    -

    February 2017

    • Finished rewriting the entire codebase for localization.

      -
    -

    January 2016

    • Added a section to explain the Bezier interval.

      -
    • -
    • Rewrote the Primer as a React application.

      -
    -

    December 2015

    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

      -
    • -
    • Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics.

      -
    -

    May 2015

    • Switched over to pure JS rather than Processing-through-Processing.js

      -
    • -
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

      -
    -

    April 2015

    • Added a section on arc length approximations.

      -
    -

    February 2015

    • Added a section on the canonical cubic Bezier form.

      -
    -

    November 2014

    • Switched to HTTPS.

      -
    -

    July 2014

    • Added the section on arc approximation.

      -
    -

    April 2014

    • Added the section on Catmull-Rom fitting.

      -
    -

    November 2013

    • Added the section on Catmull-Rom / Bezier conversion.

      -
    • -
    • Added the section on Bezier cuves as matrices.

      -
    -

    April 2013

    • Added a section on poly-Beziers.

      -
    • -
    • Added a section on boolean shape operations.

      -
    -

    March 2013

    • First drastic rewrite.

      -
    • -
    • Added sections on circle approximations.

      -
    • -
    • Added a section on projecting a point onto a curve.

      -
    • -
    • Added a section on tangents and normals.

      -
    • -
    • Added Legendre-Gauss numerical data tables.

      -
    -

    October 2011

    • First commit for the bezierinfo site, based on the pre-Primer webpage that covered the basics of Bezier curves in HTML with Processing.js examples.

      -
    -
    -
    -

    초고속 소개

    -

    간단하게 시작해 봅시다. "베지에 곡선"이라고 하면 아래 그림에서 볼 수 있는 형태의 것들을 말하는 것입니다. 어떤 시작점에서 시작해서 끝점에서 끝나고, 하나 이상의 "중간" 조절점이 그 사이의 모양에 관여합니다. 이 페이지에 있는 모든 그림은 인터랙티브이니 아래 곡선을 마음 가는 대로 바꿔 보세요! 조절점을 끌어 옮기면서 이에 따라 곡선의 모양이 어떻게 변하는지 살펴보세요.

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    - -

    이런 곡선은 컴퓨터 지원 설계 및 제조(CAD/CAM) 분야뿐만 아니라 어도비 일러스트레이터, 어도비 포토샵, 잉크스케이프, GIMP 등의 그래픽 디자인 프로그램과 SVG, 오픈타입 글꼴(TTF/OTF) 등의 그래픽 기술에서도 자주 사용하고 있습니다. 정말 많은 곳에서 베지에 곡선을 쓰고 있으니, 베지에 곡선을 더 자세히 알아보고 싶다면... 이 책을 마저 읽어 봅시다!

    - -
    -
    -

    베지에 곡선은 어떻게 만드나요?

    -

    곡선의 점들을 이리저리 옮겨보고 나면 베지에 곡선이 어떤 모양을 하는지 어느 정도 감이 올 것입니다. 그런데 베지에 곡선이라는 게 정말 무엇일까요? 베지에 곡선의 정체는 두 가지 방법으로 설명할 수 있고, 알고 보면 둘 다 완전히 동일하지만, 하나는 복잡한 수식을 쓰고 다른 하나는 정말 쉬운 수식을 씁니다. 그러니까... 일단 쉬운 설명부터 들어 봅시다.

    -

    베지에 곡선은 선형 보간의 결과물입니다. 말로만 들으면 어려워 보이지만 사실은 독자 여러분도 어릴 때부터 선형 보간을 한 적이 있을 것입니다. 무언가 둘 사이에 있는 것을 가리킬 때 했던 것이 바로 선형 보간, 즉 "두 점 사이에 점을 하나 찍는 것"입니다.

    -

    이 두 점 사이의 거리를 알고 있고, 예를 들어서 새로운 점을 한 점에서 20% 거리에(즉, 다른 점에서 80% 거리에) 찍으려고 한다면 이 점의 위치는 정말 쉽게 계산할 수 있습니다.

    - - -

    눈으로도 직접 확인해 봅시다. 아래 그림은 상하 방향키로 보간 비율을 늘리거나 줄이면서 어떻게 바뀌는지 확인할 수 있는 인터랙티브 그래픽입니다. 우선 세 점으로 시작해서 선분을 두 개 그립니다. 이 두 선분에 선형 보간을 해서 두 점을 얻을 수 있고, 이 두 점 사이에 다시 선형 보간을 하면 한 점이 됩니다. 바로 이 점(과 모든 비율에 대해 같은 방법으로 취한 모든 점)이 베지에 곡선이 됩니다.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    바로 여기서 복잡한 수학... 미적분이 등장합니다.

    -

    전혀 그렇게 느껴지지는 않겠지만, 우리가 여기서 한 것이 바로 이차곡선을 (한 획이 아니라 여러 단계에 걸쳐) 그리는 작업입니다. 베지에 곡선의 신기한 점 중 하나가 같은 대상을 다항함수로도, 혹은 간단하게 보간의 보간의 ... 보간으로도 나타낼 수 있다는 것입니다. 다시 말하자면, 우리는 이 곡선의 성질을 "진짜 수학"을 통해서든(곡선의 함수와 도함수 따위를 이용해서), "기계적" 구성을 통해서든(예를 들어, 베지에 곡선은 곡선을 그리는 데 사용한 점 밖으로 빠져나가지 않음을 알 수 있습니다) 탐구할 수 있습니다.

    -

    이제부터는 베지에 곡선을 조금 더 자세히 ― 수식으로 어떻게 표현하는지, 어떤 성질을 가지고 있는지, 어떻게 변형하고 어떻게 쓸 수 있는지 ― 알아보도록 하겠습니다.

    - -
    -
    -

    베지에 곡선을 이루는 수학

    -

    베지에 곡선은 "매개변수" 함수의 일종입니다. 수학적으로 말하자면 매개변수 함수는 꼼수입니다. "함수"란 임의의 개수의 입력에서 하나의 출력으로 가는 대응을 의미하는, 엄연한 정의가 있는 용어입니다. 수를/수들을 넣으면 하나의 값이 나옵니다. 다른 수를 넣어도 값이 하나만 나오는 점은 변하지 않습니다.

    -

    매개변수 함수는 꼼수입니다. "저는 여러 값이 나오는 함수가 필요한데 그냥 함수를 여러 개 쓰면 안 돼요?"인 것이죠. 예컨대 어떤 값(x라고 합시다)을 어떤 연산을 통해 다른 값과 대응시키는 함수가 있다고 합시다.

    - - -

    f(x)라는 표기는 f(함수가 하나만 있을 때는 관습적으로 f를 씁니다)가 함수이며, 그 값이 하나의 변수(여기서는 x)에 따라 바뀜을 나타내는 표준적인 방법입니다. x를 바꾸면 f(x)의 값도 바뀝니다.

    -

    여기까지는 좋습니다. 이제 매개변수 함수를 보고, 어떤 꼼수를 쓰는지 확인해 봅시다. 아래의 두 함수를 생각해 봅시다.

    - - -

    별 특이할 점이 없는 사인과 코사인 함수인데, 자세히 보면 두 입력이 다른 이름을 가지는 것을 확인할 수 있습니다. a의 값을 바꾸더라도 f(b)에서는 a를 쓰지 않으므로 f(b)의 값이 바뀌지는 않습니다. 바로 이 부분을 조금 고치는 것이 매개변수 함수의 꼼수입니다. 매개변수 함수에서는 아래와 같이 여러 가지 함수가 같은 변수를 공유합니다.

    - - -

    함수는 여러 개인데 변수는 하나뿐이네요. t의 값을 바꾸면 fa(t)fb(t)의 값이 모두 바뀝니다. 이게 어딜 봐서 유용한지 궁금해하는 독자가 있다면, 답은 의외로 간단합니다. fa(t)fb(t)라는 이름표를 매개변수 곡선을 만들 때 보통 의미하는 바를 나타내도록 바꾸면 이 식이 눈에 더 잘 들어올지도 모르겠습니다.

    - - -

    짜잔! x/y좌표가 어떤 미지의 값 t를 통해 이어져 있었습니다.

    -

    정리하자면, 매개변수 곡선은 보통의 함수처럼 y좌표를 x좌표값에 대해 정의하는 대신 두 값을 어떤 "조절" 변수로 연결합니다. t의 값을 바꿀 때마다 매개변수 함수에서 값을 얻을 수 있고, 이 값을 그래프에 (x,y) 좌표로 나타낼 수 있습니다. 예를 들어 위에서 소개한 매개변수 함수는 원 위의 점을 생성합니다. t의 값을 음의 무한대에서 양의 무한대까지 잡을 수 있고, 여기서 나오는 (x,y) 좌표는 항상 중심이 원점 (0,0)이고 반지름이 1인 원 위에 있습니다. t의 값 0부터 5까지에 대해 그래프를 그리면 다음과 같습니다.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    베지에 곡선은 여러 종류의 매개변수 함수 중 한 종류이고, 모든 출력값을 같은 기초 함수로 구합니다. 위에서 살펴본 예제에서는 xy 값을 다른 함수로 구했지만(하나는 사인, 다른 하나는 코사인), 베지에 곡선은 x, y 값을 모두 "이항 다항식"으로 구합니다. 이항 다항식은 또 뭘까요?

    -

    중·고등학교에서 다항식을 배운 기억이 나시나요? 이렇게 생긴 합을 다항식이라고 합니다.

    - - -

    여기서 가장 높은 차수의 항이 이면 3차( 다항)식, 이면 2차식이라 하고, x라면 1차식, 즉 직선이 됩니다. (혹시 x가 있는 항이 하나도 없다면 그건 애초에 다항식이 아니죠!)

    -

    베지에 곡선은 x가 아니라 t에 대한 다항식이고, t의 값이 0과 1 사이로 고정되며 a, b 등의 계수가 "이항계수"의 형태를 띱니다. 이항계수라고 하면 어려워 보이지만 사실은 두 값을 섞는 것을 간단하게 나타낸 것입니다.

    - - -

    제가 한 말이긴 하지만 저 식이 딱히 간단해 보이지는 않죠! 그런데 여기서 t를 없애고 대신 "곱하기 1"을 넣으면, 복잡한 수식이 갑자기 간단하게 바뀝니다. 밑에 늘어놓은 이항계수를 보세요.

    - - -

    식을 잘 보면 2는 1+1과 같고, 3은 2+1과 1+2이고, 6은 3+3...이라는 것을 알 수 있습니다. 보신 바와 같이 한 차수가 높아질 때마다 양 끝은 1이고, 그 사이의 모든 수는 "윗줄에 있는 두 수를 더한 것"과 같으며, 이렇게 만들어진 모양을 파스칼의 삼각형이라고 합니다. 이건 기억하기 쉽죠?

    -

    다항식의 문자 부분도 쉽게 외울 수 있는 방법이 있습니다. (1-t)a로, tb로 바꾸고 계수를 잠깐 지우면 아래의 식을 얻습니다.

    - - -

    이제 이 식은 "ab의 모든 조합의 합"인데, + 부호가 나올 때마다 a가 하나씩 b로 바뀌는 것으로 외울 수 있습니다. 이쪽도 그렇게 어렵지 않았네요. 이것으로 이항 다항식을 모두 배웠고, 혹시 필요할 경우를 위해 아래에 일반항의 식을 추가합니다.

    - - -

    위 식으로 베지에 곡선을 완전히 기술할 수 있습니다. 이때 위 함수에서 Σ(시그마) 기호는 여러 개의 값을 더한다는 의미입니다(Σ 기호 밑에 적힌 문자를 ...=<값>에서 시작해서 Σ 위에 적힌 값까지 바꿔가면서 더합니다).

    -
    + +

    + 위 식으로 베지에 곡선을 완전히 기술할 수 있습니다. 이때 위 함수에서 Σ(시그마) 기호는 여러 개의 값을 더한다는 의미입니다(Σ 기호 밑에 적힌 + 문자를 ...=<값>에서 시작해서 Σ 위에 적힌 값까지 바꿔가면서 더합니다). +

    +
    +

    기저 함수를 구현하는 방법

    +

    기저 함수를 구현할 때, 아래와 같이 위에서 본 수식을 그대로 함수로 옮겨서 구현할 수는 있습니다.

    -

    기저 함수를 구현하는 방법

    -

    기저 함수를 구현할 때, 아래와 같이 위에서 본 수식을 그대로 함수로 옮겨서 구현할 수는 있습니다.

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    구현할 "수는" 있다고 한 것은 이렇게 구현하지 않을 것이기 때문입니다. 계승 함수는 매우 무겁고, 위에서 배운 것과 같이 계승을 쓰지 않고도 파스칼의 삼각형은 쉽게 만들 수 있습니다. [1]부터 시작하고, 그 다음에는 [1,1], [1,2,1], [1,3,3,1]과 같은 식으로, 줄이 바뀔 때마다 수 하나를 더 넣고 양쪽 끝은 1로, 그 사이의 모든 수는 지금 계산하고 있는 줄 바로 위의 양쪽 원소의 합으로 채우면 됩니다.

    -

    위의 삼각형은 리스트의 리스트로 눈 깜짝할 새 구현할 수 있고, 이제 룩업 테이블을 만들었으니 이항계수는 다시 구할 필요도 없습니다.

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    - -

    코드를 해석해 볼까요? 우선 웬만한 n/k 값을 모두 다룰 수 있는 적당한 크기로 룩업 테이블을 선언합니다. 그 다음에는 필요한 값을 읽어오는 함수를 선언하고, 인자로 전달된 n/k 값이 테이블에 없으면 우선 테이블을 키우도록 합니다. 이제 기저 함수를 아래와 같이 구현할 수 있습니다.

    + return lut[n][k] + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + - - - - -
    1 - -
    2
    3
    4
    5
    - -

    좋습니다. 물론 여기에서도 최적화의 여지가 남아 있습니다. 대부분의 컴퓨터 그래픽 용도로는 임의 차수의 곡선이 필요 없습니다(물론 입문서에서는 임의 차수 곡선을 다루는 코드를 제공할 예정입니다). 보통 2차와 3차 곡선만 사용하기 때문에, 아래와 같이 훨씬 간단한 코드로 고칠 수 있습니다.

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    이제 기저 함수를 프로그래밍하는 법을 모두 배웠습니다. 와!

    -
    + return mt3 + 3*mt2*t + 3*mt*t2 + t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    지금까지 베지에 곡선의 기저 함수를 살펴보았습니다. 이제 베지에 곡선을 그토록 특별하게 만드는 것, 조절점을 살펴볼 시간입니다.

    +

    이제 기저 함수를 프로그래밍하는 법을 모두 배웠습니다. 와!

    +
    -
    -
    -

    Controlling Bézier curvatures

    -

    Bézier curves are, like all "splines", interpolation functions. This means that they take a set of points, and generate values somewhere "between" those points. (One of the consequences of this is that you'll never be able to generate a point that lies outside the outline for the control points, commonly called the "hull" for the curve. Useful information!). In fact, we can visualize how each point contributes to the value generated by the function, so we can see which points are important, where, in the curve.

    -

    The following graphs show the interpolation functions for quadratic and cubic curves, with "S" being the strength of a point's contribution to the total sum of the Bézier function. Click-and-drag to see the interpolation percentages for each curve-defining point at a specific t value.

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -
    +

    지금까지 베지에 곡선의 기저 함수를 살펴보았습니다. 이제 베지에 곡선을 그토록 특별하게 만드는 것, 조절점을 살펴볼 시간입니다.

    +
    +
    +

    + + Controlling Bézier curvatures +

    +

    + Bézier curves are, like all "splines", interpolation functions. This means that they take a set of points, and generate values somewhere + "between" those points. (One of the consequences of this is that you'll never be able to generate a point that lies outside the outline + for the control points, commonly called the "hull" for the curve. Useful information!). In fact, we can visualize how each point + contributes to the value generated by the function, so we can see which points are important, where, in the curve. +

    +

    + The following graphs show the interpolation functions for quadratic and cubic curves, with "S" being the strength of a point's + contribution to the total sum of the Bézier function. Click-and-drag to see the interpolation percentages for each curve-defining point at + a specific t value. +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    -

    Also shown is the interpolation function for a 15th order Bézier function. As you can see, the start and end point contribute considerably more to the curve's shape than any other point in the control point set.

    -

    If we want to change the curve, we need to change the weights of each point, effectively changing the interpolations. The way to do this is about as straightforward as possible: just multiply each point with a value that changes its strength. These values are conventionally called "weights", and we can add them to our original Bézier function:

    - - -

    That looks complicated, but as it so happens, the "weights" are actually just the coordinate values we want our curve to have: for an nth order curve, w0 is our start coordinate, wn is our last coordinate, and everything in between is a controlling coordinate. Say we want a cubic curve that starts at (110,150), is controlled by (25,190) and (210,250) and ends at (210,30), we use this Bézier curve:

    - - -

    Which gives us the curve we saw at the top of the article:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    What else can we do with Bézier curves? Quite a lot, actually. The rest of this article covers a multitude of possible operations and algorithms that we can apply, and the tasks they achieve.

    -
    + +

    Which gives us the curve we saw at the top of the article:

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +

    + What else can we do with Bézier curves? Quite a lot, actually. The rest of this article covers a multitude of possible operations and + algorithms that we can apply, and the tasks they achieve. +

    +
    +

    How to implement the weighted basis function

    +

    Given that we already know how to implement basis function, adding in the control points is remarkably easy:

    -

    How to implement the weighted basis function

    -

    Given that we already know how to implement basis function, adding in the control points is remarkably easy:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    And now for the optimized versions:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    And now we know how to program the weighted basis function.

    -
    + return w[0]*mt3 + 3*w[1]*mt2*t + 3*w[2]*mt*t2 + w[3]*t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -
    -
    -

    Controlling Bézier curvatures, part 2: Rational Béziers

    -

    We can further control Bézier curves by "rationalising" them: that is, adding a "ratio" value in addition to the weight value discussed in the previous section, thereby gaining control over "how strongly" each coordinate influences the curve.

    -

    Adding these ratio values to the regular Bézier curve function is fairly easy. Where the regular function is the following:

    - - -

    The function for rational Bézier curves has two more terms:

    - - -

    In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just using different weight. So far, nothing too special.

    -

    However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights.

    -

    This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly describing circles (which we'll see in a later section is literally impossible using standard Bézier curves).

    -

    But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects what gets drawn:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - -

    You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence relative to all other points.

    -
    + +

    + In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] + then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just + using different weight. So far, nothing too special. +

    +

    + However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a + fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the + "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights. +

    +

    + This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve + that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly + describing circles (which we'll see in a later section is literally impossible using standard Bézier curves). +

    +

    + But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an + interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with + ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate + exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects + what gets drawn: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + + + +

    + You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will + want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like + with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence + relative to all other points. +

    +
    +

    How to implement rational curves

    +

    Extending the code of the previous section to include ratios is almost trivial:

    -

    How to implement rational curves

    -

    Extending the code of the previous section to include ratios is almost trivial:

    - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    - -

    And that's all we have to do.

    -
    + return (f[0] * w[0] + f[1] * w[1] + f[2] * w[2] + f[3] * w[3])/basis + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + -
    -
    -

    The Bézier interval [0,1]

    -

    Now that we know the mathematics behind Bézier curves, there's one curious thing that you may have noticed: they always run from t=0 to t=1. Why that particular interval?

    -

    It all has to do with how we run from "the start" of our curve to "the end" of our curve. If we have a value that is a mixture of two other values, then the general formula for this is:

    - - -

    The obvious start and end values here need to be a=1, b=0, so that the mixed value is 100% value 1, and 0% value 2, and a=0, b=1, so that the mixed value is 0% value 1 and 100% value 2. Additionally, we don't want "a" and "b" to be independent: if they are, then we could just pick whatever values we like, and end up with a mixed value that is, for example, 100% value 1 and 100% value 2. In principle that's fine, but for Bézier curves we always want mixed values between the start and end point, so we need to make sure we can never set "a" and "b" to some values that lead to a mix value that sums to more than 100%. And that's easy:

    - - -

    With this we can guarantee that we never sum above 100%. By restricting a to values in the interval [0,1], we will always be somewhere between our two values (inclusively), and we will always sum to a 100% mix.

    -

    But... what if we use this form, which is based on the assumption that we will only ever use values between 0 and 1, and instead use values outside of that interval? Do things go horribly wrong? Well... not really, but we get to "see more".

    -

    In the case of Bézier curves, extending the interval simply makes our curve "keep going". Bézier curves are simply segments of some polynomial curve, so if we pick a wider interval we simply get to see more of the curve. So what do they look like?

    -

    The following two graphics show you Bézier curves rendered "the usual way", as well as the curves they "lie on" if we were to extend the t values much further. As you can see, there's a lot more "shape" hidden in the rest of the curve, and we can model those parts by moving the curve points around.

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    + +

    + With this we can guarantee that we never sum above 100%. By restricting a to values in the interval [0,1], we will always be + somewhere between our two values (inclusively), and we will always sum to a 100% mix. +

    +

    + But... what if we use this form, which is based on the assumption that we will only ever use values between 0 and 1, and instead use + values outside of that interval? Do things go horribly wrong? Well... not really, but we get to "see more". +

    +

    + In the case of Bézier curves, extending the interval simply makes our curve "keep going". Bézier curves are simply segments of some + polynomial curve, so if we pick a wider interval we simply get to see more of the curve. So what do they look like? +

    +

    + The following two graphics show you Bézier curves rendered "the usual way", as well as the curves they "lie on" if we were to extend the + t values much further. As you can see, there's a lot more "shape" hidden in the rest of the curve, and we can model those + parts by moving the curve points around. +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    -

    In fact, there are curves used in graphics design and computer modelling that do the opposite of Bézier curves; rather than fixing the interval, and giving you freedom to choose the coordinates, they fix the coordinates, but give you freedom over the interval. A great example of this is the "Spiro" curve, which is a curve based on part of a Cornu Spiral, also known as Euler's Spiral. It's a very aesthetically pleasing curve and you'll find it in quite a few graphics packages like FontForge and Inkscape. It has even been used in font design, for example for the Inconsolata typeface.

    - -
    -
    -

    Bézier curvatures as matrix operations

    -

    We can also represent Bézier curves as matrix operations, by expressing the Bézier formula as a polynomial basis function and a coefficients matrix, and the actual coordinates as a matrix. Let's look at what this means for the cubic curve, using P... to refer to coordinate values "in one or more dimensions":

    - - -

    Disregarding our actual coordinates for a moment, we have:

    - - -

    We can write this as a sum of four expressions:

    - - -

    And we can expand these expressions:

    - - -

    Furthermore, we can make all the 1 and 0 factors explicit:

    - - -

    And that, we can view as a series of four matrix operations:

    - - -

    If we compact this into a single matrix operation, we get:

    - - -

    This kind of polynomial basis representation is generally written with the bases in increasing order, which means we need to flip our t matrix horizontally, and our big "mixing" matrix upside down:

    - - -

    And then finally, we can add in our original coordinates as a single third matrix:

    - - -

    We can perform the same trick for the quadratic curve, in which case we end up with:

    - - -

    If we plug in a t value, and then multiply the matrices, we will get exactly the same values as when we evaluate the original polynomial function, or as when we evaluate the curve using progressive linear interpolation.

    -

    So: why would we bother with matrices? Matrix representations allow us to discover things about functions that would otherwise be hard to tell. It turns out that the curves form triangular matrices, and they have a determinant equal to the product of the actual coordinates we use for our curve. It's also invertible, which means there's a ton of properties that are all satisfied. Of course, the main question is "why is this useful to us now?", and the answer to that is that it's not immediately useful, but you'll be seeing some instances where certain curve properties can be either computed via function manipulation, or via clever use of matrices, and sometimes the matrix approach can be (drastically) faster.

    -

    So for now, just remember that we can represent curves this way, and let's move on.

    + +

    + If we plug in a t value, and then multiply the matrices, we will get exactly the same values as when we evaluate the original + polynomial function, or as when we evaluate the curve using progressive linear interpolation. +

    +

    + So: why would we bother with matrices? Matrix representations allow us to discover things about functions that would + otherwise be hard to tell. It turns out that the curves form + triangular matrices, and they have a determinant equal to the product of the + actual coordinates we use for our curve. It's also invertible, which means there's + a ton of properties that are all satisfied. Of + course, the main question is "why is this useful to us now?", and the answer to that is that it's not immediately useful, but + you'll be seeing some instances where certain curve properties can be either computed via function manipulation, or via clever use of + matrices, and sometimes the matrix approach can be (drastically) faster. +

    +

    So for now, just remember that we can represent curves this way, and let's move on.

    +
    +
    +

    + + de Casteljau's algorithm +

    +

    + If we want to draw Bézier curves, we can run through all values of t from 0 to 1 and then compute the weighted basis function + at each value, getting the x/y values we need to plot. Unfortunately, the more complex the curve gets, the more expensive + this computation becomes. Instead, we can use de Casteljau's algorithm to draw curves. This is a geometric approach to curve + drawing, and it's really easy to implement. So easy, in fact, you can do it by hand with a pencil and ruler. +

    +

    Rather than using our calculus function to find x/y values for t, let's do this instead:

    +
      +
    • treat t as a ratio (which it is). t=0 is 0% along a line, t=1 is 100% along a line.
    • +
    • Take all lines between the curve's defining points. For an order n curve, that's n lines.
    • +
    • + Place markers along each of these line, at distance t. So if t is 0.2, place the mark at 20% from the start, + 80% from the end. +
    • +
    • Now form lines between those points. This gives n-1 lines.
    • +
    • Place markers along each of these line at distance t.
    • +
    • Form lines between those points. This'll be n-2 lines.
    • +
    • Place markers, form lines, place markers, etc.
    • +
    • + Repeat this until you have only one line left. The point t on that line coincides with the original curve point at + t. +
    • +
    +

    + To see this in action, move the slider for the following sketch to changes which curve point is explicitly evaluated using de Casteljau's + algorithm. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    +

    How to implement de Casteljau's algorithm

    +

    + Let's just use the algorithm we just specified, and implement that as a function that can take a list of curve-defining points, and a + t value, and draws the associated point on the curve for that t value: +

    -
    -
    -

    de Casteljau's algorithm

    -

    If we want to draw Bézier curves, we can run through all values of t from 0 to 1 and then compute the weighted basis function at each value, getting the x/y values we need to plot. Unfortunately, the more complex the curve gets, the more expensive this computation becomes. Instead, we can use de Casteljau's algorithm to draw curves. This is a geometric approach to curve drawing, and it's really easy to implement. So easy, in fact, you can do it by hand with a pencil and ruler.

    -

    Rather than using our calculus function to find x/y values for t, let's do this instead:

    -
      -
    • treat t as a ratio (which it is). t=0 is 0% along a line, t=1 is 100% along a line.
    • -
    • Take all lines between the curve's defining points. For an order n curve, that's n lines.
    • -
    • Place markers along each of these line, at distance t. So if t is 0.2, place the mark at 20% from the start, 80% from the end.
    • -
    • Now form lines between those points. This gives n-1 lines.
    • -
    • Place markers along each of these line at distance t.
    • -
    • Form lines between those points. This'll be n-2 lines.
    • -
    • Place markers, form lines, place markers, etc.
    • -
    • Repeat this until you have only one line left. The point t on that line coincides with the original curve point at t.
    • -
    -

    To see this in action, move the slider for the following sketch to changes which curve point is explicitly evaluated using de Casteljau's algorithm.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -
    - -

    How to implement de Casteljau's algorithm

    -

    Let's just use the algorithm we just specified, and implement that as a function that can take a list of curve-defining points, and a t value, and draws the associated point on the curve for that t value:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    And done, that's the algorithm implemented. Although: usually you don't get the luxury of overloading the "+" operator, so let's also give the code for when you need to work with x and y values separately:

    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So what does this do? This draws a point, if the passed list of points is only 1 point long. Otherwise it will create a new list of points that sit at the t ratios (i.e. the "markers" outlined in the above algorithm), and then call the draw function for this new list.

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + +

    + So what does this do? This draws a point, if the passed list of points is only 1 point long. Otherwise it will create a new list of + points that sit at the t ratios (i.e. the "markers" outlined in the above algorithm), and then call the draw function for this + new list. +

    + +
    +
    +

    + + Simplified drawing +

    +

    + We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, + a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines. +

    +

    + We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the + number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we + can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we + lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or + curvature alignment. +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    -
    -
    -

    Simplified drawing

    -

    We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines.

    -

    We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or curvature alignment.

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -
    +

    + Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. + You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the + cubic curve), a higher number is required to capture the curvature changes properly. +

    +
    +

    How to implement curve flattening

    +

    Let's just use the algorithm we just specified, and implement that:

    -

    Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the cubic curve), a higher number is required to capture the curvature changes properly.

    -
    - -

    How to implement curve flattening

    -

    Let's just use the algorithm we just specified, and implement that:

    - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    And done, that's the algorithm implemented. That just leaves drawing the resulting "curve" as a sequence of lines:

    + return coordinates; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

    -
    + coord = _coord + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -
    -
    -

    Splitting curves

    -

    Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -
    +

    We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

    +
    +
    +
    +

    + + Splitting curves +

    +

    + Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken + together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the + points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our + on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    +

    implementing curve splitting

    +

    We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

    -

    implementing curve splitting

    -

    We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

    - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    - -

    After running this function for some value t, the left and right arrays will contain all the coordinates for two new curves - one to the "left" of our t value, the other on the "right". These new curves will have the same order as the original curve, and can be overlaid exactly on the original curve.

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + -
    -
    -

    Splitting curves using matrices

    -

    Another way to split curves is to exploit the matrix representation of a Bézier curve. In the section on matrices, we saw that we can represent curves as matrix multiplications. Specifically, we saw these two forms for the quadratic and cubic curves respectively: (we'll reverse the Bézier coefficients vector for legibility)

    - - -

    and

    - - -

    Let's say we want to split the curve at some point t = z, forming two new (obviously smaller) Bézier curves. To find the coordinates for these two Bézier curves, we can use the matrix representation and some linear algebra. First, we separate out the actual "point on the curve" information into a new matrix multiplication:

    - - -

    and

    - - -

    If we could compact these matrices back to the form [t values] · [Bézier matrix] · [column matrix], with the first two staying the same, then that column matrix on the right would be the coordinates of a new Bézier curve that describes the first segment, from t = 0 to t = z. As it turns out, we can do this quite easily, by exploiting some simple rules of linear algebra (and if you don't care about the derivations, just skip to the end of the box for the results!).

    -
    - -

    Deriving new hull coordinates

    -

    Deriving the two segments upon splitting a curve takes a few steps, and the higher the curve order, the more work it is, so let's look at the quadratic curve first:

    - - - - - - - - -

    We can do this because [M · M-1] is the identity matrix. It's a bit like multiplying something by x/x in calculus: it doesn't do anything to the function, but it does allow you to rewrite it to something that may be easier to work with, or can be broken up differently. In the same way, multiplying our matrix by [M · M-1] has no effect on the total formula, but it does allow us to change the matrix sequence [something · M] to a sequence [M · something], and that makes a world of difference: if we know what [M-1 · Z · M] is, we can apply that to our coordinates, and be left with a proper matrix representation of a quadratic Bézier curve (which is [T · M · P]), with a new set of coordinates that represent the curve from t = 0 to t = z. So let's get computing:

    - - -

    Excellent! Now we can form our new quadratic curve:

    - - - - - - -

    Brilliant: if we want a subcurve from t = 0 to t = z, we can keep the first coordinate the same (which makes sense), our control point becomes a z-ratio mixture of the original control point and the start point, and the new end point is a mixture that looks oddly similar to a Bernstein polynomial of degree two. These new coordinates are actually really easy to compute directly!

    -

    Of course, that's only one of the two curves. Getting the section from t = z to t = 1 requires doing this again. We first observe that in the previous calculation, we actually evaluated the general interval [0,z]. We were able to write it down in a more simple form because of the zero, but what we actually evaluated, making the zero explicit, was:

    - - - - -

    If we want the interval [z,1], we will be evaluating this instead:

    - - - - -

    We're going to do the same trick of multiplying by the identity matrix, to turn [something · M] into [M · something]:

    - - -

    So, our final second curve looks like:

    - - - - - - -

    Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to compute directly!

    -
    + +

    + Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio + mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein + polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to + compute directly! +

    + -

    So, using linear algebra rather than de Casteljau's algorithm, we have determined that, for any quadratic curve split at some value t = z, we get two subcurves that are described as Bézier curves with simple-to-derive coordinates:

    - - -

    and

    - - -

    We can do the same for cubic curves. However, I'll spare you the actual derivation (don't let that stop you from writing that out yourself, though) and simply show you the resulting new coordinate sets:

    - - -

    and

    - - -

    So, looking at our matrices, did we really need to compute the second segment matrix? No, we didn't. Actually having one segment's matrix means we implicitly have the other: push the values of each row in the matrix Q to the right, with zeroes getting pushed off the right edge and appearing back on the left, and then flip the matrix vertically. Presto, you just "calculated" Q'.

    -

    Implementing curve splitting this way requires less recursion, and is just straight arithmetic with cached values, so can be cheaper on systems where recursion is expensive. If you're doing computation with devices that are good at matrix multiplication, chopping up a Bézier curve with this method will be a lot faster than applying de Casteljau.

    - -
    -
    -

    Lowering and elevating curve order

    -

    One interesting property of Bézier curves is that an nth order curve can always be perfectly represented by an (n+1)th order curve, by giving the higher-order curve specific control points.

    -

    If we have a curve with three points, then we can create a curve with four points that exactly reproduces the original curve. First, we give it the same start and end points, and for its two control points we pick "1/3rd start + 2/3rd control" and "2/3rd control + 1/3rd end". Now we have exactly the same curve as before, except represented as a cubic curve rather than a quadratic curve.

    -

    The general rule for raising an nth order curve to an (n+1)th order curve is as follows (observing that the start and end weights are the same as the start and end weights for the old curve):

    - - -

    However, this rule also has as direct consequence that you cannot generally safely lower a curve from nth order to (n-1)th order, because the control points cannot be "pulled apart" cleanly. We can try to, but the resulting curve will not be identical to the original, and may in fact look completely different.

    -

    However, there is a surprisingly good way to ensure that a lower order curve looks "as close as reasonably possible" to the original curve: we can optimise the "least-squares distance" between the original curve and the lower order curve, in a single operation (also explained over on Sirver's Castle). However, to use it, we'll need to do some calculus work and then switch over to linear algebra. As mentioned in the section on matrix representations, some things can be done much more easily with matrices than with calculus functions, and this is one of those things. So... let's go!

    -

    We start by taking the standard Bézier function, and condensing it a little:

    - - -

    Then, we apply one of those silly (actually, super useful) calculus tricks: since our t value is always between zero and one (inclusive), we know that (1-t) plus t always sums to 1. As such, we can express any value as a sum of t and 1-t:

    - - -

    So, with that seemingly trivial observation, we rewrite that Bézier function by splitting it up into a sum of a (1-t) and t component:

    - - -

    So far so good. Now, to see why we did this, let's write out the (1-t) and t parts, and see what that gives us. I promise, it's about to make sense. We start with (1-t):

    - - -

    So by using this seemingly silly trick, we can suddenly express part of our nth order Bézier function in terms of an (n+1)th order Bézier function. And that sounds a lot like raising the curve order! Of course we need to be able to repeat that trick for the t part, but that's not a problem:

    - - -

    So, with both of those changed from an order n expression to an order (n+1) expression, we can put them back together again. Now, where the order n function had a summation from 0 to n, the order n+1 function uses a summation from 0 to n+1, but this shouldn't be a problem as long as we can add some new terms that "contribute nothing". In the next section on derivatives, there is a discussion about why "higher terms than there is a binomial for" and "lower than zero terms" both "contribute nothing". So as long as we can add terms that have the same form as the terms we need, we can just include them in the summation, they'll sit there and do nothing, and the resulting function stays identical to the lower order curve.

    -

    Let's do this:

    - - -

    And this is where we switch over from calculus to linear algebra, and matrices: we can now express this relation between Bézier(n,t) and Bézier(n+1,t) as a very simple matrix multiplication:

    - - -

    where the matrix M is an n+1 by n matrix, and looks like:

    - - -

    That might look unwieldy, but it's really just a mostly-zeroes matrix, with a very simply fraction on the diagonal, and an even simpler fraction to the left of it. Multiplying a list of coordinates with this matrix means we can plug the resulting transformed coordinates into the one-order-higher function and get an identical looking curve.

    -

    Not too bad!

    -

    Equally interesting, though, is that with this matrix operation established, we can now use an incredibly powerful and ridiculously simple way to find out a "best fit" way to reverse the operation, called the normal equation. What it does is minimize the sum of the square differences between one set of values and another set of values. Specifically, if we can express that as some function A x = b, we can use it. And as it so happens, that's exactly what we're dealing with, so:

    - - -

    The steps taken here are:

    -
      -
    1. We have a function in a form that the normal equation can be used with, so
    2. -
    3. apply the normal equation!
    4. -
    5. Then, we want to end up with just Bn on the left, so we start by left-multiply both sides such that we'll end up with lots of stuff on the left that simplified to "a factor 1", which in matrix maths is the identity matrix.
    6. -
    7. In fact, by left-multiplying with the inverse of what was already there, we've effectively "nullified" (but really, one-inified) that big, unwieldy block into the identity matrix I. So we substitute the mess with I, and then
    8. -
    9. because multiplication with the identity matrix does nothing (like multiplying by 1 does nothing in regular algebra), we just drop it.
    10. -
    -

    And we're done: we now have an expression that lets us approximate an n+1th order curve with a lower nth order curve. It won't be an exact fit, but it's definitely a best approximation. So, let's implement these rules for raising and lowering curve order to a (semi) random curve, using the following graphic. Select the sketch, which has movable control points, and press your up and down arrow keys to raise or lower the curve order.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - -
    -
    -

    Derivatives

    -

    There's a number of useful things that you can do with Bézier curves based on their derivative, and one of the more amusing observations about Bézier curves is that their derivatives are, in fact, also Bézier curves. In fact, the differentiation of a Bézier curve is relatively straightforward, although we do need a bit of math.

    -

    First, let's look at the derivative rule for Bézier curves, which is:

    - - -

    which we can also write (observing that b in this formula is the same as our w weights, and that n times a summation is the same as a summation where each term is multiplied by n) as:

    - - -

    Or, in plain text: the derivative of an nth degree Bézier curve is an (n-1)th degree Bézier curve, with one fewer term, and new weights w'0...w'n-1 derived from the original weights as n(wi+1 - wi). So for a 3rd degree curve, with four weights, the derivative has three new weights: w'0 = 3(w1-w0), w'1 = 3(w2-w1) and w'2 = 3(w3-w2).

    -
    - -

    "Slow down, why is that true?"

    -

    Sometimes just being told "this is the derivative" is nice, but you might want to see why this is indeed the case. As such, let's have a look at the proof for this derivative. First off, the weights are independent of the full Bézier function, so the derivative involves only the derivative of the polynomial basis function. So, let's find that:

    - - -

    Applying the product and chain rules gives us:

    - - -

    Which is hard to work with, so let's expand that properly:

    - - -

    Now, the trick is to turn this expression into something that has binomial coefficients again, so we want to end up with things that look like "x! over y!(x-y)!". If we can do that in a way that involves terms of n-1 and k-1, we'll be on the right track.

    - - -

    And that's the first part done: the two components inside the parentheses are actually regular, lower-order Bézier expressions:

    - - -

    Now to apply this to our weighted Bézier curves. We'll write out the plain curve formula that we saw earlier, and then work our way through to its derivative:

    - - -

    If we expand this (with some color to show how terms line up), and reorder the terms by increasing values for k we see the following:

    - - -

    Two of these terms fall way: the first term falls away because there is no -1st term in a summation. As such, it always contributes "nothing", so we can safely completely ignore it for the purpose of finding the derivative function. The other term is the very last term in this expansion: one involving Bn-1,n. This term would have a binomial coefficient of [i choose i+1], which is a non-existent binomial coefficient. Again, this term would contribute "nothing", so we can ignore it, too. This means we're left with:

    - - -

    And that's just a summation of lower order curves:

    - - -

    We can rewrite this as a normal summation, and we're done:

    - - -
    + + -

    Let's rewrite that in a form similar to our original formula, so we can see the difference. We will first list our original formula for Bézier curves, and then the derivative:

    - - - - -

    What are the differences? In terms of the actual Bézier curve, virtually nothing! We lowered the order (rather than n, it's now n-1), but it's still the same Bézier function. The only real difference is in how the weights change when we derive the curve's function. If we have four points A, B, C, and D, then the derivative will have three points, the second derivative two, and the third derivative one:

    - - -

    We can keep performing this trick for as long as we have more than one weight. Once we have one weight left, the next step will see k = 0, and the result of our "Bézier function" summation is zero, because we're not adding anything at all. As such, a quadratic curve has no second derivative, a cubic curve has no third derivative, and generalized: an nth order curve has n-1 (meaningful) derivatives, with any further derivative being zero.

    - -
    -
    -

    Tangents and normals

    -

    If you want to move objects along a curve, or "away from" a curve, the two vectors you're most interested in are the tangent vector and normal vector for curve points. These are actually really easy to find. For moving and orienting along a curve, we use the tangent, which indicates the direction of travel at specific points, and is literally just the first derivative of our curve:

    - - -

    This gives us the directional vector we want. We can normalize it to give us uniform directional vectors (having a length of 1.0) at each point, and then do whatever it is we want to do based on those directions:

    - - -

    The tangent is very useful for moving along a line, but what if we want to move away from the curve instead, perpendicular to the curve at some point t? In that case we want the normal vector. This vector runs at a right angle to the direction of the curve, and is typically of length 1.0, so all we have to do is rotate the normalized directional vector and we're done:

    - - -
    - -

    Rotating coordinates is actually very easy, if you know the rule for it. You might find it explained as "applying a rotation matrix, which is what we'll look at here, too. Essentially, the idea is to take the circles over which we can rotate, and simply "sliding the coordinates" over these circles by the desired -angle. If we want a quarter circle turn, we take the coordinate, slide it along the circle by a quarter turn, and done.

    -

    To turn any point (x,y) into a rotated point (x',y') (over 0,0) by some angle φ, we apply this nice and easy computation:

    - - -

    Which is the "long" version of the following matrix transformation:

    - - -

    And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1.

    -

    But why does this work? Why this matrix multiplication? Wikipedia (technically, Thomas Herter and Klaus Lott) tells us that a rotation matrix can be -treated as a sequence of three (elementary) shear operations. When we combine this into a single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. DataGenetics have an excellent article about this very thing: it's really quite cool, and I strongly recommend taking a quick break from this primer to read that article.

    -
    + +

    + And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even + easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1. +

    +

    + But why does this work? Why this matrix multiplication? + Wikipedia (technically, Thomas Herter and Klaus + Lott) tells us that a rotation matrix can be treated as a sequence of three (elementary) shear operations. When we combine this into a + single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. + DataGenetics have an excellent article about this very thing: it's + really quite cool, and I strongly recommend taking a quick break from this primer to read that article. +

    + -

    The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant).

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    +

    + The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the + normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant). +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    +
    +
    +

    + + Working with 3D normals +

    +

    + Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things + this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but + for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, + it's not "super hard", but there are more steps involved and we should have a look at those. +

    +

    + Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. + However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, + not a single vector, so we basically need to define what "the" normal is in the 3D case. +

    +

    + The "naïve" approach is to construct what is known as the + Frenet normal, where we follow a simple recipe that works in + many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are + perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each + point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the + local tangent, as well as the tangents "right next to it". +

    +

    + Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the + vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know + lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same + logic we used for the 2D case: "just rotate it 90 degrees". +

    +

    So let's do that! And in a twist surprise, we can do this in four lines:

    +
      +
    • a = normalize(B'(t))
    • +
    • b = normalize(a + B''(t))
    • +
    • r = normalize(b × a)
    • +
    • normal = normalize(r × a)
    • +
    +

    Let's unpack that a little:

    +
      +
    • + We start by taking the normalized vector for the derivative at some point on the + curve. We normalize it so the maths is less work. Less work is good. +
    • +
    • + Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and + just had the same derivative and second derivative from that point on. +
    • +
    • + This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which + means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the + cross product. (Note that while that operation uses the × operator, it's most + definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent + a quarter circle to get our normal, just like we did in the 2D case. +
    • +
    • + Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal + vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second + time, and immediately get our normal vector. +
    • +
    +

    + And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can + move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor + position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    + However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the + curve" between t=0.65 and t=0.75... Why is it doing that? +

    +

    + As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are + "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute + normals that just... look good. +

    +

    Thankfully, Frenet normals are not our only option.

    +

    + Another option is to take a slightly more algorithmic approach and compute a form of + Rotation Minimising Frame + (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational + axis, and the normal vector, centered on an on-curve point. +

    +

    + These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we + could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at + the same time that you're building lookup tables for your curve. +

    +

    + The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by + applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are: +

    +
      +
    • Take a point on the curve for which we know the RM frame already,
    • +
    • take a next point on the curve for which we don't know the RM frame yet, and
    • +
    • + reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous + points as a "mirror". +
    • +
    • + This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal + that's slightly off-kilter, so: +
    • +
    • + reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror". +
    • +
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • +
    +

    So, let's write some code for that!

    +
    +

    Implementing Rotation Minimising Frames

    +

    + We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it + yields a frame with properties: +

    -
    -
    -

    Working with 3D normals

    -

    Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, it's not "super hard", but there are more steps involved and we should have a look at those.

    -

    Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, not a single vector, so we basically need to define what "the" normal is in the 3D case.

    -

    The "naïve" approach is to construct what is known as the Frenet normal, where we follow a simple recipe that works in many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the local tangent, as well as the tangents "right next to it".

    -

    Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same logic we used for the 2D case: "just rotate it 90 degrees".

    -

    So let's do that! And in a twist surprise, we can do this in four lines:

    -
      -
    • a = normalize(B'(t))
    • -
    • b = normalize(a + B''(t))
    • -
    • r = normalize(b × a)
    • -
    • normal = normalize(r × a)
    • -
    -

    Let's unpack that a little:

    -
      -
    • We start by taking the normalized vector for the derivative at some point on the curve. We normalize it so the maths is less work. Less work is good.
    • -
    • Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and just had the same derivative and second derivative from that point on.
    • -
    • This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the cross product. (Note that while that operation uses the × operator, it's most definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent a quarter circle to get our normal, just like we did in the 2D case.
    • -
    • Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second time, and immediately get our normal vector.
    • -
    -

    And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the curve" between t=0.65 and t=0.75... Why is it doing that?

    -

    As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute normals that just... look good.

    -

    Thankfully, Frenet normals are not our only option.

    -

    Another option is to take a slightly more algorithmic approach and compute a form of Rotation Minimising Frame (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational axis, and the normal vector, centered on an on-curve point.

    -

    These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at the same time that you're building lookup tables for your curve.

    -

    The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are:

    -
      -
    • Take a point on the curve for which we know the RM frame already,
    • -
    • take a next point on the curve for which we don't know the RM frame yet, and
    • -
    • reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous points as a "mirror".
    • -
    • This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal that's slightly off-kilter, so:
    • -
    • reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror".
    • -
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • -
    -

    So, let's write some code for that!

    -
    - -

    Implementing Rotation Minimising Frames

    -

    We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it yields a frame with properties:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    - -

    Then, we can write a function that generates a sequence of RM frames in the following manner:

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    - -

    Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more code to get much better looking normals.

    -
    + frames.add(x1) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + -

    Speaking of better looking, what does this actually look like? Let's revisit that earlier curve, but this time use rotation minimising frames rather than Frenet frames:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    That looks so much better!

    -

    For those reading along with the code: we don't even strictly speaking need a Frenet frame to start with: we could, for instance, treat the z-axis as our initial axis of rotation, so that our initial normal is (0,0,1) × tangent, and then take things from there, but having that initial "mathematically correct" frame so that the initial normal seems to line up based on the curve's orientation in 3D space is just nice.

    +

    + Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more + code to get much better looking normals. +

    + -
    -
    -

    Component functions

    -

    One of the first things people run into when they start using Bézier curves in their own programs is "I know how to draw the curve, but how do I determine the bounding box?". It's actually reasonably straightforward to do so, but it requires having some knowledge on exploiting math to get the values we need. For bounding boxes, we aren't actually interested in the curve itself, but only in its "extremities": the minimum and maximum values the curve has for its x- and y-axis values. If you remember your calculus (provided you ever took calculus, otherwise it's going to be hard to remember) we can determine function extremities using the first derivative of that function, but this poses a problem, since our function is parametric: every axis has its own function.

    -

    The solution: compute the derivative for each axis separately, and then fit them back together in the same way we do for the original.

    -

    Let's look at how a parametric Bézier curve "splits up" into two normal functions, one for the x-axis and one for the y-axis. Note the leftmost figure is again an interactive curve, without labeled axes (you get coordinates in the graph instead). The center and rightmost figures are the component functions for computing the x-axis value, given a value for t (between 0 and 1 inclusive), and the y-axis value, respectively.

    -

    If you move points in a curve sideways, you should only see the middle graph change; likewise, moving points vertically should only show a change in the right graph.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

     

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - -
    -
    -

    Finding extremities: root finding

    -

    Now that we understand (well, superficially anyway) the component functions, we can find the extremities of our Bézier curve by finding maxima and minima on the component functions, by solving the equation B'(t) = 0. We've already seen that the derivative of a Bézier curve is a simpler Bézier curve, but how do we solve the equality? Fairly easily, actually, until our derivatives are 4th order or higher... then things get really hard. But let's start simple:

    -

    Quadratic curves: linear derivatives.

    -

    The derivative of a quadratic Bézier curve is a linear Bézier curve, interpolating between just two terms, which means finding the solution for "where is this line 0" is effectively trivial by rewriting it to a function of t and solving. First we turn our quadratic Bézier function into a linear one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then we turn this into our solution for t using basic arithmetics:

    - - -

    Done.

    -

    Although with the caveat that if b-a is zero, there is no solution and we probably shouldn't try to perform that division.

    -

    Cubic curves: the quadratic formula.

    -

    The derivative of a cubic Bézier curve is a quadratic Bézier curve, and finding the roots for a quadratic polynomial means we can apply the Quadratic formula. If you've seen it before, you'll remember it, and if you haven't, it looks like this:

    - - -

    So, if we can rewrite the Bézier component function as a plain polynomial, we're done: we just plug in the values into the quadratic formula, check if that square root is negative or not (if it is, there are no roots) and then just compute the two values that come out (because of that plus/minus sign we get two). Any value between 0 and 1 is a root that matters for Bézier curves, anything below or above that is irrelevant (because Bézier curves are only defined over the interval [0,1]). So, how do we convert?

    -

    First we turn our cubic Bézier function into a quadratic one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then, using these v values, we can find out what our a, b, and c should be:

    - - -

    This gives us three coefficients {a, b, c} that are expressed in terms of v values, where the v values are expressions of our original coordinate values, so we can do some substitution to get:

    - - -

    Easy-peasy. We can now almost trivially find the roots by plugging those values into the quadratic formula.

    -

    And as a cubic curve, there is also a meaningful second derivative, which we can compute by simple taking the derivative of the derivative.

    -

    Quartic curves: Cardano's algorithm.

    -

    We haven't really looked at them before now, but the next step up would be a Quartic curve, a fourth degree Bézier curve. As expected, these have a derivative that is a cubic function, and now things get much harder. Cubic functions don't have a "simple" rule to find their roots, like the quadratic formula, and instead require quite a bit of rewriting to a form that we can even start to try to solve.

    -

    Back in the 16th century, before Bézier curves were a thing, and even before calculus itself was a thing, Gerolamo Cardano figured out that even if the general cubic function is really hard to solve, it can be rewritten to a form for which finding the roots is "easier" (even if not "easy"):

    - - -

    We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather than three: this makes things considerably easier to solve because it lets us use regular calculus to find the values that satisfy the equation.

    -

    Now, there is one small hitch: as a cubic function, the solutions may be complex numbers rather than plain numbers... And Cardano realised this, centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number.

    -

    So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at Ken J. Ward's page for solving the cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a read-through.

    -
    + +

    + We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather + than three: this makes things considerably easier to solve because it lets us use + regular calculus to find the values that satisfy the equation. +

    +

    + Now, there is one small hitch: as a cubic function, the solutions may be + complex numbers rather than plain numbers... And Cardano realised this, + centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these + numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we + have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're + going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number. +

    +

    + So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at + Ken J. Ward's page for solving the + cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the + complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a + read-through. +

    +
    +

    Implementing Cardano's algorithm for finding all real roots

    +

    + The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" + number space (denoted with ℝ), this is perfectly fine in the + "complex" number space (denoted with ℂ). And, as it so happens, Cardano is + also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm! +

    -

    Implementing Cardano's algorithm for finding all real roots

    -

    The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" number space (denoted with ℝ), this is perfectly fine in the "complex" number space (denoted with ℂ). And, as it so happens, Cardano is also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm!

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    37
    38
    39
    40
    41
    42
    43
    44
    45
    46
    47
    48
    49
    50
    51
    52
    53
    54
    55
    56
    57
    58
    59
    60
    61
    62
    63
    64
    65
    66
    67
    68
    69
    70
    71
    72
    73
    74
    75
    76
    77
    78
    79
    80
    81
    - -
    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + + 37 + + + 38 + + + 39 + + + 40 + + + 41 + + + 42 + + + 43 + + + 44 + + + 45 + + + 46 + + + 47 + + + 48 + + + 49 + + + 50 + + + 51 + + + 52 + + + 53 + + + 54 + + + 55 + + + 56 + + + 57 + + + 58 + + + 59 + + + 60 + + + 61 + + + 62 + + + 63 + + + 64 + + + 65 + + + 66 + + + 67 + + + 68 + + + 69 + + + 70 + + + 71 + + + 72 + + + 73 + + + 74 + + + 75 + + + 76 + + + 77 + + + 78 + + + 79 + + + 80 + + + 81 + + +
    -

    And that's it. The maths is complicated, but the code is pretty much just "follow the maths, while caching as many values as we can to prevent recomputing things as much as possible" and now we have a way to find all roots for a cubic function and can just move on with using that to find extremities of our curves.

    -

    And of course, as a quartic curve also has meaningful second and third derivatives, we can quite easily compute those by using the derivative of the derivative (of the derivative), just as for cubic curves.

    -

    Quintic and higher order curves: finding numerical solutions

    -

    And this is where thing stop, because we cannot find the roots for polynomials of degree 5 or higher using algebra (a fact known as the Abel–Ruffini theorem). Instead, for occasions like these, where algebra simply cannot yield an answer, we turn to numerical analysis.

    -

    That's a fancy term for saying "rather than trying to find exact answers by manipulating symbols, find approximate answers by describing the underlying process as a combination of steps, each of which can be assigned a number via symbolic manipulation". For example, trying to mathematically compute how much water fits in a completely crazy three dimensional shape is very hard, even if it got you the perfect, precise answer. A much easier approach, which would be less perfect but still entirely useful, would be to just grab a buck and start filling the shape until it was full: just count the number of buckets of water you used. And if we want a more precise answer, we can use smaller buckets.

    -

    So that's what we're going to do here, too: we're going to treat the problem as a sequence of steps, and the smaller we can make each step, the closer we'll get to that "perfect, precise" answer. And as it turns out, there is a really nice numerical root-finding algorithm, called the Newton-Raphson root finding method (yes, after that Newton), which we can make use of. The Newton-Raphson approach consists of taking our impossible-to-solve function f(x), picking some initial value x (literally any value will do), and calculating f(x). We can think of that value as the "height" of the function at x. If that height is zero, we're done, we have found a root. If it isn't, we calculate the tangent line at f(x) and calculate at which x value its height is zero (which we've already seen is very easy). That will give us a new x and we repeat the process until we find a root.

    -

    Mathematically, this means that for some x, at step n=1, we perform the following calculation until fy(x) is zero, so that the next t is the same as the one we already have:

    - - -

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    -

    Now, this works well only if we can pick good starting points, and our curve is continuously differentiable and doesn't have oscillations. Glossing over the exact meaning of those terms, the curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: which starting points do we pick?

    -

    As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as high as you'll ever need to go.

    -

    In conclusion:

    -

    So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - + +

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    +

    + Now, this works well only if we can pick good starting points, and our curve is + continuously differentiable and doesn't have + oscillations. Glossing over the exact meaning of those terms, the + curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: + which starting points do we pick? +

    +

    + As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from + t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may + pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: + there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order + curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as + high as you'll ever need to go. +

    +

    In conclusion:

    +

    + So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those + roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + +

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + +
    +
    +

    + + Bounding boxes +

    +

    + If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four + values we need to box in our curve: +

    +

    Computing the bounding box for a Bézier curve:

    +
      +
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. +
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. +
    5. + Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original + functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct. +
    6. +
    +

    + Applying this approach to our previous root finding, we get the following + axis-aligned bounding boxes (with all curve extremity points + shown on the curve): +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    -
    -
    -

    Bounding boxes

    -

    If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four values we need to box in our curve:

    -

    Computing the bounding box for a Bézier curve:

    -
      -
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. -
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. -
    5. Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct.
    6. -
    -

    Applying this approach to our previous root finding, we get the following axis-aligned bounding boxes (with all curve extremity points shown on the curve):

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    - -

    We can construct even nicer boxes by aligning them along our curve, rather than along the x- and y-axis, but in order to do so we first need to look at how aligning works.

    - -
    -
    -

    Aligning curves

    -

    While there are an incredible number of curves we can define by varying the x- and y-coordinates for the control points, not all curves are actually distinct. For instance, if we define a curve, and then rotate it 90 degrees, it's still the same curve, and we'll find its extremities in the same spots, just at different draw coordinates. As such, one way to make sure we're working with a "unique" curve is to "axis-align" it.

    -

    Aligning also simplifies a curve's functions. We can translate (move) the curve so that the first point lies on (0,0), which turns our n term polynomial functions into n-1 term functions. The order stays the same, but we have less terms. Then, we can rotate the curves so that the last point always lies on the x-axis, too, making its coordinate (...,0). This further simplifies the function for the y-component to an n-2 term function. For instance, if we have a cubic curve such as this:

    - - -

    Then translating it so that the first coordinate lies on (0,0), moving all x coordinates by -120, and all y coordinates by -160, gives us:

    - - -

    If we then rotate the curve so that its end point lies on the x-axis, the coordinates (integer-rounded for illustrative purposes here) become:

    - - -

    If we drop all the zero-terms, this gives us:

    - - -

    We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

     

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - + +

    + We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning + our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + +

     

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + +
    +
    +

    + + Tight bounding boxes +

    +

    + With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, + recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding + box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T. +

    +

    We now have nice tight bounding boxes for our curves:

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    -
    -
    -

    Tight bounding boxes

    -

    With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T.

    -

    We now have nice tight bounding boxes for our curves:

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    - -

    These are, strictly speaking, not necessarily the tightest possible bounding boxes. It is possible to compute the optimal bounding box by determining which spanning lines we need to effect a minimal box area, but because of the parametric nature of Bézier curves this is actually a rather costly operation, and the gain in bounding precision is often not worth it.

    - -
    -
    -

    Curve inflections

    -

    Now that we know how to align a curve, there's one more thing we can calculate: inflection points. Imagine we have a variable size circle that we can slide up against our curve. We place it against the curve and adjust its radius so that where it touches the curve, the curvatures of the curve and the circle are the same, and then we start to slide the circle along the curve - for quadratic curves, we can always do this without the circle behaving oddly: we might have to change the radius of the circle as we slide it along, but it'll always sit against the same side of the curve.

    -

    But what happens with cubic curves? Imagine we have an S curve and we place our circle at the start of the curve, and start sliding it along. For a while we can simply adjust the radius and things will be fine, but once we get to the midpoint of that S, something odd happens: the circle "flips" from one side of the curve to the other side, in order for the curvatures to keep matching. This is called an inflection, and we can find out where those happen relatively easily.

    -

    What we need to do is solve a simple equation:

    - - -

    What we're saying here is that given the curvature function C(t), we want to know for which values of t this function is zero, meaning there is no "curvature", which will be exactly at the point between our circle being on one side of the curve, and our circle being on the other side of the curve. So what does C(t) look like? Actually something that seems not too hard:

    - - -

    The function C(t) is the cross product between the first and second derivative functions for the parametric dimensions of our curve. And, as already shown, derivatives of Bézier curves are just simpler Bézier curves, with very easy to compute new coefficients, so this should be pretty easy.

    -

    However as we've seen in the section on aligning, aligning lets us simplify things a lot, by completely removing the contributions of the first coordinate from most mathematical evaluations, and removing the last y coordinate as well by virtue of the last point lying on the x-axis. So, while we can evaluate C(t) = 0 for our curve, it'll be much easier to first axis-align the curve and then evaluating the curvature function.

    -
    - -

    Let's derive the full formula anyway

    -

    Of course, before we do our aligned check, let's see what happens if we compute the curvature function without axis-aligning. We start with the first and second derivatives, given our basis functions:

    - - -

    And of course the same functions for y:

    - - -

    Asking a computer to now compose the C(t) function for us (and to expand it to a readable form of simple terms) gives us this rather overly complicated set of arithmetic expressions:

    - - -

    That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all disappear if we axis-align our curve, which is why aligning is a great idea.

    -
    + +

    + That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all + disappear if we axis-align our curve, which is why aligning is a great idea. +

    + -

    Aligning our curve so that three of the eight coefficients become zero, and observing that scale does not affect finding t values, we end up with the following simple term function for C(t):

    - - -

    That's a lot easier to work with: we see a fair number of terms that we can compute and then cache, giving us the following simplification:

    - - -

    This is a plain quadratic curve, and we know how to solve C(t) = 0; we use the quadratic formula:

    - - -

    We can easily compute this value if the discriminator isn't a negative number (because we only want real roots, not complex roots), and if x is not zero, because divisions by zero are rather useless.

    -

    Taking that into account, we compute t, we disregard any t value that isn't in the Bézier interval [0,1], and we now know at which t value(s) our curve will inflect.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - -
    -
    -

    The canonical form (for cubic curves)

    -

    While quadratic curves are relatively simple curves to analyze, the same cannot be said of the cubic curve. As a curvature is controlled by more than one control point, it exhibits all kinds of features like loops, cusps, odd colinear features, and as many as two inflection points because the curvature can change direction up to three times. Now, knowing what kind of curve we're dealing with means that some algorithms can be run more efficiently than if we have to implement them as generic solvers, so is there a way to determine the curve type without lots of work?

    -

    As it so happens, the answer is yes, and the solution we're going to look at was presented by Maureen C. Stone from Xerox PARC and Tony D. deRose from the University of Washington in their joint paper "A Geometric Characterization of Parametric Cubic curves". It was published in 1989, and defines curves as having a "canonical" form (i.e. a form that all curves can be reduced to) from which we can immediately tell what features a curve will have. So how does it work?

    -

    The first observation that makes things work is that if we have a cubic curve with four points, we can apply a linear transformation to these points such that three of the points end up on (0,0), (0,1) and (1,1), with the last point then being "somewhere". After applying that transformation, the location of that last point can then tell us what kind of curve we're dealing with. Specifically, we see the following breakdown:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    This is a fairly funky image, so let's see what the various parts of it mean...

    -

    We see the three fixed points at (0,0), (0,1) and (1,1). The various regions and boundaries indicate what property the original curve will have, if the fourth point is in/on that region or boundary. Specifically, if the fourth point is...

    -
      -
    1. ...anywhere inside the red zone, but not on its boundaries, the curve will be self-intersecting (yielding a loop). We won't know where it self-intersects (in terms of t values), but we are guaranteed that it does.

      -
    2. -
    3. ...on the left (red) edge of the red zone, the curve will have a cusp. We again don't know where, but we know there is one. This edge is described by the function:

      - - -
    4. -
    5. ...on the almost circular, lower right (pink) edge, the curve's end point touches the curve, forming a loop. This edge is described by the function:

      - - -
    6. -
    7. ...on the top (blue) edge, the curve's start point touches the curve, forming a loop. This edge is described by the function:

      - - -
    8. -
    9. ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

      -
    10. -
    11. ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two inflections (switching from concave to convex and then back again, or from convex to concave and then back again).

      -
    12. -
    13. ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

      -
    14. -
    -

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    -
    + + +
  • +

    ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    +
  • +
  • +

    + ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two + inflections (switching from concave to convex and then back again, or from convex to concave and then back again). +

    +
  • +
  • ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

  • + +

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    +
    +

    Wait, where do those lines come from?

    +

    + Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, + and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield + formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic + expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the + properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval. +

    +

    + The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general + cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to + it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general + curve does over the interval t=0 to t=1. +

    +

    + For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you + can probably follow along with this paper, although you might have to read it a few times before all the bits "click". +

    +
    -

    Wait, where do those lines come from?

    -

    Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval.

    -

    The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general curve does over the interval t=0 to t=1.

    -

    For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you can probably follow along with this paper, although you might have to read it a few times before all the bits "click".

    -
    - -

    So now the question becomes: how do we manipulate our curve so that it fits this canonical form, with three fixed points, and one "free" point? Enter linear algebra. Don't worry, I'll be doing all the math for you, as well as show you what the effect is on our curves, but basically we're going to be using linear algebra, rather than calculus, because "it's way easier". Sometimes a calculus approach is very hard to work with, when the equivalent geometrical solution is super obvious.

    -

    The approach is going to start with a curve that doesn't have all-colinear points (so we need to make sure the points don't all fall on a straight line), and then applying three graphics operations that you will probably have heard of: translation (moving all points by some fixed x- and y-distance), scaling (multiplying all points by some x and y scale factor), and shearing (an operation that turns rectangles into parallelograms).

    -

    Step 1: we translate any curve by -p1.x and -p1.y, so that the curve starts at (0,0). We're going to make use of an interesting trick here, by pretending our 2D coordinates are 3D, with the z coordinate simply always being 1. This is an old trick in graphics to overcome the limitations of 2D transformations: without it, we can only turn (x,y) coordinates into new coordinates of the form (ax + by, cx + dy), which means we can't do translation, since that requires we end up with some kind of (x + a, y + b). If we add a bogus z coordinate that is always 1, then we can suddenly add arbitrary values. For example:

    - - -

    Sweet! z stays 1, so we can effectively ignore it entirely, but we added some plain values to our x and y coordinates. So, if we want to subtract p1.x and p1.y, we use:

    - - -

    Running all our coordinates through this transformation gives a new set of coordinates, let's call those U, where the first coordinate lies on (0,0), and the rest is still somewhat free. Our next job is to make sure point 2 ends up lying on the x=0 line, so what we want is a transformation matrix that, when we run it, subtracts x from whatever x we currently have. This is called shearing, and the typical x-shear matrix and its transformation looks like this:

    - - -

    So we want some shearing value that, when multiplied by y, yields -x, so our x coordinate becomes zero. That value is simply -x/y, because *-x/y * y = -x*. Done:

    - - -

    Now, running this on all our points generates a new set of coordinates, let's call those V, which now have point 1 on (0,0) and point 2 on (0, some-value), and we wanted it at (0,1), so we need to do some scaling to make sure it ends up at (0,1). Additionally, we want point 3 to end up on (1,1), so we can also scale x to make sure its x-coordinate will be 1 after we run the transform. That means we'll be x-scaling by 1/point3x, and y-scaling by point2y. This is really easy:

    - - -

    Then, finally, this generates a new set of coordinates, let's call those W, of which point 1 lies on (0,0), point 2 lies on (0,1), and point three lies on (1, ...) so all that's left is to make sure point 3 ends up at (1,1) - but we can't scale! Point 2 is already in the right place, and y-scaling would move it out of (0,1) again, so our only option is to y-shear point three, just like how we x-sheared point 2 earlier. In this case, we do the same trick, but with y/x rather than x/y because we're not x-shearing but y-shearing. Additionally, we don't actually want to end up at zero (which is what we did before) so we need to shear towards an offset, in this case 1:

    - - -

    And this generates our final set of four coordinates. Of these, we already know that points 1 through 3 are (0,0), (0,1) and (1,1), and only the last coordinate is "free". In fact, given any four starting coordinates, the resulting "transformation mapped" coordinate will be:

    - - -

    Okay, well, that looks plain ridiculous, but: notice that every coordinate value is being offset by the initial translation, and also notice that a lot of terms in that expression are repeated. Even though the maths looks crazy as a single expression, we can just pull this apart a little and end up with an easy-to-calculate bit of code!

    -

    First, let's just do that translation step as a "preprocessing" operation so we don't have to subtract the values all the time. What does that leave?

    - - -

    Suddenly things look a lot simpler: the mapped x is fairly straight forward to compute, and we see that the mapped y actually contains the mapped x in its entirety, so we'll have that part already available when we need to evaluate it. In fact, let's pull out all those common factors to see just how simple this is:

    - - -

    That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it look!

    -
    + +

    + That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it + look! +

    +
    +

    How do you track all that?

    +

    + Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply + use Mathematica. Tracking all this math by hand is insane, and we invented computers, + literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the + program do the math for me. And real math, too, with symbols, not with numbers. In fact, + here's the Mathematica notebook if you want to see how + this works for yourself. +

    +

    + Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's + $344 for home use, up from $295 when I original wrote this, but it's + also free when you buy a $35 raspberry pi. Obviously, I bought a + raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do + it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for. +

    +
    -

    How do you track all that?

    -

    Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply use Mathematica. Tracking all this math by hand is insane, and we invented computers, literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the program do the math for me. And real math, too, with symbols, not with numbers. In fact, here's the Mathematica notebook if you want to see how this works for yourself.

    -

    Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's $344 for home use, up from $295 when I original wrote this, but it's also free when you buy a $35 raspberry pi. Obviously, I bought a raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for.

    -
    - -

    So, let's write up a sketch that'll show us the canonical form for any curve drawn in blue, overlaid on our canonical map, so that we can immediately tell which features our curve must have, based on where the fourth coordinate is located on the map:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - -
    -
    -

    Finding Y, given X

    -

    One common task that pops up in things like CSS work, or parametric equalizers, or image leveling, or any other number of applications where Bézier curves are used as control curves in a way that there is really only ever one "y" value associated with one "x" value, you might want to cut out the middle man, as it were, and compute "y" directly based on "x". After all, the function looks simple enough, finding the "y" value should be simple too, right? Unfortunately, not really. However, it is possible and as long as you have some code in place to help, it's not a lot of a work either.

    -

    We'll be tackling this problem in two stages: the first, which is the hard part, is figuring out which "t" value belongs to any given "x" value. For instance, have a look at the following graphic. On the left we have a Bézier curve that looks for all intents and purposes like it fits our criteria: every "x" has one and only one associated "y" value. On the right we see the function for just the "x" values: that's a cubic curve, but not a really crazy cubic curve. If you move the graphic's slider, you will see a red line drawn that corresponds to the x coordinate: this is a vertical line in the left graphic, and a horizontal line on the right.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    Now, if you look more closely at that right graphic, you'll notice something interesting: if we treat the red line as "the x axis", then the point where the function crosses our line is really just a root for the cubic function x(t) through a shifted "x-axis"... and we've already seen how to calculate roots, so let's just run cubic root finding - and not even the complicated cubic case either: because of the kind of curve we're starting with, we know there is only root, simplifying the code we need!

    -

    First, let's look at the function for x(t):

    - - -

    We can rewrite this to a plain polynomial form, by just fully writing out the expansion and then collecting the polynomial factors, as:

    - - -

    Nothing special here: that's a standard cubic polynomial in "power" form (i.e. all the terms are ordered by their power of t). So, given that a, b, c, d, and x(t) are all known constants, we can trivially rewrite this (by moving the x(t) across the equal sign) as:

    - - -

    You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using Cardano's algorithm, and we're left with some rather short code:

    + +

    + You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they + all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using + Cardano's algorithm, and we're left with some rather short code: +

    - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So the procedure is fairly straight forward: pick an x, find the associated t value, evaluate our curve for that t value, which gives us the curve's {x,y} coordinate, which means we know y for this x. Move the slider for the following graphic to see this in action:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - +y = curve.get(t).y + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Arc length

    -

    How long is a Bézier curve? As it turns out, that's not actually an easy question, because the answer requires maths that —much like root finding— cannot generally be solved the traditional way. If we have a parametric curve with fx(t) and fy(t), then the length of the curve, measured from start point to some point t = z, is computed using the following seemingly straight forward (if a bit overwhelming) formula:

    - - -

    or, more commonly written using Leibnitz notation as:

    - - -

    This formula says that the length of a parametric curve is in fact equal to the area underneath a function that looks a remarkable amount like Pythagoras' rule for computing the diagonal of a straight angled triangle. This sounds pretty simple, right? Sadly, it's far from simple... cutting straight to after the chase is over: for quadratic curves, this formula generates an unwieldy computation, and we're simply not going to implement things that way. For cubic Bézier curves, things get even more fun, because there is no "closed form" solution, meaning that due to the way calculus works, there is no generic formula that allows you to calculate the arc length. Let me just repeat this, because it's fairly crucial: for cubic and higher Bézier curves, there is no way to solve this function if you want to use it "for all possible coordinates".

    -

    Seriously: It cannot be done.

    -

    So we turn to numerical approaches again. The method we'll look at here is the Gauss quadrature. This approximation is a really neat trick, because for any nth degree polynomial it finds approximated values for an integral really efficiently. Explaining this procedure in length is way beyond the scope of this page, so if you're interested in finding out why it works, I can recommend the University of South Florida video lecture on the procedure, linked in this very paragraph. The general solution we're looking for is the following:

    - - -

    In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the better the approximation:

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    + +

    + In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips + sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The + more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the + better the approximation: +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    -

    Now, infinitely many terms to sum and infinitely thin rectangles are not something that computers can work with, so instead we're going to approximate the infinite summation by using a sum of a finite number of "just thin" rectangular strips. As long as we use a high enough number of thin enough rectangular strips, this will give us an approximation that is pretty close to what the real value is.

    -

    So, the trick is to come up with useful rectangular strips. A naive way is to simply create n strips, all with the same width, but there is a far better way using special values for C and f(t) depending on the value of n, which indicates how many strips we'll use, and it's called the Legendre-Gauss quadrature.

    -

    This approach uses strips that are not spaced evenly, but instead spaces them in a special way based on describing the function as a polynomial (the more strips, the more accurate the polynomial), and then computing the exact integral for that polynomial. We're essentially performing arc length computation on a flattened curve, but flattening it based on the intervals dictated by the Legendre-Gauss solution.

    -
    - -

    Note that one requirement for the approach we'll use is that the integral must run from -1 to 1. That's no good, because we're dealing with Bézier curves, and the length of a section of curve applies to values which run from 0 to "some value smaller than or equal to 1" (let's call that value z). Thankfully, we can quite easily transform any integral interval to any other integral interval, by shifting and scaling the inputs. Doing so, we get the following:

    - - -

    That may look a bit more complicated, but the fraction involving z is a fixed number, so the summation, and the evaluation of the f(t) values are still pretty simple.

    -

    So, what do we need to perform this calculation? For one, we'll need an explicit formula for f(t), because that derivative notation is handy on paper, but not when we have to implement it. We'll also need to know what these Ci and ti values should be. Luckily, that's less work because there are actually many tables available that give these values, for any n, so if we want to approximate our integral with only two terms (which is a bit low, really) then these tables would tell us that for n=2 we must use the following values:

    - - -

    Which means that in order for us to approximate the integral, we must plug these values into the approximate function, which gives us:

    - - -

    We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the Bézier curve functions Bx(t) and By(t).

    -
    + +

    + We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the + Bézier curve functions Bx(t) and By(t). +

    + -

    If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the start/end points on the inside?

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - +

    + If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), + we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, + with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make + a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the + start/end points on the inside? +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    +
    +

    + + Approximated arc length +

    +

    + Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length + instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This + will come with an error, but this can be made arbitrarily small by increasing the segment count. +

    +

    + If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal + effort: +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    -
    -
    -

    Approximated arc length

    -

    Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This will come with an error, but this can be made arbitrarily small by increasing the segment count.

    -

    If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal effort:

    -
    - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -
    - -

    You may notice that even though the error in length is actually pretty significant in absolute terms, even at a low number of segments we get a length that agrees with the true length when it comes to just the integer part of the arc length. Quite often, approximations can drastically speed things up!

    - -
    -
    -

    Curvature of a curve

    -

    If we have two curves, and we want to line them in up in a way that "looks right", what would we use as metric to let a computer decide what "looks right" means?

    -

    For instance, we can start by ensuring that the two curves share an end coordinate, so that there is no "gap" between the end of one and the start of the next curve, but that won't guarantee that things look right: both curves can be going in wildly different directions, and the resulting joined geometry will have a corner in it, rather than a smooth transition from one curve to the next.

    -

    What we want is to ensure that the curvature at the transition from one curve to the next "looks good". So, we start with a shared coordinate, and then also require that derivatives for both curves match at that coordinate. That way, we're assured that their tangents line up, which must mean the curve transition is perfectly smooth. We can even make the second, third, etc. derivatives match up for better and better transitions.

    -

    Problem solved!

    -

    However, there's a problem with this approach: if we think about this a little more, we realise that "what a curve looks like" and its derivative values are pretty much entirely unrelated. After all, the section on reordering curves showed us that the same looking curve can have an infinite number of curve expressions of arbitrarily high Bézier degree, and each of those will have wildly different derivative values.

    -

    So what we really want is some kind of expression that's not based on any particular expression of t, but is based on something that is invariant to the kind of function(s) we use to draw our curve. And the prime candidate for this is our curve expression, reparameterised for distance: no matter what order of Bézier curve we use, if we were able to rewrite it as a function of distance-along-the-curve, all those different degree Bézier functions would end up being the same function for "coordinate at some distance D along the curve".

    -

    We've seen this before... that's the arc length function.

    -

    So you might think that in order to find the curvature of a curve, we now need to solve the arc length function itself, and that this would be quite a problem because we just saw that there is no way to actually do that. Thankfully, we don't. We only need to know the form of the arc length function, which we saw above and is fairly simple, rather than needing to solve the arc length function. If we start with the arc length expression and the run through the steps necessary to determine its derivative (with an alternative, shorter demonstration of how to do this found over on Stackexchange), then the integral that was giving us so much problems in solving the arc length function disappears entirely (because of the fundamental theorem of calculus), and what we're left with us some surprisingly simple maths that relates curvature (denoted as κ, "kappa") to—and this is the truly surprising bit—a specific combination of derivatives of our original function.

    -

    Let me highlight what just happened, because it's pretty special:

    -
      -
    1. we wanted to make curves line up, and initially thought to match the curves' derivatives, but
    2. -
    3. that turned out to be a really bad choice, so instead
    4. -
    5. we picked a function that is basically impossible to work with, and then worked with that, which
    6. -
    7. gives us a simple formula that is and expression using the curves' derivatives.
    8. -
    -

    That's crazy!

    -

    But that's also one of the things that makes maths so powerful: even if your initial ideas are off the mark, you might be much closer than you thought you were, and the journey from "thinking we're completely wrong" to "actually being remarkably close to being right" is where we can find a lot of insight.

    -

    So, what does the function look like? This:

    - - -

    Which is really just a "short form" that glosses over the fact that we're dealing with functions of t, so let's expand that a tiny bit:

    - - -

    And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so evaluating this function for any t value is just a matter of basic arithematics.

    -

    In fact, let's just implement it right now:

    + +

    + And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any + (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that + looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier + curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so + evaluating this function for any t value is just a matter of basic arithematics. +

    +

    In fact, let's just implement it right now:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    That was easy! (Well okay, that "not a number" value will need to be taken into account by downstream code, but that's a reality of programming anyway)

    -

    With all of that covered, let's line up some curves! The following graphic gives you two curves that look identical, but use quadratic and cubic functions, respectively. As you can see, despite their derivatives being necessarily different, their curvature (thanks to being derived based on maths that "ignores" specific function derivative, and instead gives a formula that smooths out any differences) is exactly the same. And because of that, we can put them together such that the point where they overlap has the same curvature for both curves, giving us the smoothest transition.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    One thing you may have noticed in this sketch is that sometimes the curvature looks fine, but seems to be pointing in the wrong direction, making it hard to line up the curves properly. A way around that, of course, is to show the curvature on both sides of the curve, so let's just do that. But let's take it one step further: we can also compute the associated "radius of curvature", which gives us the implicit circle that "fits" the curve's curvature at any point, using what is possibly the simplest bit of maths found in this entire primer:

    - - -

    So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" our curve at some point that we can control by using a slider:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - + +

    + So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" + our curve at some point that we can control by using a slider: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    +
    +

    + + Tracing a curve at fixed distance intervals +

    +

    + Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance + intervals over time, like a train along a track, and you want to use Bézier curves. +

    +

    Now you have a problem.

    +

    + The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a + track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick + t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In + fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another. +

    +

    + The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be + straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To + make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for + this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length + function, which can have more inflection points. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + +

    + So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through + the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and + then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of + points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two + points, but if we have a high enough number of samples, we don't even need to bother. +

    +

    + So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t + curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks + like, by coloring each section of curve between two distance markers differently: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    +

    + However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to + constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT). +

    +
    +
    +

    + + Intersections +

    +

    + Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to + work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to + perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box + overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the + actual curve. +

    +

    + We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by + implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both + lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution. +

    +

    Line-line intersections

    +

    + If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are + an intervals on by linear algebra, using the procedure outlined in this + top coder + article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line + segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments. +

    +

    + The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on + (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection + point). +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    +

    Implementing line-line intersections

    +

    + Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, + but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe + you're using point structs for the line. Let's get coding: +

    -
    -
    -

    Tracing a curve at fixed distance intervals

    -

    Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance intervals over time, like a train along a track, and you want to use Bézier curves.

    -

    Now you have a problem.

    -

    The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another.

    -

    The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length function, which can have more inflection points.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two points, but if we have a high enough number of samples, we don't even need to bother.

    -

    So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks like, by coloring each section of curve between two distance markers differently:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    -

    However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT).

    - -
    -
    -

    Intersections

    -

    Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the actual curve.

    -

    We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution.

    -

    Line-line intersections

    -

    If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are an intervals on by linear algebra, using the procedure outlined in this top coder article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments.

    -

    The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection point).

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    - -

    Implementing line-line intersections

    -

    Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe you're using point structs for the line. Let's get coding:

    - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    - -
    + return lli4(line1.p1, line1.p2, line2.p1, line2.p2) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + -

    What about curve-line intersections?

    -

    Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the section on finding extremities.

    -
    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -
    +

    What about curve-line intersections?

    +

    + Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we + translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in + a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a + curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the + section on finding extremities. +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +
    -

    Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and curve splitting.

    +

    + Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the + curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and + curve splitting. +

    +
    +
    +

    + + Curve/curve intersection +

    +

    + Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" + technique: +

    +
      +
    1. + Take two curves C1 and C2, and treat them as a pair. +
    2. +
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. +
    5. + With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2). +
    6. +
    7. + For each pair, check whether their bounding boxes overlap. +
        +
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. +
      3. If there is overlap, rerun all steps for this pair.
      4. +
      +
    8. +
    9. + Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we + might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value + (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative). +
    10. +
    +

    + This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then + taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections. +

    +

    + The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click + the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value + that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the + algorithm, so you can try this with lots of different curves. +

    +

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + +

    + Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into + two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at + t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each + iteration, and each successive step homing in on the curve's self-intersection points. +

    +
    +
    +

    + + The projection identity +

    +

    + De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to + draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. + Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point + around to change the curve's shape. +

    +

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    +

    + In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want + to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know + has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for + reasons that will become obvious. +

    +

    + So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following + graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which + taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what + happens to that value? +

    +
    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +
    -
    -
    -

    Curve/curve intersection

    -

    Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" technique:

    -
      -
    1. Take two curves C1 and C2, and treat them as a pair.
    2. -
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. -
    5. With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2).
    6. -
    7. For each pair, check whether their bounding boxes overlap.
        -
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. -
      3. If there is overlap, rerun all steps for this pair.
      4. -
      -
    8. -
    9. Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative).
    10. -
    -

    This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections.

    -

    The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the algorithm, so you can try this with lots of different curves.

    -

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - -

    Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each iteration, and each successive step homing in on the curve's self-intersection points.

    - -
    -
    -

    The projection identity

    -

    De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point around to change the curve's shape.

    -

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    -

    In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for reasons that will become obvious.

    -

    So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what happens to that value?

    -
    - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -
    - -

    So these graphics show us several things:

    -
      -
    1. a point at the tip of the curve construction's "hat": let's call that A, as well as
    2. -
    3. our on-curve point give our chosen t value: let's call that B, and finally,
    4. -
    5. a point that we get by projecting A, through B, onto the line between the curve's start and end points: let's call that C.
    6. -
    7. for both quadratic and cubic curves, two points e1 and e2, which represent the single-to-last step in de Casteljau's algorithm: in the last step, we find B at (1-t) * e1 + t * e2.
    8. -
    9. for cubic curves, also the points v1 and v2, which together with A represent the first step in de Casteljau's algorithm: in the next step, we find e1 and e2.
    10. -
    -

    These three values A, B, and C allow us to derive an important identity formula for quadratic and cubic Bézier curves: for any point on the curve with some t value, the ratio of distances from A to B and B to C is fixed: if some t value sets up a C that is 20% away from the start and 80% away from the end, then it doesn't matter where the start, end, or control points are; for that t value, C will always lie at 20% from the start and 80% from the end point. Go ahead, pick an on-curve point in either graphic and then move all the other points around: if you only move the control points, start and end won't move, and so neither will C, and if you move either start or end point, C will move but its relative position will not change.

    -

    So, how can we compute C? We start with our observation that C always lies somewhere between the start and end points, so logically C will have a function that interpolates between those two coordinates:

    - - -

    If we can figure out what the function u(t) looks like, we'll be done. Although we do need to remember that this u(t) will have a different form depending on whether we're working with quadratic or cubic curves. Running through the maths (with thanks to Boris Zbarsky) shows us the following two formulae:

    - - -

    And

    - - -

    So, if we know the start and end coordinates and the t value, we know C without having to calculate the A or even B coordinates. In fact, we can do the same for the ratio function. As another function of t, we technically don't need to know what A or B or C are. It, too, can be expressed as a pure function of t.

    -

    We start by observing that, given A, B, and C, the following always holds:

    - - -

    Working out the maths for this, we see the following two formulae for quadratic and cubic curves:

    - - -

    And

    - - -

    Which now leaves us with some powerful tools: given three points (start, end, and "some point on the curve"), as well as a t value, we can construct curves. We can compute C using the start and end points and our u(t) function, and once we have C, we can use our on-curve point (B) and the ratio(t) function to find A:

    - - -

    With A found, finding e1 and e2 for quadratic curves is a matter of running the linear interpolation with t between start and A to yield e1, and between A and end to yield e2. For cubic curves, there is no single pair of points that can act as e1 and e2 (there are infinitely many, because the tangent at B is a free parameter for cubic curves) so as long as the distance ratio between e1 to B and B to e2 is the Bézier ratio (1-t):t, we are free to pick any pair, after which we can reverse engineer v1 and v2:

    - - -

    And then reverse engineer the curve's control points:

    - - -

    So: if we have a curve's start and end points, as well as some third point B that we want the curve to pass through, then for any t value we implicitly know all the ABC values, which (combined with an educated guess on appropriate e1 and e2 coordinates for cubic curves) gives us the necessary information to reconstruct a curve's "de Casteljau skeleton". Which means that we can now do several things: we can "fit" curves using only three points, which means we can also "mold" curves by moving an on-curve point but leaving its start and end points, and then reconstruct the curve based on where we moved the on-curve point to. These are very useful things, and we'll look at both in the next few sections.

    - -
    -
    -

    Creating a curve from three points

    -

    Given the preceding section, you might be wondering if we can use that knowledge to just "create" curves by placing some points and having the computer do the rest, to which the answer is: that's exactly what we can now do!

    -

    For quadratic curves, things are pretty easy. Technically, we'll need a t value in order to compute the ratio function used in computing the ABC coordinates, but we can just as easily approximate one by treating the distance between the start and B point, and B and end point as a ratio, using

    - - -

    With this code in place, creating a quadratic curve from three points is literally just computing the ABC values, and using A as our curve's control point:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    For cubic curves we need to do a little more work, but really only just a little. We're first going to assume that a decent curve through the three points should approximate a circular arc, which first requires knowing how to fit a circle to three points. You may remember (if you ever learned it!) that a line between two points on a circle is called a chord, and that one property of chords is that the line from the center of any chord, perpendicular to that chord, passes through the center of the circle.

    -

    That means that if we have three points on a circle, we have three (different) chords, and consequently, three (different) lines that go from those chords through the center of the circle: if we find two of those lines, then their intersection will be our circle's center, and the circle's radius will—by definition!—be the distance from the center to any of our three points:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    With that covered, we now also know the tangent line to our point B, because the tangent to any point on the circle is a line through that point, perpendicular to the line from that point to the center. That just leaves marking appropriate points e1 and e2 on that tangent, so that we can construct a new cubic curve hull. We use the approach as we did for quadratic curves to automatically determine a reasonable t value, and then our e1 and e2 coordinates must obey the standard de Casteljau rule for linear interpolation:

    - - -

    Where d is the total length of the line segment from e1 to e2. So how long do we make that? There are again all kinds of approaches we can take, and a simple-but-effective one is to set the length of that segment to "one third the length of the baseline". This forces e1 and e2 to always be the "linear curve" distance apart, which means if we place our three points on a line, it will actually look like a line. Nice! The last thing we'll need to do is make sure to flip the sign of d depending on which side of the baseline our B is located, so we don't end up creating a funky curve with a loop in it. To do this, we can use the atan2 function:

    - - -

    This angle φ will be between 0 and π if B is "above" the baseline (rotating all three points so that the start is on the left and the end is the right), so we can use a relatively straight forward check to make sure we're using the correct sign for our value d:

    - - -

    The result of this approach looks as follows:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; more on that later), we're only trying to come up with some reasonable e1 and e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives us:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    That looks perfectly serviceable!

    -

    Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at implementing curve molding in the section after that, so read on!

    + +

    The result of this approach looks as follows:

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +

    + It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, + we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; + more on that later), we're only trying to come up with some reasonable e1 and + e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives + us: +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + +

    That looks perfectly serviceable!

    +

    + Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" + curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding + the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at + implementing curve molding in the section after that, so read on! +

    +
    +
    +

    + + Projecting a point onto a Bézier curve +

    +

    + Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're + trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the + curve our cursor will be closest to. So, how do we project points onto a curve? +

    +

    + If the Bézier curve is of low enough order, we might be able to + work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a + numerical approach again. We'll be finding our ideal t value using a + binary search. First, we do a coarse distance-check based on + t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast: +

    -
    -
    -

    Projecting a point onto a Bézier curve

    -

    Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the curve our cursor will be closest to. So, how do we project points onto a curve?

    -

    If the Bézier curve is of low enough order, we might be able to work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a numerical approach again. We'll be finding our ideal t value using a binary search. First, we do a coarse distance-check based on t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    After this runs, we know that LUT[i] is the coordinate on the curve in our LUT that is closest to the point we want to project, so that's a pretty good initial guess as to what the best projection onto our curve is. To refine it, we note that LUT[i] is a better guess than both LUT[i-1] and LUT[i+1], but there might be an even better projection somewhere else between those two values, so that's what we're going to be testing for, using a variation of the binary search.

    -
      -
    1. we start with our point p, and the t values t1=LUT[i-1].t and t2=LUT[i+1].t, which span an interval v = t2-t1.
    2. -
    3. we test this interval in five spots: the start, middle, and end (which we already have), and the two points in between the middle and start/end points
    4. -
    5. we then check which of these five points is the closest to our original point p, and then repeat step 1 with the points before and after the closest point we just found.
    6. -
    -

    This makes the interval we check smaller and smaller at each iteration, and we can keep running the three steps until the interval becomes so small as to lead to distances that are, for all intents and purposes, the same for all points.

    -

    So, let's see that in action: in this case, I'm going to arbitrarily say that if we're going to run the loop until the interval is smaller than 0.001, and show you what that means for projecting your mouse cursor or finger tip onto a rather complex Bézier curve (which, of course, you can reshape as you like). Also shown are the original three points that our coarse check finds.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + -
    -
    -

    Intersections with a circle

    -

    It might seem odd to cover this subject so much later than the line/line, line/curve, and curve/curve intersection topics from several sections earlier, but the reason we can't cover circle/curve intersections is that we can't really discuss circle/curve intersection until we've covered the kind of lookup table (LUT) walking that the section on projecting a point onto a curve uses. To see why, let's look at what we would have to do if we wanted to find the intersections between a curve and a circle using calculus.

    -

    First, we observe that "finding intersections" in this case means that, given a circle defined by a center point c = (x,y) and a radius r, we want to find all points on the Bezier curve for which the distance to the circle's center point is equal to the circle radius, which by definition means those points lie on the circle, and so count as intersections. In maths, that means we're trying to solve:

    - - -

    Which seems simple enough. Unfortunately, when we expand that dist function, things get a lot more problematic:

    - - -

    And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the Abel-Ruffini theorem that we saw before, we can't solve this by just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left with a monstrous function to solve.

    -

    So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a bit more code to make sure we find all of them, but let's look at the steps involved:

    + +

    + And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the + Abel-Ruffini theorem that we saw before, we can't solve this by + just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to + actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and + all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left + with a monstrous function to solve. +

    +

    + So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the + on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance + r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a + bit more code to make sure we find all of them, but let's look at the steps involved: +

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    - -

    This is very similar to the code in the previous section, with an extra input r for the circle radius, and a minor change in the "distance for this coordinate": rather than just distance(coordinate, p) we want to know the difference between that distance and the circle radius. After all, if that difference is zero, then the distance from the coordinate to the circle center is exactly the radius, so the coordinate lies on both the curve and the circle.

    -

    So far so good.

    -

    However, we also want to make sure we find all the points, not just a single one, so we need a little more code for that:

    + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    After running this code, values will be the list of all LUT coordinates that are closest to the distance r: we can use those values to run the same kind of refinement lookup we used for point projection (with the caveat that we're now not checking for smallest distance, but for "distance closest to r"), and we'll have all our intersection points. Of course, that does require explaining what findClosest does: rather than looking for a global minimum, we're now interested in finding a local minimum, so instead of checking a single point and looking at its distance value, we check three points ("current", "previous" and "before previous") and then check whether they form a local minimum:

    + start = i + 2; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    In words: given a start index, the circle center and radius, and our LUT, we check where (closest to our start index) we can find a local minimum for the difference between "the distance from the curve to the circle center", and the circle's radius. We track this by looking at three values (associated with the indices index-2, index-1, and index), and we know we've found a local minimum if the three values show that the middle value (pd1) is less than either value beside it. When we do, we can set our "best guess, relative to start" as index-1. Of course, since we're now checking values relative to some start value, we might not find another candidate value at all, in which case we return start - 1, so that a simple "is the result less than start?" lets us determine that there are no more intersections to find.

    -

    Finally, while not necessary for point projection, there is one more step we need to perform when we run the binary refinement function on our candidate LUT indices, because we've so far only been testing using distances "closest to the radius of the circle", and that's actually not good enough... we need distances that are the radius of the circle. So, after running the refinement for each of these indices, we need to discard any final value that isn't the circle radius. And because we're working with floating point numbers, what this really means is that we need to discard any value that's a pixel or more "off". Or, if we want to get really fancy, "some small epsilon value".

    -

    Based on all of that, the following graphic shows this off for the standard cubic curve (which you can move the coordinates around for, of course) and a circle with a controllable radius centered on the graphic's center, using the code approach described above.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    And of course, for the full details, click that "view source" link.

    + return start + i + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + -
    -
    -

    Molding a curve

    -

    Armed with knowledge of the "ABC" relation, point-on-curve projection, and guestimating reasonable looking helper values for cubic curve construction, we can finally cover curve molding: updating a curve's shape interactively, by dragging points on the curve around.

    -

    For quadratic curve, this is a really simple trick: we project our cursor onto the curve, which gives us a t value and initial B coordinate. We don't even need the latter: with our t value and "wherever the cursor is" as target B, we can compute the associated C:

    - - -

    And then the associated A:

    - - -

    And we're done, because that's our new quadratic control point!

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    As before, cubic curves are a bit more work, because while it's easy to find our initial t value and ABC values, getting those all-important e1 and e2 coordinates is going to pose a bit of a problem... in the section on curve creation, we were free to pick an appropriate t value ourselves, which allowed us to find appropriate e1 and e2 coordinates. That's great, but when we're curve molding we don't have that luxury: whatever point we decide to start moving around already has its own t value, and its own e1 and e2 values, and those may not make sense for the rest of the curve.

    -

    For example, let's see what happens if we just "go with what we get" when we pick a point and start moving it around, preserving its t value and e1/e2 coordinates:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    That looks reasonable, close to the original point, but the further we drag our point, the less "useful" things become. Especially if we drag our point across the baseline, rather than turning into a nice curve.

    -

    One way to combat this might be to combine the above approach with the approach from the creating curves section: generate both the "unchanged t/e1/e2" curve, as well as the "idealized" curve through the start/cursor/end points, with idealized t value, and then interpolating between those two curves:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    The slide controls the "falloff distance" relative to where the original point on the curve is, so that as we drag our point around, it interpolates with a bias towards "preserving t/e1/e2" closer to the original point, and bias towards "idealized" form the further away we move our point, with anything that's further than our falloff distance simply being the idealized curve. We don't even try to interpolate at that point.

    -

    A more advanced way to try to smooth things out is to implement continuous molding, where we constantly update the curve as we move around, and constantly change what our B point is, based on constantly projecting the cursor on the curve as we're updating it - this is, you won't be surprised to learn, tricky, and beyond the scope of this section: interpolation (with a reasonable distance) will do for now!

    - -
    -
    -

    Curve fitting

    -

    Given the previous section, one question you might have is "what if I don't want to guess t values?". After all, plenty of graphics packages do automated curve fitting, so how can we implement that in a way that just finds us reasonable t values all on its own?

    -

    And really this is just a variation on the question "how do I get the curve through these X points?", so let's look at that. Specifically, let's look at the answer: "curve fitting". This is in fact a rather rich field in geometry, applying to anything from data modelling to path abstraction to "drawing", so there's a fair number of ways to do curve fitting, but we'll look at one of the most common approaches: something called a least squares polynomial regression. In this approach, we look at the number of points we have in our data set, roughly determine what would be an appropriate order for a curve that would fit these points, and then tackle the question "given that we want an nth order curve, what are the coordinates we can find such that our curve is "off" by the least amount?".

    -

    Now, there are many ways to determine how "off" points are from the curve, which is where that "least squares" term comes in. The most common tool in the toolbox is to minimise the squared distance between each point we have, and the corresponding point on the curve we end up "inventing". A curve with a snug fit will have zero distance between those two, and a bad fit will have non-zero distances between every such pair. It's a workable metric. You might wonder why we'd need to square, rather than just ensure that distance is a positive value (so that the total error is easy to compute by just summing distances) and the answer really is "because it tends to be a little better". There's lots of literature on the web if you want to deep-dive the specific merits of least squared error metrics versus least absolute error metrics, but those are well beyond the scope of this material.

    -

    So let's look at what we end up with in terms of curve fitting if we start with the idea of performing least squares Bézier fitting. We're going to follow a procedure similar to the one described by Jim Herold over on his "Least Squares Bézier Fit" article, and end with some nice interactive graphics for doing some curve fitting.

    -

    Before we begin, we're going to use the curve in matrix form. In the section on matrices, I mentioned that some things are easier if we use the matrix representation of a Bézier curve rather than its calculus form, and this is one of those things.

    -

    As such, the first step in the process is expressing our Bézier curve as powers/coefficients/coordinate matrix T x M x C, by expanding the Bézier functions.

    -
    - -

    Revisiting the matrix representation

    -

    Rewriting Bézier functions to matrix form is fairly easy, if you first expand the function, and then arrange them into a multiple line form, where each line corresponds to a power of t, and each column is for a specific coefficient. First, we expand the function:

    - - -

    And then we (trivially) rearrange the terms across multiple lines:

    - - -

    This rearrangement has "factors of t" at each row (the first row is t⁰, i.e. "1", the second row is t¹, i.e. "t", the third row is t²) and "coefficient" at each column (the first column is all terms involving "a", the second all terms involving "b", the third all terms involving "c").

    -

    With that arrangement, we can easily decompose this as a matrix multiplication:

    - - -

    We can do the same for the cubic curve, of course. We know the base function for cubics:

    - - -

    So we write out the expansion and rearrange:

    - - -

    Which we can then decompose:

    - - -

    And, of course, we can do this for quartic curves too (skipping the expansion step):

    - - -

    And so and on so on. Now, let's see how to use these T, M, and C, to do some curve fitting.

    -
    + +

    + And so and on so on. Now, let's see how to use these T, M, and C, to do some curve + fitting. +

    + -

    Let's get started: we're going to assume we picked the right order curve: for n points we're fitting an n-1th order curve, so we "start" with a vector P that represents the coordinates we already know, and for which we want to do curve fitting:

    - - -

    Next, we need to figure out appropriate t values for each point in the curve, because we need something that lets us tie "the actual coordinate" to "some point on the curve". There's a fair number of different ways to do this (and a large part of optimizing "the perfect fit" is about picking appropriate t values), but in this case let's look at two "obvious" choices:

    -
      -
    1. equally spaced t values, and
    2. -
    3. t values that align with distance along the polygon.
    4. -
    -

    The first one is really simple: if we have n points, then we'll just assign each point i a t value of (i-1)/(n-1). So if we have four points, the first point will have t=(1-1)/(4-1)=0/3, the second point will have t=(2-1)/(4-1)=1/3, the third point will have t=2/3, and the last point will be t=1. We're just straight up spacing the t values to match the number of points we have.

    -

    The second one is a little more interesting: since we're doing polynomial regression, we might as well exploit the fact that our base coordinates just constitute a collection of line segments. At the first point, we're fixing t=0, and the last point, we want t=1, and anywhere in between we're simply going to say that t is equal to the distance along the polygon, scaled to the [0,1] domain.

    -

    To get these values, we first compute the general "distance along the polygon" matrix:

    - - -

    Where length() is literally just that: the length of the line segment between the point we're looking at, and the previous point. This isn't quite enough, of course: we still need to make sure that all the values between i=1 and i=n fall in the [0,1] interval, so we need to scale all values down by whatever the total length of the polygon is:

    - - -

    And now we can move on to the actual "curve fitting" part: what we want is a function that lets us compute "ideal" control point values such that if we build a Bézier curve with them, that curve passes through all our original points. Or, failing that, have an overall error distance that is as close to zero as we can get it. So, let's write out what the error distance looks like.

    -

    As mentioned before, this function is really just "the distance between the actual coordinate, and the coordinate that the curve evaluates to for the associated t value", which we'll square to get rid of any pesky negative signs:

    - - -

    Since this function only deals with individual coordinates, we'll need to sum over all coordinates in order to get the full error function. So, we literally just do that; the total error function is simply the sum of all these individual errors:

    - - -

    And here's the trick that justifies using matrices: while we can work with individual values using calculus, with matrices we can compute as many values as we make our matrices big, all at the "same time", We can replace the individual terms pi with the full P coordinate matrix, and we can replace Bézier(si) with the matrix representation T x M x C we talked about before, which gives us:

    - - -

    In which we can replace the rather cumbersome "squaring" operation with a more conventional matrix equivalent:

    - - -

    Here, the letter T is used instead of the number 2, to represent the matrix transpose; each row in the original matrix becomes a column in the transposed matrix instead (row one becomes column one, row two becomes column two, and so on).

    -

    This leaves one problem: T isn't actually the matrix we want: we don't want symbolic t values, we want the actual numerical values that we computed for S, so we need to form a new matrix, which we'll call 𝕋, that makes use of those, and then use that 𝕋 instead of T in our error function:

    - - -

    Which, because of the first and last values in S, means:

    - - -

    Now we can properly write out the error function as matrix operations:

    - - -

    So, we have our error function: we now need to figure out the expression for where that function has minimal value, e.g. where the error between the true coordinates and the coordinates generated by the curve fitting is smallest. Like in standard calculus, this requires taking the derivative, and determining where that derivative is zero:

    - - -
    + +
    +

    Where did this derivative come from?

    +

    + That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I + straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know. +

    +

    + So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific + case, I + posted a question + to Math.stackexchange, and received a answer that goes into way more detail than I had + hoped to receive. +

    +

    + Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean + maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that + true?". There are answers. They might just require some time to come to understand. +

    +
    -

    Where did this derivative come from?

    -

    That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know.

    -

    So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific case, I posted a question to Math.stackexchange, and received a answer that goes into way more detail than I had hoped to receive.

    -

    Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that true?". There are answers. They might just require some time to come to understand.

    -
    - -

    Now, given the above derivative, we can rearrange the terms (following the rules of matrix algebra) so that we end up with an expression for C:

    - - -

    Here, the "to the power negative one" is the notation for the matrix inverse. But that's all we have to do: we're done. Starting with P and inventing some t values based on the polygon the coordinates in P define, we can compute the corresponding Bézier coordinates C that specify a curve that goes through our points. Or, if it can't go through them exactly, as near as possible.

    -

    So before we try that out, how much code is involved in implementing this? Honestly, that answer depends on how much you're going to be writing yourself. If you already have a matrix maths library available, then really not that much code at all. On the other hand, if you are writing this from scratch, you're going to have to write some utility functions for doing your matrix work for you, so it's really anywhere from 50 lines of code to maybe 200 lines of code. Not a bad price to pay for being able to fit curves to pre-specified coordinates.

    -

    So let's try it out! The following graphic lets you place points, and will start computing exact-fit curves once you've placed at least three. You can click for more points, and the code will simply try to compute an exact fit using a Bézier curve of the appropriate order. Four points? Cubic Bézier. Five points? Quartic. And so on. Of course, this does break down at some point: depending on where you place your points, it might become mighty hard for the fitter to find an exact fit, and things might actually start looking horribly off once there's enough points for compound floating point rounding errors to start making a difference (which is around 10~11 points).

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - -

    You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be able to freely manipulate them and see what the effect on your curve is.

    - -
    -
    -

    Bézier curves and Catmull-Rom curves

    -

    Taking an excursion to different splines, the other common design curve is the Catmull-Rom spline, which unlike Bézier curves pass through each control point, so they offer a kind of "built-in" curve fitting.

    -

    In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what looks like a single Catmull-Rom curve is actually a spline: a single curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is defined by the point's coordinates, and the tangent for those points, the latter of which can trivially be derived from knowing the previous and next point:

    - + +

    + You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio + along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get + direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be + able to freely manipulate them and see what the effect on your curve is. +

    +
    +
    +

    + + Bézier curves and Catmull-Rom curves +

    +

    + Taking an excursion to different splines, the other common design curve is the + Catmull-Rom spline, which unlike Bézier curves + pass through each control point, so they offer a kind of "built-in" curve fitting. +

    +

    + In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets + you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the + tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    + Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what + looks like a single Catmull-Rom curve is actually a spline: a single + curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, + and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is + defined by the point's coordinates, and the tangent for those points, the latter of which + can trivially be derived from knowing the + previous and next point: +

    + - -

    One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on that in the next section.

    -

    In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of variations that make things considerable more complex):

    + +

    + One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up + with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent + should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on + that in the next section. +

    +

    + In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of + variations that make things + considerable more + complex): +

    - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Now, since a Catmull-Rom curve is a form of cubic Hermite spline, and as cubic Bézier curves are also a form of cubic Hermite spline, we run into an interesting bit of maths programming: we can convert one to the other and back, and the maths for doing so is surprisingly simple!

    -

    The main difference between Catmull-Rom curves and Bézier curves is "what the points mean":

    -
      -
    • A cubic Bézier curve is defined by a start point, a control point that implies the tangent at the start, a control point that implies the tangent at the end, and an end point, plus a characterizing matrix that we can multiply by that point vector to get on-curve coordinates.
    • -
    • A Catmull-Rom curve is defined by a start point, a tangent that for that starting point, an end point, and a tangent for that end point, plus a characteristic matrix that we can multiple by the point vector to get on-curve coordinates.
    • -
    -

    Those are very similar, so let's see exactly how similar they are. We've already see the matrix form for Bézier curves, so how different is the matrix form for Catmull-Rom curves?:

    - - -

    That's pretty dang similar. So the question is: how can we convert that expression with Catmull-Rom matrix and vector into an expression of the Bézier matrix and vector? The short answer is of course "by using linear algebra", but the longer answer is the rest of this section, and involves some maths that you may not even care for: if you just want to know the (incredibly simple) conversions between the two curve forms, feel free to skip to the end of the following explanation, but if you want to how we can get one from the other... let's get mathing!

    -
    - -

    Deriving the conversion formulae

    -

    In order to convert between Catmull-Rom curves and Bézier curves, we need to know two things. Firstly, how to express the Catmull-Rom curve using a "set of four coordinates", rather than a mix of coordinates and tangents, and secondly, how to convert those Catmull-Rom coordinates to and from Bézier form.

    -

    We start with the first part, to figure out how we can go from Catmull-Rom V coordinates to Bézier P coordinates, by applying "some matrix T". We don't know what that T is yet, but we'll get to that:

    - - -

    So, this mapping says that in order to map a Catmull-Rom "point + tangent" vector to something based on an "all coordinates" vector, we need to determine the mapping matrix such that applying T yields P2 as start point, P3 as end point, and two tangents based on the lines between P1 and P3, and P2 nd P4, respectively.

    -

    Computing T is really more "arranging the numbers":

    - - -

    Thus:

    - - -

    However, we're not quite done, because Catmull-Rom curves have that "tension" parameter, written as τ (a lowercase"tau"), which is a scaling factor for the tangent vectors: the bigger the tension, the smaller the tangents, and the smaller the tension, the bigger the tangents. As such, the tension factor goes in the denominator for the tangents, and before we continue, let's add that tension factor into both our coordinate vector representation, and mapping matrix T:

    - - -

    With the mapping matrix properly done, let's rewrite the "point + tangent" Catmull-Rom matrix form to a matrix form in terms of four coordinates, and see what we end up with:

    - - -

    Replace point/tangent vector with the expression for all-coordinates:

    - - -

    and merge the matrices:

    - - -

    This looks a lot like the Bézier matrix form, which as we saw in the chapter on Bézier curves, should look like this:

    - - -

    So, if we want to express a Catmull-Rom curve using a Bézier curve, we'll need to turn this Catmull-Rom bit:

    - - -

    Into something that looks like this:

    - - -

    And the way we do that is with a fairly straight forward bit of matrix rewriting. We start with the equality we need to ensure:

    - - -

    Then we remove the coordinate vector from both sides without affecting the equality:

    - - -

    Then we can "get rid of" the Bézier matrix on the right by left-multiply both with the inverse of the Bézier matrix:

    - - -

    A matrix times its inverse is the matrix equivalent of 1, and because "something times 1" is the same as "something", so we can just outright remove any matrix/inverse pair:

    - - -

    And now we're basically done. We just multiply those two matrices and we know what V is:

    - - -

    We now have the final piece of our function puzzle. Let's run through each step.

    -
      -
    1. Start with the Catmull-Rom function:
    2. -
    - - -
      -
    1. rewrite to pure coordinate form:
    2. -
    - - -
      -
    1. rewrite for "normal" coordinate vector:
    2. -
    - - -
      -
    1. merge the inner matrices:
    2. -
    - - -
      -
    1. rewrite for Bézier matrix form:
    2. -
    - - -
      -
    1. and transform the coordinates so we have a "pure" Bézier expression:
    2. -
    - - -

    And we're done: we finally know how to convert these two curves!

    -
    + +

    And we're done: we finally know how to convert these two curves!

    + -

    If we have a Catmull-Rom curve defined by four coordinates P1 through P4, then we can draw that curve using a Bézier curve that has the vector:

    - - -

    Similarly, if we have a Bézier curve defined by four coordinates P1 through P4, we can draw that using a standard tension Catmull-Rom curve with the following coordinate values:

    - - -

    Or, if your API allows you to specify Catmull-Rom curves using plain coordinates:

    - - - -
    -
    -

    Creating a Catmull-Rom curve from three points

    -

    Much shorter than the previous section: we saw that Catmull-Rom curves need at least 4 points to draw anything sensible, so how do we create a Catmull-Rom curve from three points?

    -

    Short and sweet: we don't.

    -

    We run through the maths that lets us create a cubic Bézier curve, and then convert its coordinates to Catmull-Rom form using the conversion formulae we saw above.

    - -
    -
    -

    Forming poly-Bézier curves

    -

    Much like lines can be chained together to form polygons, Bézier curves can be chained together to form poly-Béziers, and the only trick required is to make sure that:

    -
      -
    1. the end point of each section is the starting point of the following section, and
    2. -
    3. the derivatives across that dual point line up.
    4. -
    -

    Unless you want sharp corners, of course. Then you don't even need 2.

    -

    We'll cover three forms of poly-Bézier curves in this section. First, we'll look at the kind that just follows point 1. where the end point of a segment is the same point as the start point of the next segment. This leads to poly-Béziers that are pretty hard to work with, but they're the easiest to implement:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    Dragging the control points around only affects the curve segments that the control point belongs to, and moving an on-curve point leaves the control points where they are, which is not the most useful for practical modelling purposes. So, let's add in the logic we need to make things a little better. We'll start by linking up control points by ensuring that the "incoming" derivative at an on-curve point is the same as it's "outgoing" derivative:

    - - -

    We can effect this quite easily, because we know that the vector from a curve's last control point to its last on-curve point is equal to the derivative vector. If we want to ensure that the first control point of the next curve matches that, all we have to do is mirror that last control point through the last on-curve point. And mirroring any point A through any point B is really simple:

    - - -

    So let's implement that and see what it gets us. The following two graphics show a quadratic and a cubic poly-Bézier curve again, but this time moving the control points around moves others, too. However, you might see something unexpected going on for quadratic curves...

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    As you can see, quadratic curves are particularly ill-suited for poly-Bézier curves, as all the control points are effectively linked. Move one of them, and you move all of them. Not only that, but if we move the on-curve points, it's possible to get a situation where a control point cannot satisfy the constraint that it's the reflection of its two neighbouring control points... This means that we cannot use quadratic poly-Béziers for anything other than really, really simple shapes. And even then, they're probably the wrong choice. Cubic curves are pretty decent, but the fact that the derivatives are linked means we can't manipulate curves as well as we might if we relaxed the constraints a little.

    -

    So: let's relax the requirement a little.

    -

    We can change the constraint so that we still preserve the angle of the derivatives across sections (so transitions from one section to the next will still look natural), but give up the requirement that they should also have the same vector length. Doing so will give us a much more useful kind of poly-Bézier curve:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    Cubic curves are now better behaved when it comes to dragging control points around, but the quadratic poly-Bézier still has the problem that moving one control points will move the control points and may ending up defining "the next" control point in a way that doesn't work. Quadratic curves really aren't very useful to work with...

    -

    Finally, we also want to make sure that moving the on-curve coordinates preserves the relative positions of the associated control points. With that, we get to the kind of curve control that you might be familiar with from applications like Photoshop, Inkscape, Blender, etc.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    Again, we see that cubic curves are now rather nice to work with, but quadratic curves have a new, very serious problem: we can move an on-curve point in such a way that we can't compute what needs to "happen next". Move the top point down, below the left and right points, for instance. There is no way to preserve correct control points without a kink at the bottom point. Quadratic curves: just not that good...

    -

    A final improvement is to offer fine-level control over which points behave which, so that you can have "kinks" or individually controlled segments when you need them, with nicely well-behaved curves for the rest of the path. Implementing that, is left as an exercise for the reader.

    - -
    -
    -

    Curve offsetting

    -

    Perhaps you're like me, and you've been writing various small programs that use Bézier curves in some way or another, and at some point you make the step to implementing path extrusion. But you don't want to do it pixel based; you want to stay in the vector world. You find that extruding lines is relatively easy, and tracing outlines is coming along nicely (although junction caps and fillets are a bit of a hassle), and then you decide to do things properly and add Bézier curves to the mix. Now you have a problem.

    -

    Unlike lines, you can't simply extrude a Bézier curve by taking a copy and moving it around, because of the curvatures; rather than a uniform thickness, you get an extrusion that looks too thin in places, if you're lucky, but more likely will self-intersect. The trick, then, is to scale the curve, rather than simply copying it. But how do you scale a Bézier curve?

    -

    Bottom line: you can't. So you cheat. We're not going to do true curve scaling, or rather curve offsetting, because that's impossible. Instead we're going to try to generate 'looks good enough' offset curves.

    -
    - -

    "What do you mean, you can't? Prove it."

    -

    First off, when I say "you can't," what I really mean is "you can't offset a Bézier curve with another Bézier curve", not even by using a really high order curve. You can find the function that describes the offset curve, but it won't be a polynomial, and as such it cannot be represented as a Bézier curve, which has to be a polynomial. Let's look at why this is:

    -

    From a mathematical point of view, an offset curve O(t) is a curve such that, given our original curve B(t), any point on O(t) is a fixed distance d away from coordinate B(t). So let's math that:

    - - -

    However, we're working in 2D, and d is a single value, so we want to turn it into a vector. If we want a point distance d "away" from the curve B(t) then what we really mean is that we want a point at d times the "normal vector" from point B(t), where the "normal" is a vector that runs perpendicular ("at a right angle") to the tangent at B(t). Easy enough:

    - - -

    Now this still isn't very useful unless we know what the formula for N(t) is, so let's find out. N(t) runs perpendicular to the original curve tangent, and we know that the tangent is simply B'(t), so we could just rotate that 90 degrees and be done with it. However, we need to ensure that N(t) has the same magnitude for every t, or the offset curve won't be at a uniform distance, thus not being an offset curve at all. The easiest way to guarantee this is to make sure N(t) always has length 1, which we can achieve by dividing B'(t) by its magnitude:

    - - -

    Determining the length requires computing an arc length, and this is where things get Tricky with a capital T. First off, to compute arc length from some start a to end b, we must use the formula we saw earlier. Noting that "length" is usually denoted with double vertical bars:

    - - -

    So if we want the length of the tangent, we plug in B'(t), with t = 0 as start and -t = 1 as end:

    - - -

    And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is screwing everything up, because it turns our nice polynomials into things that are no longer polynomials.

    -

    There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine because they do, but they don't; remember that only the base function has binomial coefficients. That's before we factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call them circles, and they have much simpler functions than Bézier curves.

    -

    So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve.

    -

    And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the real offset curve would look like, if we could compute it.

    -
    + +

    + And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is + screwing everything up, because it turns our nice polynomials into things that are no longer polynomials. +

    +

    + There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with + unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine + because they do, but they don't; remember that only the base function has binomial coefficients. That's before we + factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make + all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call + them circles, and they have much simpler functions than Bézier curves. +

    +

    + So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means + N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means + that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for + O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve. +

    +

    + And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier + curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, + much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the + real offset curve would look like, if we could compute it. +

    + -

    So, you cannot offset a Bézier curve perfectly with another Bézier curve, no matter how high-order you make that other Bézier curve. However, we can chop up a curve into "safe" sub-curves (where "safe" means that all the control points are always on a single side of the baseline, and the midpoint of the curve at t=0.5 is roughly in the center of the polygon defined by the curve coordinates) and then point-scale each sub-curve with respect to its scaling origin (which is the intersection of the point normals at the start and end points).

    -

    A good way to do this reduction is to first find the curve's extreme points, as explained in the earlier section on curve extremities, and use these as initial splitting points. After this initial split, we can check each individual segment to see if it's "safe enough" based on where the center of the curve is. If the on-curve point for t=0.5 is too far off from the center, we simply split the segment down the middle. Generally this is more than enough to end up with safe segments.

    -

    The following graphics show off curve offsetting, and you can use the slider to control the distance at which the curve gets offset. The curve first gets reduced to safe segments, each of which is then offset at the desired distance. Especially for simple curves, particularly easily set up for quadratic curves, no reduction is necessary, but the more twisty the curve gets, the more the curve needs to be reduced in order to get segments that can safely be scaled.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    You may notice that this may still lead to small 'jumps' in the sub-curves when moving the curve around. This is caused by the fact that we're still performing a naive form of offsetting, moving the control points the same distance as the start and end points. If the curve is large enough, this may still lead to incorrect offsets.

    - -
    -
    -

    Graduated curve offsetting

    -

    What if we want to do graduated offsetting, starting at some distance s but ending at some other distance e? Well, if we can compute the length of a curve (which we can if we use the Legendre-Gauss quadrature approach) then we can also determine how far "along the line" any point on the curve is. With that knowledge, we can offset a curve so that its offset curve is not uniformly wide, but graduated between with two different offset widths at the start and end.

    -

    Like normal offsetting we cut up our curve in sub-curves, and then check at which distance along the original curve each sub-curve starts and ends, as well as to which point on the curve each of the control points map. This gives us the distance-along-the-curve for each interesting point in the sub-curve. If we call the total length of all sub-curves seen prior to seeing "the current" sub-curve S (and if the current sub-curve is the first one, S is zero), and we call the full length of our original curve L, then we get the following graduation values:

    -
      -
    • start: map S from interval (0,L) to interval (s,e)
    • -
    • c1: map(<strong>S+d1</strong>, 0,L, s,e), d1 = distance along curve to projection of c1
    • -
    • c2: map(<strong>S+d2</strong>, 0,L, s,e), d2 = distance along curve to projection of c2
    • -
    • ...
    • -
    • end: map(<strong>S+length(subcurve)</strong>, 0,L, s,e)
    • -
    -

    At each of the relevant points (start, end, and the projections of the control points onto the curve) we know the curve's normal, so offsetting is simply a matter of taking our original point, and moving it along the normal vector by the offset distance for each point. Doing so will give us the following result (these have with a starting width of 0, and an end width of 40 pixels, but can be controlled with your up and down arrow keys):

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - - -
    -
    -

    Circles and quadratic Bézier curves

    -

    Circles and Bézier curves are very different beasts, and circles are infinitely easier to work with than Bézier curves. Their formula is much simpler, and they can be drawn more efficiently. But, sometimes you don't have the luxury of using circles, or ellipses, or arcs. Sometimes, all you have are Bézier curves. For instance, if you're doing font design, fonts have no concept of geometric shapes, they only know straight lines, and Bézier curves. OpenType fonts with TrueType outlines only know quadratic Bézier curves, and OpenType fonts with Type 2 outlines only know cubic Bézier curves. So how do you draw a circle, or an ellipse, or an arc?

    -

    You approximate.

    -

    We already know that Bézier curves cannot model all curves that we can think of, and this includes perfect circles, as well as ellipses, and their arc counterparts. However, we can certainly approximate them to a degree that is visually acceptable. Quadratic and cubic curves offer us different curvature control, so in order to approximate a circle we will first need to figure out what the error is if we try to approximate arcs of increasing degree with quadratic and cubic curves, and where the coordinates even lie.

    -

    Since arcs are mid-point-symmetrical, we need the control points to set up a symmetrical curve. For quadratic curves this means that the control point will be somewhere on a line that intersects the baseline at a right angle. And we don't get any choice on where that will be, since the derivatives at the start and end point have to line up, so our control point will lie at the intersection of the tangents at the start and end point.

    -

    First, let's try to fit the quadratic curve onto a circular arc. In the following sketch you can move the mouse around over a unit circle, to see how well, or poorly, a quadratic curve can approximate the arc from (1,0) to where your mouse cursor is:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    As you can see, things go horribly wrong quite quickly; even trying to approximate a quarter circle using a quadratic curve is a bad idea. An eighth of a turns might look okay, but how okay is okay? Let's apply some maths and find out. What we're interested in is how far off our on-curve coordinates are with respect to a circular arc, given a specific start and end angle. We'll be looking at how much space there is between the circular arc, and the quadratic curve's midpoint.

    -

    We start out with our start and end point, and for convenience we will place them on a unit circle (a circle around 0,0 with radius 1), at some angle φ:

    - - -

    What we want to find is the intersection of the tangents, so we want a point C such that:

    - - -

    i.e. we want a point that lies on the vertical line through S (at some distance a from S) and also lies on the tangent line through E (at some distance b from E). Solving this gives us:

    - - -

    First we solve for b:

    - - -

    which yields:

    - - -

    which we can then substitute in the expression for a:

    - - -

    A quick check shows that plugging these values for a and b into the expressions for Cx and Cy give the same x/y coordinates for both "a away from A" and "b away from B", so let's continue: now that we know the coordinate values for C, we know where our on-curve point T for t=0.5 (or angle φ/2) is, because we can just evaluate the Bézier polynomial, and we know where the circle arc's actual point P is for angle φ/2:

    - - -

    We compute T, observing that if t=0.5, the polynomial values (1-t)², 2(1-t)t, and t² are 0.25, 0.5, and 0.25 respectively:

    - - -

    Which, worked out for the x and y components, gives:

    - - -

    And the distance between these two is the standard Euclidean distance:

    - - -

    So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter circle and eighth circle?

    -
    - - plotted for 0 ≤ φ ≤ π: - - - plotted for 0 ≤ φ ≤ ½π: - - - - - plotted for 0 ≤ φ ≤ ¼π: -
    + +

    + So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter + circle and eighth circle? +

    + + + + + + + + +
    + + plotted for 0 ≤ φ ≤ π: + + + plotted for 0 ≤ φ ≤ ½π: + + + + + plotted for 0 ≤ φ ≤ ¼π: +
    -

    We now see why the eighth circle arc looks decent, but the quarter circle arc doesn't: an error of roughly 0.06 at t=0.5 means we're 6% off the mark... we will already be off by one pixel on a circle with pixel radius 17. Any decent sized quarter circle arc, say with radius 100px, will be way off if approximated by a quadratic curve! For the eighth circle arc, however, the error is only roughly 0.003, or 0.3%, which explains why it looks so close to the actual eighth circle arc. In fact, if we want a truly tiny error, like 0.001, we'll have to contend with an angle of (rounded) 0.593667, which equates to roughly 34 degrees. We'd need 11 quadratic curves to form a full circle with that precision! (technically, 10 and ten seventeenth, but we can't do partial curves, so we have to round up). That's a whole lot of curves just to get a shape that can be drawn using a simple function!

    -

    In fact, let's flip the function around, so that if we plug in the precision error, labelled ε, we get back the maximum angle for that precision:

    - - -

    And frankly, things are starting to look a bit ridiculous at this point, we're doing way more maths than we've ever done, but thankfully this is as far as we need the maths to take us: If we plug in the precisions 0.1, 0.01, 0.001 and 0.0001 we get the radians values 1.748, 1.038, 0.594 and 0.3356; in degrees, that means we can cover roughly 100 degrees (requiring four curves), 59.5 degrees (requiring six curves), 34 degrees (requiring 11 curves), and 19.2 degrees (requiring a whopping nineteen curves).

    -

    The bottom line? Quadratic curves are kind of lousy if you want circular (or elliptical, which are circles that have been squashed in one dimension) curves. We can do better, even if it's just by raising the order of our curve once. So let's try the same thing for cubic curves.

    - -
    -
    -

    Circular arcs and cubic Béziers

    -

    Let's look at approximating circles and circular arcs using cubic Béziers. How much better is that?

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    At cursory glance, a fair bit better, but let's find out how much better by looking at how to construct the Bézier curve.

    -

    A construction diagram for a cubic approximation of a circular arc

    -

    The start and end points are trivial, but the mid point requires a bit of work, but it's mostly basic trigonometry once we know the angle θ for our circular arc: if we scale our circular arc to a unit circle, we can always start our arc, with radius 1, at (1,0) and then given our arc angle θ, we also know that the circular arc has length θ (because unit circles are nice that way). We also know our end point, because that's just (cos(θ), sin(θ)), and so the challenge is to figure out what control points we need in order for the curve at t=0.5 to exactly touch the circular arc at the angle θ/2:

    -

    So let's again formally describe this:

    - - -

    Only P3 isn't quite straight-forward here, and its description is based on the fact that the triangle (origin, P4, P3) is a right angled triangle, with the distance between the origin and P4 being 1 (because we're working with a unit circle), and the distance between P4 and P3 being k, so that we can represent P3 as "The point P4 plus the vector from the origin to P4 but then rotated a quarter circle, counter-clockwise, and scaled by k".

    -

    With that, we can determine the y-coordinates for A, B, e1, and e2, after which we have all the information we need to determine what the value of k is. We can find these values by using (no surprise here) linear interpolation between known points, as A is midway between P2 and P3, e1 is between A and "midway between P1 and P2" (which is "half height" P2), and so forth:

    - - -

    Which now gives us two identities for B, because in addition to determining B through linear interpolation, we also know that B's y coordinate is just sin(θ/2): we started this exercise by saying we were going to approximate the circular arc using a Bézier curve that had its midpoint, which is point B, touching the unit circle at the arc's half-angle, by definition making B the point at (cos(θ/2), sin(θ/2)).

    -

    This means we can equate the two identities we now have for By and solve for k.

    -
    - -

    Deriving k

    -

    Solving for k is fairly straight forward, but it's a fair few steps, and if you just the immediate result: using a tool like Wolfram Alpha is definitely the way to go. That said, let's get going:

    - - -

    And finally, we can take further advantage of several trigonometric identities to drastically simplify our formula for k:

    - - -

    And we're done.

    -
    + +

    And we're done.

    + -

    So, the distance of our control points to the start/end points can be expressed as a number that we get from an almost trivial expression involving the circular arc's angle:

    - - -

    Which means that for any circular arc with angle θ and radius r, our Bézier approximation based on three points of incidence is:

    - - -

    Which also gives us the commonly found value of 0.55228 for quarter circles, based on them having an angle of half π:

    - - -

    And thus giving us the following Bézier coordinates for a quarter circle of radius r:

    - - -
    + +
    +

    So, how accurate is this?

    +

    + Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points + that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum + deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but + rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we + get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff. +

    +

    + So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, + finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around + that t value, repeating as many times as necessary to get us an arbitrarily precise value of t: +

    -

    So, how accurate is this?

    -

    Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff.

    -

    So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around that t value, repeating as many times as necessary to get us an arbitrarily precise value of t:

    - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    Plus, how often do you get to write a function with that name?

    -

    Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity first, and then coming back down as we approach 2π)

    - - -
    - - - - - -
    - error plotted for 0 ≤ φ ≤ 2π - - error plotted for 0 ≤ φ ≤ π - - error plotted for 0 ≤ φ ≤ ½π -
    + return getMostWrongT(radius, bezier, worst_t - stepsize, worst_t + stepsize) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again:

    -

    +

    Plus, how often do you get to write a function with that name?

    +

    + Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see + what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity + first, and then coming back down as we approach 2π) +

    + + + + + + + + + + + + + +
    + + + + + +
    + error plotted for 0 ≤ φ ≤ 2π + + error plotted for 0 ≤ φ ≤ π + + error plotted for 0 ≤ φ ≤ ½π +
    -

    Yeah... the error of a cubic approximation for a quarter circle turns out to be two orders of magnitude better. At approximately 0.00027 (or: just shy of being 2.7 pixels off for a circle with a radius of 10,000 pixels) the increase in precision over quadratic curves is quite spectacular - certainly good enough that no one in their right mind should ever use quadratic curves.

    -
    +

    + That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we + can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again: +

    +

    -

    So that's it, kappa is 4/3 · tan(θ/4) , we're done! ...or are we?

    -

    Can we do better?

    -

    Technically: yes, we can. But I'm going to prefix this section with "we can, and we should investigate that possibility, but let me warn you up front that the result is only better if we're going to hard-code the values". We're about to get into the weeds and the standard three-points-of-incidence value is so good already that for most applications, trying to do better won't make any sense at all.

    -

    So with that said: what we calculated above is an upper bound for a best fit Bézier curve for a circular arc: anywhere we don't touch the circular arc in our approximation, we've "overshot" the arc. What if we dropped our value for k just a little, so that the curve starts out as an over-estimation, but then crosses the circular arc, yielding an region of underestimation, and then crosses the circular arc again, with another region of overestimation. This might give us a lower overall error, so let's see what we can do.

    -

    First, let's express the total error (given circular arc angle θ, and some k) using standard calculus notation:

    - - -

    This says that the error function for a given angle and value of k is equal to the "infinite" sum of differences between our curve and the circular arc, as we run t from 0 to 1, using an infinitely small step size. between subsequent t values.

    -

    Now, since we want to find the minimal error, that means we want to know where along this function things go from "error is getting progressively less" to "error is increasing again", which means we want to know where its derivative is zero, which as mathematical expression looks like:

    - - -

    And here we have the most direct application of the Fundamental Theorem of Calculus: the derivative and integral are each other's inverse operations, so they cancel out, leaving us with our original function:

    - - -

    And now we just solve for that... oh wait. We've seen this before. In order to solve this, we'd end up needing to solve this:

    - - -

    And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section on arc lengths—there is no symbolic solution for this equasion. Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer!

    -
    + +

    + And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section + on arc lengths—there is no symbolic solution for this equasion. + Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer! +

    +
    +

    Iterating on a solution

    +

    + By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the + value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's + center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just + binary search our way to the most accurate value for c that gets us that middle case. +

    +

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    -

    Iterating on a solution

    -

    By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just binary search our way to the most accurate value for c that gets us that middle case.

    -

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    - - - - -
    1 - -
    2
    3
    4
    - -

    And then the binary search algorithm, which can be found in pretty much any CS textbook, as well as more online articles, tutorials, and blog posts than you can ever read in a life time:

    + return binarySearch(radius, angle, points, lowerBound, upperBound) + + + + 2 + + + 3 + + + 4 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Using the following radialError function, which samples the curve's approximation of the circular arc over several points (although the first and last point will never contribute anything, so we skip them):

    + return binarySearch(radius, angle, points, value, upperBound) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a vector, we can get its length to the origin using a magnitude call.

    -

    Examining the result

    -

    Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, for each angle, plot what the difference between the circular arc and the Bézier approximation looks like:

    -

    image-20210419085430711

    -

    Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from 0 to θ:

    -

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    - - -
    - - - - - -
    - max deflection using unit scale - - max deflection at 10x scale - - max deflection at 100x scale -
    + return err + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    -

    As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference.

    -
    +

    + In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a + vector, we can get its length to the origin using a magnitude call. +

    +

    Examining the result

    +

    + Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, + for each angle, plot what the difference between the circular arc and the Bézier approximation looks like: +

    +

    image-20210419085430711

    +

    + Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the + plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from + 0 to θ: +

    +

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    + + + + + + + + + + + + + +
    + + + + + +
    + max deflection using unit scale + + max deflection at 10x scale + + max deflection at 100x scale +
    -

    At this point it should be clear that while, yes, there are improvement to be had, they're essentially insignificant while also being much more computationally expensive.

    -

    TL;DR: just tell me which value I should be using

    -

    It depends on what we need to do. If we just want the best value for quarter circles, and we're going to hard code the value for k, then there is no reason to hard-code the constant k=4/3*tan(pi/8) when you can just as easily hard-code the constant as k=0.551784777779014 instead.

    -

    If you need "the" value for quarter circles, use 0.551785 instead of 0.55228

    -

    However, for dynamic arc approximation, in code that tries to fit circular paths using Bézier paths instead, it should be fairly obvious that the simple function involving a tangent computation, two divisions, and one multiplication, is vastly more performant than running all the code we ended writing just to get a 25% lower error value, and most certainly worth preferring over getting the "more accurate" value.

    -

    If you need to fit Béziers to circular arcs on the fly, use 4/3 * tan(θ/4)

    -

    However, always remember that if you're writing for humans, you can typically use the best of both worlds: as the user interacts with their curves, you should draw their curves instead of drawing approximations of them. If they need to draw circles or circular arcs, draw those, and only approximate them with a Bézier curve when the data needs to be exported to a format that doesn't support those. Ideally with a preview mechanism that highlights where the errors will be, and how large they will be.

    -

    If you're writing code for graphics design by humans, use circular arcs for circular arcs

    -

    And that's it. We have pretty well exhausted this subject. There are different metrics we could use to find "different best k values", like trying to match arc length (e.g. when we're optimizing for material cost), or minimizing the area between the circular arc and the Bézier curve (e.g. when we're optimizing for inking), or minimizing the rate of change of the Bézier's curvature (e.g. when we're optimizing for curve traversal) and they all yield values that are so similar that it's almost certainly not worth it. (For instance, for quarter circle approximations those values are 0.551777, 0.5533344, and 0.552184 respectively. Much like the 0.551785 we get from minimizing the maximum deflection, none of these values are significantly better enough to prefer them over the upper bound value).

    +

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    +

    + As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by + 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an + almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference. +

    +
    -
    -
    -

    Approximating Bézier curves with circular arcs

    -

    Let's look at doing the exact opposite of the previous section: rather than approximating circular arc using Bézier curves, let's approximate Bézier curves using circular arcs.

    -

    We already saw in the section on circle approximation that this will never yield a perfect equivalent, but sometimes you need circular arcs, such as when you're working with fabrication machinery, or simple vector languages that understand lines and circles, but not much else.

    -

    The approach is fairly simple: pick a starting point on the curve, and pick two points that are further along the curve. Determine the circle that goes through those three points, and see if it fits the part of the curve we're trying to approximate. Decent fit? Try spacing the points further apart. Bad fit? Try spacing the points closer together. Keep doing this until you've found the "good approximation/bad approximation" boundary, record the "good" arc, and then move the starting point up to overlap the end point we previously found. Rinse and repeat until we've covered the entire curve.

    -

    We already saw how to fit a circle through three points in the section on creating a curve from three points, and finding the arc through those points is straight-forward: pick one of the three points as start point, pick another as an end point, and the arc has to necessarily go from the start point, to the end point, over the remaining point.

    -

    So, how can we convert a Bézier curve into a (sequence of) circular arc(s)?

    -
      -
    • Start at t=0
    • -
    • Pick two points further down the curve at some value m = t + n and e = t + 2n
    • -
    • Find the arc that these points define
    • -
    • Determine how close the found arc is to the curve:
        -
      • Pick two additional points e1 = t + n/2 and e2 = t + n + n/2.
      • -
      • These points, if the arc is a good approximation of the curve interval chosen, should - lie on the circle, so their distance to the center of the circle should be the - same as the distance from any of the three other points to the center.
      • -
      • For point points, determine the (absolute) error between the radius of the circle, and the -actual distance from the center of the circle to the point on the curve.
      • -
      • If this error is too high, we consider the arc bad, and try a smaller interval.
      • -
      -
    • -
    -

    The result of this is shown in the next graphic: we start at a guaranteed failure: s=0, e=1. That's the entire curve. The midpoint is simply at t=0.5, and then we start performing a binary search.

    -
      -
    1. We start with low=0, mid=0.5 and high=1
    2. -
    3. That'll fail, so we retry with the interval halved: {0, 0.25, 0.5}
        -
      • If that arc's good, we move back up by half distance: {0, 0.375, 0.75}.
      • -
      • However, if the arc was still bad, we move down by half the distance: {0, 0.125, 0.25}.
      • -
      -
    4. -
    5. We keep doing this over and over until we have two arcs, in sequence, of which the first arc is good, and the second arc is bad. When we find that pair, we've found the boundary between a good approximation and a bad approximation, and we pick the good arc.
    6. -
    -

    The following graphic shows the result of this approach, with a default error threshold of 0.5, meaning that if an arc is off by a combined half pixel over both verification points, then we treat the arc as bad. This is an extremely simple error policy, but already works really well. Note that the graphic is still interactive, and you can use your up and down arrow keys keys to increase or decrease the error threshold, to see what the effect of a smaller or larger error threshold is.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    With that in place, all that's left now is to "restart" the procedure by treating the found arc's end point as the new to-be-determined arc's starting point, and using points further down the curve. We keep trying this until the found end point is for t=1, at which point we are done. Again, the following graphic allows for up and down arrow key input to increase or decrease the error threshold, so you can see how picking a different threshold changes the number of arcs that are necessary to reasonably approximate a curve:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    So... what is this good for? Obviously, if you're working with technologies that can't do curves, but can do lines and circles, then the answer is pretty straightforward, but what else? There are some reasons why you might need this technique: using circular arcs means you can determine whether a coordinate lies "on" your curve really easily (simply compute the distance to each circular arc center, and if any of those are close to the arc radii, at an angle between the arc start and end, bingo, this point can be treated as lying "on the curve"). Another benefit is that this approximation is "linear": you can almost trivially travel along the arcs at fixed speed. You can also trivially compute the arc length of the approximated curve (it's a bit like curve flattening). The only thing to bear in mind is that this is a lossy equivalence: things that you compute based on the approximation are guaranteed "off" by some small value, and depending on how much precision you need, arc approximation is either going to be super useful, or completely useless. It's up to you to decide which, based on your application!

    - -
    -
    -

    B-Splines

    -

    No discussion on Bézier curves is complete without also giving mention of that other beast in the curve design space: B-Splines. Easily confused to mean Bézier splines, that's not actually what they are; they are "basis function" splines, which makes a lot of difference, and we'll be looking at those differences in this section. We're not going to dive as deep into B-Splines as we have for Bézier curves (that would be an entire primer on its own) but we'll be looking at how B-Splines work, what kind of maths is involved in computing them, and how to draw them based on a number of parameters that you can pick for individual B-Splines.

    -

    First off: B-Splines are piecewise, polynomial interpolation curves, where the "single curve" is built by performing polynomial interpolation over a set of points, using a sliding window of a fixed number of points. For instance, a "cubic" B-Spline defined by twelve points will have its curve built by evaluating the polynomial interpolation of four points, and the curve can be treated as a lot of different sections, each controlled by four points at a time, such that the full curve consists of smoothly connected sections defined by points {1,2,3,4}, {2,3,4,5}, ..., {8,9,10,11}, and finally {9,10,11,12}, for eight sections.

    -

    What do they look like? They look like this! Tap on the graphic to add more points, and move points around to see how they map to the spline curve drawn.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - -

    The important part to notice here is that we are not doing the same thing with B-Splines that we do for poly-Béziers or Catmull-Rom curves: both of the latter simply define new sections as literally "new sections based on new points", so a 12 point cubic poly-Bézier curve is actually impossible, because we start with a four point curve, and then add three more points for each section that follows, so we can only have 4, 7, 10, 13, 16, etc. point Poly-Béziers. Similarly, while Catmull-Rom curves can grow by adding single points, this addition of a single point introduces three implicit Bézier points. Cubic B-Splines, on the other hand, are smooth interpolations of each possible curve involving four consecutive points, such that at any point along the curve except for our start and end points, our on-curve coordinate is defined by four control points.

    -

    Consider the difference to be this:

    -
      -
    • for Bézier curves, the curve is defined as an interpolation of points, but:
    • -
    • for B-Splines, the curve is defined as an interpolation of curves.
    • -
    -

    In fact, let's look at that again, but this time with the base curves shown, too. Each consecutive four points define one curve:

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have a look at how things work.

    -

    How to compute a B-Spline curve: some maths

    -

    Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the end, just like for Bézier curves), by evaluating the following function:

    - + +

    + In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have + a look at how things work. +

    +

    How to compute a B-Spline curve: some maths

    +

    + Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic + B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the + end, just like for Bézier curves), by evaluating the following function: +

    + - -

    Which, honestly, doesn't tell us all that much. All we can see is that a point on a B-Spline curve is defined as "a mix of all the control points, weighted somehow", where the weighting is achieved through the N(...) function, subscripted with an obvious parameter i, which comes from our summation, and some magical parameter k. So we need to know two things: 1. what does N(t) do, and 2. what is that k? Let's cover both, in reverse order.

    -

    The parameter k represents the "knot interval" over which a section of curve is defined. As we learned earlier, a B-Spline curve is itself an interpolation of curves, and we can treat each transition where a control point starts or stops influencing the total curvature as a "knot on the curve". -Doing so for a degree d B-Spline with n control point gives us d + n + 1 knots, defining d + n intervals along the curve, and it is these intervals that the above k subscript to the N() function applies to.

    -

    Then the N() function itself. What does it look like?

    - - -

    So this is where we see the interpolation: N(t) for an (i,k) pair (that is, for a step in the above summation, on a specific knot interval) is a mix between N(t) for (i,k-1) and N(t) for (i+1,k-1), so we see that this is a recursive iteration where i goes up, and k goes down, so it seem reasonable to expect that this recursion has to stop at some point; obviously, it does, and specifically it does so for the following i/k values:

    - - -

    And this function finally has a straight up evaluation: if a t value lies within a knot-specific interval once we reach a k=1 value, it "counts", otherwise it doesn't. We did cheat a little, though, because for all these values we need to scale our t value first, so that it lies in the interval bounded by knots[d] and knots[n], which are the start point and end point where curvature is controlled by exactly order control points. For instance, for degree 3 (=order 4) and 7 control points, with knot vector [1,2,3,4,5,6,7,8,9,10,11], we map t from [the interval 0,1] to the interval [4,8], and then use that value in the functions above, instead.

    -

    Can we simplify that?

    -

    We can, yes.

    -

    People far smarter than us have looked at this work, and two in particular — Maurice Cox and Carl de Boor — came to a mathematically pleasing solution: to compute a point P(t), we can compute this point by evaluating d(t) on a curve section between knots i and i+1:

    - - -

    This is another recursive function, with k values decreasing from the curve order to 1, and the value α (alpha) defined by:

    - - -

    That looks complicated, but it's not. Computing alpha is just a fraction involving known, plain numbers. And, once we have our alpha value, we also have (1-alpha) because it's a trivial subtraction. Computing the d() function is thus mostly a matter of computing pretty simple arithmetical statements, with some caching of results so we can refer to them as we recurve. While the recursion might see computationally expensive, the total algorithm is cheap, as each step only involves very simple maths.

    -

    Of course, the recursion does need a stop condition:

    - - -

    So, we actually see two stopping conditions: either i becomes 0, in which case d() is zero, or k becomes zero, in which case we get the same "either 1 or 0" that we saw in the N() function above.

    -

    Thanks to Cox and de Boor, we can compute points on a B-Spline pretty easily using the same kind of linear interpolation we saw in de Casteljau's algorithm. For instance, if we write out d() for i=3 and k=3, we get the following recursion diagram:

    - - -

    That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. It's really quite elegant.

    -

    One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then summing back up "to the left". We can just start on the right and work our way left immediately.

    -

    Running the computation

    -

    Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on this Michigan Tech page, but an easier to read version is implemented by b-spline.js, so we'll look at its code.

    -

    Given an input value t, we first map the input to a value from the domain [0,1] to the domain [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped t value lies on:

    + +

    + That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a + mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are + themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. + It's really quite elegant. +

    +

    + One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the + d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so + in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point + vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then + summing back up "to the left". We can just start on the right and work our way left immediately. +

    +

    Running the computation

    +

    + Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, + and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on + this Michigan Tech page, but an easier to read version + is implemented by b-spline.js, so we'll look at its code. +

    +

    + Given an input value t, we first map the input to a value from the domain [0,1] to the domain + [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped + t value lies on: +

    - - -
    1 - -
    2
    3
    - -

    after running this code, s is the index for the section the point will lie on. We then run the algorithm mentioned on the MU page (updated to use this description's variable names):

    +} + + + + 2 + + + 3 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those v[i-1] don't try to use an array index that doesn't exist)

    -

    Open vs. closed paths

    -

    Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need more than just a single point.

    -

    Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than separate coordinate objects.

    -

    Manipulating the curve through the knot vector

    -

    The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at:

    -
      -
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. -
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. -
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. -
    7. we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a uniform vector in between.
    8. -
    -

    Uniform B-Splines

    -

    The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to the curvature.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve.

    -

    The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get rid of these, by being clever about how we apply the following uniformity-breaking approach instead...

    -

    Reducing local curve complexity by collapsing intervals

    -

    Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost and the curve "kinks".

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    Open-Uniform B-Splines

    -

    By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the curve not starting and ending where we'd kind of like it to:

    -

    For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences that determine the curve shape.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    Non-uniform B-Splines

    -

    This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than that any value knots[k+1] should be greater than or equal to knots[k].

    -

    One last thing: Rational B-Splines

    -

    While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like for rational Bézier curves.

    - - - 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. - - - - - -

    Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in arbitrary-precision 2D design due to the level of control a NURBS curve offers designers.

    -

    While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the last.

    -

    Extending our implementation to cover rational splines

    -

    The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the extended dimension.

    -

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    -

    We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how many dimensions it needs to interpolate.

    -

    In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight information.

    -

    Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing (x'/w', y'/w'). And that's it, we're done!

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    - +

    + (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the + interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those + v[i-1] don't try to use an array index that doesn't exist) +

    +

    Open vs. closed paths

    +

    + Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last + point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first + and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As + such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly + more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need + more than just a single point. +

    +

    + Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for + points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same + coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing + references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than + separate coordinate objects. +

    +

    Manipulating the curve through the knot vector

    +

    + The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As + mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the + values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on + intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting + things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at: +

    +
      +
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. +
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. +
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. +
    7. + we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a + uniform vector in between. +
    8. +
    +

    Uniform B-Splines

    +

    + The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire + curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or + [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines + the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to + the curvature. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    + This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every + interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve. +

    +

    + The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can + perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get + rid of these, by being clever about how we apply the following uniformity-breaking approach instead... +

    +

    Reducing local curve complexity by collapsing intervals

    +

    + Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the + sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, + and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost + and the curve "kinks". +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    Open-Uniform B-Splines

    +

    + By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the + curve not starting and ending where we'd kind of like it to: +

    +

    + For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in + which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values + N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector + [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences + that determine the curve shape. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    Non-uniform B-Splines

    +

    + This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to + pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than + that any value knots[k+1] should be greater than or equal to knots[k]. +

    +

    One last thing: Rational B-Splines

    +

    + While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's + "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a + ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a + control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like + for rational Bézier curves. +

    + + + 스크립트가 꺼져 있어 대체 이미지를 대신 표시합니다. + + + + + +

    + Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the + NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, + the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an + important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in + arbitrary-precision 2D design due to the level of control a NURBS curve offers designers. +

    +

    + While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform + Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, + they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the + last. +

    +

    Extending our implementation to cover rational splines

    +

    + The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the + control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to + one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the + extended dimension. +

    +

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    +

    + We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate + interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how + many dimensions it needs to interpolate. +

    +

    + In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the + final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight + information. +

    +

    + Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing + (x'/w', y'/w'). And that's it, we're done! +

    +
    +
    + -

    Comments and questions

    -

    First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing.

    -

    With that said, on to the comments!

    -
    +

    + + Comments and questions +

    +

    + First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how + to let me know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing. +

    +

    With that said, on to the comments!

    +
    +
    +
    +
    +
    -
    - -
    + -
    - - - - - - - - - \ No newline at end of file + + + diff --git a/docs/news/2020-09-18.html b/docs/news/2020-09-18.html index 2aa1ba11..9c20fa10 100644 --- a/docs/news/2020-09-18.html +++ b/docs/news/2020-09-18.html @@ -1,153 +1,181 @@ + + + + A Primer on Bézier Curves - Rewriting the tech stack - - - - A Primer on Bézier Curves - Rewriting the tech stack + - + - - - - + - - - + - :root[lang="en-GB"] { - font-family: "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 18px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    - +
    +

    Rewriting the tech stack

    +
    Fri, 18 Sep 2020
    +
    -
    - -

    Rewriting the tech stack

    -
    Fri, 18 Sep 2020
    +
    + +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    -
    +

    + Once upon a time, I needed to draw some Bezier curves because I was trying to create a Japanese kanji composition system that turned strokes + into outlines, and that required knowing how to offset Bezier curves and... at the time (2011, time flies) there was no good single source of + information for Bezier curves on the web. So I made one. Sure it started small, but it turns out that if you just keep adding bits to + something, several years later you have quite the monster, and a single HTML file becomes intractible. +

    +

    + So, in 2016, when React.js exploded onto the scene, I rewrote the primer as a React app, and it became a + lot easier to maintain. Like, a lot a lot. However, there was a downside: no JS meant no content. Sure, server-side rendering sort of + existed, but not really, and because the Primer is hosted through github, there was no "server" to run. Plus, trying to rehydrate an app the + size of the Primer from a giant HTML file had truly dire performance. +

    +

    + So I left it a regular React app, and every time I thought "wouldn't it be nice if it was just... a web page again?" the browser landscape + just hadn't caught up. Finally, in 2020, things are different: with a global pandemic, and some vacation time, and something random causing me + to look up the state of HTML custom elements, everything was pointing at it being time to finally, finally, turn the Primer back into + a normal web page. +

    +

    + The new tech stack is, frankly, pretty amazing. It does some things that weren't possible even half a year before I started the rewrite, let + alone being possible in 2016, and so because so much has changed, this post will be the first in a series of posts on how the new tech stack + works. +

    +

    To give a bit of a teaser, some of the things I'll be writing about:

    + +
    + + + Scripts are disabled. Showing fallback image. + + + +
    -
    - - -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    - - -

    Once upon a time, I needed to draw some Bezier curves because I was trying to create a Japanese kanji composition system that turned strokes into outlines, and that required knowing how to offset Bezier curves and... at the time (2011, time flies) there was no good single source of information for Bezier curves on the web. So I made one. Sure it started small, but it turns out that if you just keep adding bits to something, several years later you have quite the monster, and a single HTML file becomes intractible.

    -

    So, in 2016, when React.js exploded onto the scene, I rewrote the primer as a React app, and it became a lot easier to maintain. Like, a lot a lot. However, there was a downside: no JS meant no content. Sure, server-side rendering sort of existed, but not really, and because the Primer is hosted through github, there was no "server" to run. Plus, trying to rehydrate an app the size of the Primer from a giant HTML file had truly dire performance.

    -

    So I left it a regular React app, and every time I thought "wouldn't it be nice if it was just... a web page again?" the browser landscape just hadn't caught up. Finally, in 2020, things are different: with a global pandemic, and some vacation time, and something random causing me to look up the state of HTML custom elements, everything was pointing at it being time to finally, finally, turn the Primer back into a normal web page.

    -

    The new tech stack is, frankly, pretty amazing. It does some things that weren't possible even half a year before I started the rewrite, let alone being possible in 2016, and so because so much has changed, this post will be the first in a series of posts on how the new tech stack works.

    -

    To give a bit of a teaser, some of the things I'll be writing about:

    - -
    - - - Scripts are disabled. Showing fallback image. - - - -
    - - -
    - - -
    + + - + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - - -

    It's going to take me a while to detail the entire tech stack, but ultimately what matters is that you get a Primer that is a normal "vanilla" HTML, CSS, and JS page again, that "just works" even with JS disabled.

    -

    Enjoy The new Primer on Bézier Curves, and if you find any problems, you know where to go.

    -

    See you in the next post!

    -

    Pomax

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + +

    + It's going to take me a while to detail the entire tech stack, but ultimately what matters is that you get a Primer that is a normal "vanilla" + HTML, CSS, and JS page again, that "just works" even with JS disabled. +

    +

    + Enjoy The new Primer on Bézier Curves, and if you find any problems, + you know where to go. +

    +

    See you in the next post!

    +

    Pomax

    +
    - +
    -
    - - - - - - - \ No newline at end of file + + + diff --git a/docs/news/2020-11-22.html b/docs/news/2020-11-22.html index 70c914b4..7412e35d 100644 --- a/docs/news/2020-11-22.html +++ b/docs/news/2020-11-22.html @@ -1,142 +1,143 @@ + + + + A Primer on Bézier Curves - Curve-circle intersections - - - - A Primer on Bézier Curves - Curve-circle intersections + - + - - - - + - - - + - :root[lang="en-GB"] { - font-family: "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 18px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    - +
    +

    Curve-circle intersections

    +
    Sun, 22 Nov 2020
    +
    -
    - -

    Curve-circle intersections

    -
    Sun, 22 Nov 2020
    +
    + +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    -
    +

    + While the primer covered line/line, line/curve, and curve/curve intersections, there was one other obvious intersection conspicuously missing: + circle/curve intersections. You'd think those were just an extension on the maths used for the other three, but unfortunately, this is not the + case. Rather than using calculus, the only real way to determine where a polynomial curve intersects it is to sample the curve at a resolution + high enough to find you intervals on the curve where there likely is an intersection, then refining that interval until you find actual + intersections. +

    +

    + It is, in fact, rather similar to projecting a point onto a bezier curve where + the point is the circle's center, and where the projection distance actually needs to match the circle radius, so: + let's see how to do that! +

    +

    Pomax

    + -
    - - -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    +
    - -

    While the primer covered line/line, line/curve, and curve/curve intersections, there was one other obvious intersection conspicuously missing: circle/curve intersections. You'd think those were just an extension on the maths used for the other three, but unfortunately, this is not the case. Rather than using calculus, the only real way to determine where a polynomial curve intersects it is to sample the curve at a resolution high enough to find you intervals on the curve where there likely is an intersection, then refining that interval until you find actual intersections.

    -

    It is, in fact, rather similar to projecting a point onto a bezier curve where the point is the circle's center, and where the projection distance actually needs to match the circle radius, so: let's see how to do that!

    -

    Pomax

    - - -
    - -
    - - - - - - - \ No newline at end of file + + + diff --git a/docs/news/index.html b/docs/news/index.html index bf941888..e45a8573 100644 --- a/docs/news/index.html +++ b/docs/news/index.html @@ -1,138 +1,113 @@ + + + + A Primer on Bézier Curves - News - - - - A Primer on Bézier Curves - News + - + - - - - + - - - + - :root[lang="en-GB"] { - font-family: "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 18px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    - +
    +

    News posts

    +
    -
    - -

    News posts

    +
    +

    + Every now and then the Primer gets updated - these posts chronicle the evolution of the site, and hopefully offer interesting information not + just about the process of maintaining a resource like this, but also neat tech tricks, implementation approaches, maths that didn't make it + into the primer itself, etc. +

    -
    +

    + This section is still very new, so for the moment there aren't all that many posts up yet, but there's a series of posts planned already, and + if you're the kind of person who likes to keep tabs on updates by using RSS: good news, have an RSS link!. +

    -
    - -

    - Every now and then the Primer gets updated - these posts chronicle the evolution of the site, - and hopefully offer interesting information not just about the process of maintaining a resource - like this, but also neat tech tricks, implementation approaches, maths that didn't make it into - the primer itself, etc. -

    + +
    -

    - This section is still very new, so for the moment there aren't all that many posts up yet, but - there's a series of posts planned already, and if you're the kind of person who likes to keep - tabs on updates by using RSS: good news, have an RSS link!. -

    +
    - - - - -
    - - - - - - - \ No newline at end of file + + + diff --git a/docs/news/rss.xml b/docs/news/rss.xml index 90821d01..569b61a4 100644 --- a/docs/news/rss.xml +++ b/docs/news/rss.xml @@ -6,7 +6,7 @@ News updates for the primer on Bézier Curves by Pomax en-GB - Sat Sep 03 2022 09:57:02 +00:00 + Sat Sep 03 2022 17:00:46 +00:00 https://pomax.github.io/bezierinfo/images/og-image.png A Primer on Bézier Curves @@ -23,7 +23,7 @@ <p>— <a href="https://twitter.com/TheRealPomax">Pomax</a></p> - Sat Nov 21 2020 16:00:00 +00:00 + Sun Nov 22 2020 00:00:00 +00:00 2020-11-22.html Rewriting the tech stack @@ -118,7 +118,7 @@ draw() { <p>— <a href="https://twitter.com/TheRealPomax">Pomax</a></p> - Thu Sep 17 2020 17:00:00 +00:00 + Fri Sep 18 2020 00:00:00 +00:00 2020-09-18.html diff --git a/docs/ru-RU/index.html b/docs/ru-RU/index.html index 5a31da9e..d00737c6 100644 --- a/docs/ru-RU/index.html +++ b/docs/ru-RU/index.html @@ -1,161 +1,169 @@ + + + + Основы кривых Безье - - - - Основы кривых Безье + - + - - - - + - - - + - :root { - font-family: "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 18px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    - + +
    +

    + Основы кривых Безье +

    +

    Бесплатная онлайн-книга для тех, кому действительно важно знать, как работают кривые Безье

    +
    + Читайте на своём языке: + +

    + (Не нашли свой язык или хотите, чтобы он достиг до 100%? + Помогите нам с переводом!) +

    +
    -
    - -

    Основы кривых Безье

    -

    Бесплатная онлайн-книга для тех, кому действительно важно знать, как работают кривые Безье

    -
    - Читайте на своём языке: - -

    (Не нашли свой язык или хотите, чтобы он достиг до 100%? Помогите нам с переводом!)

    -
    - -

    - Приветствуем на Основах кривых Безье. Это бесплатная веб-страница/электронная книга, посвященная как математическим, - так и программным аспектам кривых Безье, охватывает широкий спектр тем касающихся рисования и работы с кривой, - которая, кажется, появляется повсюду: от кривых в Photoshop до функций плавности (easing functions) CSS и начертания шрифтов. -

    -

    - Если вы здесь впервые: Добро пожаловать! Обязательно напишите мне, - если вы ищете что-то конкретное, но не нашли ответ в тексте. -

    - -

    Пожертвования и спонсорство

    +

    + Приветствуем на Основах кривых Безье. Это бесплатная веб-страница/электронная книга, посвященная как математическим, так и программным + аспектам кривых Безье, охватывает широкий спектр тем касающихся рисования и работы с кривой, которая, кажется, появляется повсюду: от кривых в + Photoshop до функций плавности (easing functions) CSS и начертания шрифтов. +

    +

    + Если вы здесь впервые: Добро пожаловать! Обязательно напишите мне, если вы ищете + что-то конкретное, но не нашли ответ в тексте. +

    +

    Пожертвования и спонсорство

    -

    - Если вы используете этот источник для исследований, в качестве справочника по работе, или даже для написания - собственного программного обеспечения рассмотрите возможность - задонатить - (любая сумма годится) или станьте патроном на Patreon. Мне не - платят за эту работу, так что, если вы нашли что-то полезное для себя и хотите, чтобы он оставался с нами надолго, - то вот вам мысль: на создание этих страниц ушло много кофе, и уйдет ещё больше на их совершенствование; - если вы можете помочь с кофе, то поможете долгой и счастливой жизни этого ресурса. -

    -

    - Кроме того, если вы являетесь компанией и ваши сотрудники используют эту книгу в качестве источника, или вы используете - её в качестве вспомогательного ресурса, пожалуйста: подумайте о финансировании сайта! Я более чем счастлив, работая - с вашим финансовым отделом над выставлением счетов и признанием спонсорства. -

    - +

    + Если вы используете этот источник для исследований, в качестве справочника по работе, или даже для написания собственного программного + обеспечения рассмотрите возможность + задонатить + (любая сумма годится) или станьте патроном на Patreon. Мне не платят за эту работу, так что, + если вы нашли что-то полезное для себя и хотите, чтобы он оставался с нами надолго, то вот вам мысль: на создание этих страниц ушло много + кофе, и уйдет ещё больше на их совершенствование; если вы можете помочь с кофе, то поможете долгой и счастливой жизни этого ресурса. +

    +

    + Кроме того, если вы являетесь компанией и ваши сотрудники используют эту книгу в качестве источника, или вы используете её в качестве + вспомогательного ресурса, пожалуйста: подумайте о финансировании сайта! Я более чем счастлив, работая с вашим финансовым отделом над + выставлением счетов и признанием спонсорства. +

    - -
    +
    -

    - — Pomax -

    - + +
    - -
    +
    +
    +

    Предисловие

    +

    + Чтобы рисовать объекты в 2D, мы обычно полагаемся на линии, которые обычно подразделяются на две категории: прямые линии и кривые. Первые из + них так же легко нарисовать, как их легко нарисовать на компьютере. Дайте компьютеру первую и последнюю точку в строке, и БАЦ! Прямая линия. + Никаких сложностей не должно возникнуть. +

    +

    + Кривые, однако, представляют собой гораздо большую проблему. В то время как мы можем рисовать кривые с невероятной легкостью от руки, + компьютеры немного ограничены в том, что они не могут рисовать кривые, если нет математической функции, описывающей, как они должны быть + нарисованы. На самом деле, они также нуждаются в этом и для прямых линий, но функция до смешного проста, поэтому мы склонны игнорировать это + в том, что касается компьютеров; все линии являются "функциями", независимо от того, прямые они или кривые. Однако это означает, что нам + нужно придумать быстрые в вычислении функции, которые приводят к красивым кривым на компьютере. Их существует несколько, и в этой статье мы + сосредоточимся на конкретной функции, которая получила довольно много внимания и используется практически во всем, что может рисовать + кривые: кривые Безье. +

    +

    + Они названы в честь Пьера Безье, + который сделал так, чтобы они стали известны миру как кривая, хорошо подходящая для проектных работ (опубликовал свои исследования в 1962 + году, работая в Renault), хотя он не был первым или единственным, кто "изобрел" эти типы кривых. Может возникнуть соблазн сказать, что + математик + Поль де Кастельжо + был первым, поскольку он начал исследовать природу этих кривых в 1959 году, работая в Citroën, и открыл действительно элегантный способ, как + их нарисовать. Однако, Поль де Кастельжо не опубликовал свою работу, поэтому на вопрос, «кто был первым», трудно ответить в каком-либо + абсолютном смысле. Либо это? Кривые Безье – это, по сути, «полиномы Бернштейна», семейство математических функций, исследованных + Сергеем Натановичем Бернштейном, чьи публикации о них датируются, по крайней мере, до 1912 года. +

    +

    + В любом случае это в основном мелочи, вас, скорее всего, будет волновать то, что эти кривые удобны: вы можете связать несколько кривых + Безье, чтобы комбинация выглядела как одна кривая. Если вы когда-либо рисовали "Путь" (Path – прим. пер.) в Photoshop или работали с + программами векторного рисования, такими как Flash, Illustrator или Inkscape, эти кривые, которые вы рисовали, являются кривыми Безье. +

    +

    + Но что, если вам нужно будет запрограммировать их самостоятельно? В чем заключаются подводные камни? Как вы их рисуете? Что такое + ограничительные рамки, как определяются пересечения, как можно вытянуть кривую, короче говоря: какие возможности они предоставляют и что + можно сделать с этими кривыми? Вот для чего предназначена эта страница. Приготовьтесь к матчу! +

    +
    +

    Практически вся графика Безье интерактивная.

    +

    + На этой странице используются интерактивные примеры, в значительной степени опирающиеся на + Bezier.js, а также математические формулы, которые набираются в SVG с помощью + XeLaTeX система набора текста и pdf2svg + Дэвид Бартон. +

    +

    Эта книга с открытым исходным кодом.

    +

    + Эта книга представляет собой проект с открытым исходным кодом и хранится в двух репозиториях github. Во-первых, это + https://github.com/pomax/bezierinfo и является версией исключительно для презентаций, + которую вы просматриваете прямо сейчас. Другим хранилищем является + https://github.com/pomax/BezierInfo-2 – это версия для разработки, в которой + содержится весь код, и он превращается в веб-версию. Также в нём вы можете сообщать о проблемах, если обнаружите ошибки или у вас есть + идеи, что улучшить или добавить в учебник. +

    +

    Насколько сложной будет математика?

    +

    + Большая часть математики в этом учебнике – это математика для начальной школы. Если вы разбираетесь в элементарной арифметике и умеете + читать, у вас все должно получиться просто отлично. Иногда будут гораздо более сложные математические задачи, но если вам не + хочется их переваривать, вы можете спокойно пропустить их, либо пропустив "подробные поля" в разделе, либо просто перейдя к концу раздела + с математикой, которая выглядит слишком увлекательной. В конце разделов обычно просто перечисляются выводы, чтобы вы могли просто работать + непосредственно с этими знаниями. +

    +

    На каком языке написаны все примеры кода?

    +

    + Существует слишком много языков программирования, чтобы отдать предпочтение одному из них, поэтому во всех примерах в этом учебнике + используется форма псевдокода, которая использует синтаксис, достаточно близкий к современным скриптовым языкам, таким как JS, Python и т. + д., но на самом деле это не они. Поэтому вы не сможете копировать-вставить что-либо, не задумываясь, это сделано намеренно: если читаете + этот учебник, предположительно, вы хотите учиться, но вы не учитесь путем копирования-вставки. Вы учитесь, делая что-то + самостоятельно, совершая ошибки, а затем исправляя эти ошибки. Конечно, я намеренно не добавлял ошибки в пример кода только для того, + чтобы заставить вас ошибаться (это было бы ужасно!), Но я намеренно не позволял примерам кода отдавать предпочтение одному языку + программирования перед другим. Не волнуйтесь, если вы знаете хотя бы один процедурный язык программирования, вы сможете прочитать примеры + без каких-либо трудностей. +

    +

    Вопросы, комментарии:

    +

    + Если у вас есть предложения по новым разделам, перейдите по ссылке + на Github (также доступно из репозитория, ссылка в правом верхнем углу). Если у + вас есть вопросы по материалу, в настоящее время нет раздела комментариев, пока я его переписываю, но вы также можете использовать github + для этого. Как только переписывание будет закончено, я снова добавлю раздел общих комментариев и, возможно, более актуальную систему + "выберите этот раздел текста и нажмите кнопку "вопрос", чтобы задать вопрос об этом". Посмотрим. +

    +

    Помогите поддержать книгу!

    +

    + Если вам понравилась эта книга или просто сочли её полезной, сообщите об этом, чтобы оценить эту книгу, есть два варианта: можете либо + перейти на страницу Patreon или, если вы предпочитаете сделать единовременное + пожертвование, зайдите на страницу + купить кофе Pomax. За эти годы + эта работа из небольшого букваря превратилась в пособие, эквивалентное 100 с лишним печатным страницам, посвященное кривым Безье, и в его + создание было вложено много кофе. Я не жалею ни минуты, потраченной на его написание, но мне всегда не помешает еще немного кофе, чтобы + продолжать писать! +

    +
    +
    +
    +

    Что нового?

    +

    + Этот документ живое пособие, в зависимости от даты вашего последнего посещения, может появиться новый материал. Кликайте + здесь для просмотра лога изменений. (также доступен RSS-канал) +

    + + + +
    +

    November 2020

    +
      +
    • Added a section on finding curve/circle intersections

    • +
    +

    October 2020

    +
      +
    • Added the Ukranian locale! Help out in getting its localization to 100%!

    • +
    +

    August-September 2020

    +
      +
    • +

      + Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS + turned on. +

      +
    • +
    +

    June 2020

    +
      +
    • Added automatic CI/CD using Github Actions

    • +
    +

    January 2020

    +
      +
    • Added reset buttons to all graphics

    • +
    • Updated to preface to correctly describe the on-page maths

    • +
    • Fixed the Catmull-Rom section because it had glaring maths errors

    • +
    +

    August 2019

    +
      +
    • Added a section on (plain) rational Bezier curves

    • +
    • Improved the Graphic component to allow for sliders

    • +
    +

    December 2018

    +
      +
    • Added a section on curvature and calculating kappa.

    • +
    • +

      + Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this + site! +

      +
    • +
    +

    August 2018

    +
      +
    • Added a section on finding a curve's y, if all you have is the x coordinate.

    • +
    +

    July 2018

    +
      +
    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

    • +
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

    • +
    • +

      (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      +
    • +
    +

    June 2018

    +
      +
    • Added a section on direct curve fitting.

    • +
    • Added source links for all graphics.

    • +
    • Added this "What's new?" section.

    • +
    +

    April 2017

    +
      +
    • Added a section on 3d normals.

    • +
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

    • +
    +

    February 2017

    +
      +
    • Finished rewriting the entire codebase for localization.

    • +
    +

    January 2016

    +
      +
    • Added a section to explain the Bezier interval.

    • +
    • Rewrote the Primer as a React application.

    • +
    +

    December 2015

    +
      +
    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

    • +
    • +

      + Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics. +

      +
    • +
    +

    May 2015

    +
      +
    • Switched over to pure JS rather than Processing-through-Processing.js

    • +
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

    • +
    +

    April 2015

    +
      +
    • Added a section on arc length approximations.

    • +
    +

    February 2015

    +
      +
    • Added a section on the canonical cubic Bezier form.

    • +
    +

    November 2014

    +
      +
    • Switched to HTTPS.

    • +
    +

    July 2014

    +
      +
    • Added the section on arc approximation.

    • +
    +

    April 2014

    +
      +
    • Added the section on Catmull-Rom fitting.

    • +
    +

    November 2013

    +
      +
    • Added the section on Catmull-Rom / Bezier conversion.

    • +
    • Added the section on Bezier cuves as matrices.

    • +
    +

    April 2013

    +
      +
    • Added a section on poly-Beziers.

    • +
    • Added a section on boolean shape operations.

    • +
    +

    March 2013

    +
      +
    • First drastic rewrite.

    • +
    • Added sections on circle approximations.

    • +
    • Added a section on projecting a point onto a curve.

    • +
    • Added a section on tangents and normals.

    • +
    • Added Legendre-Gauss numerical data tables.

    • +
    +

    October 2011

    +
      +
    • +

      + First commit for the bezierinfo site, based on the pre-Primer webpage that covered + the basics of Bezier curves in HTML with Processing.js examples. +

      +
    • +
    +
    +
    +
    +
    +

    + + Быстрое введение +

    +

    + Начнем с приятного: говоря о кривых Безье, мы говорим о штуках доступных нашему лицезрению на картинках ниже. Отрезки протекают от + начальной до конечной точек, искривление обусловленно одной или более вспомогательными контрольными точками. Теперь, поскольку вся графика + на данной странице интереактивна, вы можете манипулировать кривыми посредством перемещения вышеупомянутых точек и наблюдать производимый + эффект соотносительно вашим действиям. +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    -
    - -

    Предисловие

    -

    Чтобы рисовать объекты в 2D, мы обычно полагаемся на линии, которые обычно подразделяются на две категории: прямые линии и кривые. Первые из них так же легко нарисовать, как их легко нарисовать на компьютере. Дайте компьютеру первую и последнюю точку в строке, и БАЦ! Прямая линия. Никаких сложностей не должно возникнуть.

    -

    Кривые, однако, представляют собой гораздо большую проблему. В то время как мы можем рисовать кривые с невероятной легкостью от руки, компьютеры немного ограничены в том, что они не могут рисовать кривые, если нет математической функции, описывающей, как они должны быть нарисованы. На самом деле, они также нуждаются в этом и для прямых линий, но функция до смешного проста, поэтому мы склонны игнорировать это в том, что касается компьютеров; все линии являются "функциями", независимо от того, прямые они или кривые. Однако это означает, что нам нужно придумать быстрые в вычислении функции, которые приводят к красивым кривым на компьютере. Их существует несколько, и в этой статье мы сосредоточимся на конкретной функции, которая получила довольно много внимания и используется практически во всем, что может рисовать кривые: кривые Безье.

    -

    Они названы в честь Пьера Безье, который сделал так, чтобы они стали известны миру как кривая, хорошо подходящая для проектных работ (опубликовал свои исследования в 1962 году, работая в Renault), хотя он не был первым или единственным, кто "изобрел" эти типы кривых. Может возникнуть соблазн сказать, что математик Поль де Кастельжо был первым, поскольку он начал исследовать природу этих кривых в 1959 году, работая в Citroën, и открыл действительно элегантный способ, как их нарисовать. Однако, Поль де Кастельжо не опубликовал свою работу, поэтому на вопрос, «кто был первым», трудно ответить в каком-либо абсолютном смысле. Либо это? Кривые Безье – это, по сути, «полиномы Бернштейна», семейство математических функций, исследованных Сергеем Натановичем Бернштейном, чьи публикации о них датируются, по крайней мере, до 1912 года.

    -

    В любом случае это в основном мелочи, вас, скорее всего, будет волновать то, что эти кривые удобны: вы можете связать несколько кривых Безье, чтобы комбинация выглядела как одна кривая. Если вы когда-либо рисовали "Путь" (Path – прим. пер.) в Photoshop или работали с программами векторного рисования, такими как Flash, Illustrator или Inkscape, эти кривые, которые вы рисовали, являются кривыми Безье.

    -

    Но что, если вам нужно будет запрограммировать их самостоятельно? В чем заключаются подводные камни? Как вы их рисуете? Что такое ограничительные рамки, как определяются пересечения, как можно вытянуть кривую, короче говоря: какие возможности они предоставляют и что можно сделать с этими кривыми? Вот для чего предназначена эта страница. Приготовьтесь к матчу!

    -
    - -

    Практически вся графика Безье интерактивная.

    -

    На этой странице используются интерактивные примеры, в значительной степени опирающиеся на Bezier.js, а также математические формулы, которые набираются в SVG с помощью XeLaTeX система набора текста и pdf2svg Дэвид Бартон.

    -

    Эта книга с открытым исходным кодом.

    -

    Эта книга представляет собой проект с открытым исходным кодом и хранится в двух репозиториях github. Во-первых, это https://github.com/pomax/bezierinfo и является версией исключительно для презентаций, которую вы просматриваете прямо сейчас. Другим хранилищем является https://github.com/pomax/BezierInfo-2 – это версия для разработки, в которой содержится весь код, и он превращается в веб-версию. Также в нём вы можете сообщать о проблемах, если обнаружите ошибки или у вас есть идеи, что улучшить или добавить в учебник.

    -

    Насколько сложной будет математика?

    -

    Большая часть математики в этом учебнике – это математика для начальной школы. Если вы разбираетесь в элементарной арифметике и умеете читать, у вас все должно получиться просто отлично. Иногда будут гораздо более сложные математические задачи, но если вам не хочется их переваривать, вы можете спокойно пропустить их, либо пропустив "подробные поля" в разделе, либо просто перейдя к концу раздела с математикой, которая выглядит слишком увлекательной. В конце разделов обычно просто перечисляются выводы, чтобы вы могли просто работать непосредственно с этими знаниями.

    -

    На каком языке написаны все примеры кода?

    -

    Существует слишком много языков программирования, чтобы отдать предпочтение одному из них, поэтому во всех примерах в этом учебнике используется форма псевдокода, которая использует синтаксис, достаточно близкий к современным скриптовым языкам, таким как JS, Python и т. д., но на самом деле это не они. Поэтому вы не сможете копировать-вставить что-либо, не задумываясь, это сделано намеренно: если читаете этот учебник, предположительно, вы хотите учиться, но вы не учитесь путем копирования-вставки. Вы учитесь, делая что-то самостоятельно, совершая ошибки, а затем исправляя эти ошибки. Конечно, я намеренно не добавлял ошибки в пример кода только для того, чтобы заставить вас ошибаться (это было бы ужасно!), Но я намеренно не позволял примерам кода отдавать предпочтение одному языку программирования перед другим. Не волнуйтесь, если вы знаете хотя бы один процедурный язык программирования, вы сможете прочитать примеры без каких-либо трудностей.

    -

    Вопросы, комментарии:

    -

    Если у вас есть предложения по новым разделам, перейдите по ссылке на Github (также доступно из репозитория, ссылка в правом верхнем углу). Если у вас есть вопросы по материалу, в настоящее время нет раздела комментариев, пока я его переписываю, но вы также можете использовать github для этого. Как только переписывание будет закончено, я снова добавлю раздел общих комментариев и, возможно, более актуальную систему "выберите этот раздел текста и нажмите кнопку "вопрос", чтобы задать вопрос об этом". Посмотрим.

    -

    Помогите поддержать книгу!

    -

    Если вам понравилась эта книга или просто сочли её полезной, сообщите об этом, чтобы оценить эту книгу, есть два варианта: можете либо перейти на страницу Patreon или, если вы предпочитаете сделать единовременное пожертвование, зайдите на страницу купить кофе Pomax. За эти годы эта работа из небольшого букваря превратилась в пособие, эквивалентное 100 с лишним печатным страницам, посвященное кривым Безье, и в его создание было вложено много кофе. Я не жалею ни минуты, потраченной на его написание, но мне всегда не помешает еще немного кофе, чтобы продолжать писать!

    -
    -
    -
    -

    Что нового?

    -

    Этот документ живое пособие, в зависимости от даты вашего последнего посещения, может появиться новый материал. Кликайте здесь для просмотра лога изменений. (также доступен RSS-канал)

    - - - -

    November 2020

    • Added a section on finding curve/circle intersections

      -
    -

    October 2020

    • Added the Ukranian locale! Help out in getting its localization to 100%!

      -
    -

    August-September 2020

    • Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS turned on.

      -
    -

    June 2020

    • Added automatic CI/CD using Github Actions

      -
    -

    January 2020

    • Added reset buttons to all graphics

      -
    • -
    • Updated to preface to correctly describe the on-page maths

      -
    • -
    • Fixed the Catmull-Rom section because it had glaring maths errors

      -
    -

    August 2019

    • Added a section on (plain) rational Bezier curves

      -
    • -
    • Improved the Graphic component to allow for sliders

      -
    -

    December 2018

    • Added a section on curvature and calculating kappa.

      -
    • -
    • Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this site!

      -
    -

    August 2018

    • Added a section on finding a curve's y, if all you have is the x coordinate.

      -
    -

    July 2018

    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

      -
    • -
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

      -
    • -
    • (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      -
    -

    June 2018

    • Added a section on direct curve fitting.

      -
    • -
    • Added source links for all graphics.

      -
    • -
    • Added this "What's new?" section.

      -
    -

    April 2017

    • Added a section on 3d normals.

      -
    • -
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

      -
    -

    February 2017

    • Finished rewriting the entire codebase for localization.

      -
    -

    January 2016

    • Added a section to explain the Bezier interval.

      -
    • -
    • Rewrote the Primer as a React application.

      -
    -

    December 2015

    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

      -
    • -
    • Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics.

      -
    -

    May 2015

    • Switched over to pure JS rather than Processing-through-Processing.js

      -
    • -
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

      -
    -

    April 2015

    • Added a section on arc length approximations.

      -
    -

    February 2015

    • Added a section on the canonical cubic Bezier form.

      -
    -

    November 2014

    • Switched to HTTPS.

      -
    -

    July 2014

    • Added the section on arc approximation.

      -
    -

    April 2014

    • Added the section on Catmull-Rom fitting.

      -
    -

    November 2013

    • Added the section on Catmull-Rom / Bezier conversion.

      -
    • -
    • Added the section on Bezier cuves as matrices.

      -
    -

    April 2013

    • Added a section on poly-Beziers.

      -
    • -
    • Added a section on boolean shape operations.

      -
    -

    March 2013

    • First drastic rewrite.

      -
    • -
    • Added sections on circle approximations.

      -
    • -
    • Added a section on projecting a point onto a curve.

      -
    • -
    • Added a section on tangents and normals.

      -
    • -
    • Added Legendre-Gauss numerical data tables.

      -
    -

    October 2011

    • First commit for the bezierinfo site, based on the pre-Primer webpage that covered the basics of Bezier curves in HTML with Processing.js examples.

      -
    -
    -
    -

    Быстрое введение

    -

    Начнем с приятного: говоря о кривых Безье, мы говорим о штуках доступных нашему лицезрению на картинках ниже. Отрезки протекают от начальной до конечной точек, искривление обусловленно одной или более вспомогательными контрольными точками. Теперь, поскольку вся графика на данной странице интереактивна, вы можете манипулировать кривыми посредством перемещения вышеупомянутых точек и наблюдать производимый эффект соотносительно вашим действиям.

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    - -

    Данный тип кривых активно используется в компьютерном дизайне и производственных (CAD/CAM) применениях, в графических программах, таких как Adobe Illustrator and Photoshop, Inkscape, GIMP ... в графических технологиях, таких как маштабируемая векторная графика (SVG) и шрифты OpenType (TTF/OTF). Области использования кривых Безье довольно широки, и если вы желаете узнать больше о составляющих их аспектах: пристегните ремни!

    - -
    -
    -

    Итак, из чего сделаны кривые Безье?

    -

    Играя с позициями точек, вы, возможно, получили общее представление о том, как кривые Безье себя ведут. Но чем же является кривая Безье? Ее можно объяснять двумя способами. Оба эквиваленты, однако один из них затрагивает объемные математические описания, тогда как второй пользуется довольно простой математикой. Итак... давайте начнем с простейшего:

    -

    Кривая Безье есть результатом линейной интерполяции (*в оригинале текста другая ссылка, тут и далее пр. переводчика). Звучит заморочено, но в действительности вы занимались линейной интерполяций с юных лет: каждый раз когда вам приходилось указывать на что либо лежащее между двумя другими "чем либо" — вы применяли линейную интерполяцию. Т.е. попросту "выбор точки лежащей на линии между двумя другими точками".

    -

    Вот, скажем, зная расстояние между двух точек и желая поставить третью на удалении 20ти % этого расстояния до первой и, соответственно, 80ти % до второй, вычислить результат можно следующим образом:

    - - -

    Что же, посмотрим на это в действии: ниже представлена интерактивная проекция, кликнув на ползунок, можно пользоваться клавишами вниз-вверх для увеличения и уменьшения соотношения интерполяции и наблюдать получаемый результат. Сначала, основываясь на трех точках, задаем два отрезка, затем производим линейную интерполяцию по длине каждого из них, получая еще две точки. Далее мы опять производим линейную интерполяцию уже между полученными точками и в итоге получаем искомую точку. (* на изображении видим 3 проекции соответственно этим трем действиям).

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    И это приводит нас к более замороченному математическому анализу.

    -

    Хотя и не очевидно, но мы только что прорисовали квадратную кривую Безье, просто пошагово вместо цельной линии. Один увлекательный аспект кривых Безье является тем, что кривая может одновременно быть описана как полиномиальная функция и так же как набор линейных интерполяций. Это предоставляет возможность рассматривать эти типы кривых с двух планов: математического (анализа функции, ее производных, и все такое) а так же с механической композиции (которая, к примеру, позволяет легко заметить, что точка никогда не выйдет за рамки заданных точек, используемых для ее конструкции).

    -

    С этим, давайте рассмотрим кривые Безье более подробно: в частности их математические выражения, свойства, доступные к вычислению на их основе, и вариации действий применимых к и производимых из кривых Безье.

    - -
    -
    -

    Математика под кривыми Безье

    -

    Кривые Безье по своей форме представляют собой параметрические функции. С точки зрения математики, эти функции хитрят, поскольку разнятся с каноническим определением функции. Последнее есть преобразованием любого количества вводных в единый результат. Как бы мы не меняли вводные значения переменных, на выходе всегда получим одно значение.

    -

    Параметрические функции хитрят, потому как говоря: "что ж, ладно, нам нужно произвести более одного вывода", заключают, попросту: "давайте использовать более одной функции". Проиллюстрируем: представьте что есть функция, которая преобразует вводную, назовем ее x, в другое значение следуя определенной логике

    - - -

    Нотация f(x) есть стандартным способом записи функции (по конвенции, f, когда представлена всего одна) и ее вывод меняется в зависимости от значения одной переменной (в данном случае, x). Так, изменив x и получим другой результат f(x).

    -

    Пока все логично. Теперь, давайте рассмотрим параметрические функции в действии и разберемся в чем их фокус. И, для начала, примите ко внимаю следующий пример:

    - - -

    Итак, нам представлены две функции. Ничего особо выдающегося: просто функции синуса и косинуса. Отметим, что, как вы можете видеть, вводные переменные разные. Затем, меняя значение a, мы не влияем на вывод f(b), поскольку a никак не задействована в этой функции. Параметрические функции хитрят именно с этим: весь набор функций вывода делит между собой одну или более переменную вводную. Как здесь:

    - - -

    Что же, несколько функций, и всего одна вводная. Меняя значение t — меняем вывод обеих ф-ций, fa(t) и fb(t). Возможно вы спросите: "в чем же польза?". Ответ прост и очевиден, если уточнить, что мы имеем ввиду под записью наших функций:

    - - -

    Так-то. Координаты x/y. Связанные мистическим значением t.

    -

    Таким образом, мы видим, что параметрические функции не определяют значение y через значение x, как обычные ф-ции, вместо этого они выводят и x и y из мистического значения t. Значит, для каждого значения t, существуют два соответствующих значения x и y, которые мы можем использовать для зарисовки кривой на графике. К примеру, приведенная выше параметрическая функция выводит координаты точки на круге: можно подставить любое значение t (от негативной до позитивной бесконечности), и на выводе мы всегда получим координаты точки на круге с центром в (0,0) и радиусом в 1. Зарисовав точки для t от 0 до 5, получаем следующее:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Кривые Безье — всего один из многих классов параметрических функций. Их главной характеристикой есть использование одной и той же базовой функции для генерации всех выводов. До сих пор, в использованном нами примере, мы производили значения x и y с помощью разных функций (ф-цией синуса и ф-цией косинуса); Безье же использует единый "биноминальный полином" для вывода обоих значений. Но что же такое "биноминальный полином"?

    -

    Возможно, вы помните полиномы из школьной программы. Они выглядят следующим образом:

    - - -

    Так, если наивысшая степень в уравнении составляет , мы называем такой полином кубическим, если — квадратным. Если это только — значит это просто линия (когда же x не упоминается вовсе — это не полином!).

    -

    Кривые Безье являются полиномом t (вместо x), где значение t — варъируется между 0 и 1, и коэффициентами a, b и т.д. принимающими "биноминальную" форму. Звучит сложно, но на практике выглядит понятнее:

    - - -

    Я знаю что вы думаете: это не выглядит таким уж простым. Но если мы уберем t и вместо этого поставим *1, внезапно, все становится ясно. Примите ко внимаю такие биномиальные термины:

    - - -

    Обратите внимание, что 2 это то же, что 1+1, и 3 это 2+1 и 1+2 и 6 это 3+3... Как вы можете видеть, каждый раз поднимаясь на одно разрешение выше, мы начинаем и заканчиваем с единицы, а все промежуточные значения равны суме двух значений над ними в предыдущей строке. Так получаем последовательность цифр, называемой треугольником Паскаля (*в оригинале другая ссылка ).

    -

    Так-же существует простой способ для выяснения подоплотной работы полиноминальных терминов: если мы переименуем (1-t) в a и t в b, убрав "веса", получим следующее:

    - - -

    В целом это просто сума "каждого сочетания a и b", получаемая прогрессивной заменой a на b по ходу уравнения. Потому, это так-же довольно просто. Итак теперь вы знаете что такое биноминальные полиномы. Для полноты картины, ниже привожу их общую функцию:

    - - -

    И теперь, это полное объяснение. Σ в этой функции означает, что это серия сум (с использованием переменной приведенной под Σ, со стартовым значением в ...=<value> и максимальным значением представленным над Σ)

    -
    + +

    + И теперь, это полное объяснение. Σ в этой функции означает, что это серия сум (с использованием переменной приведенной под Σ, со стартовым + значением в ...=<value> и максимальным значением представленным над Σ) +

    +
    +

    Имплементация базовых функций

    +

    Наивно, мы могли бы воплотить базис этой функции как математическую конструкцию, используя функцию как путеводитель:

    -

    Имплементация базовых функций

    -

    Наивно, мы могли бы воплотить базис этой функции как математическую конструкцию, используя функцию как путеводитель:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    Я говорю "мы могли бы", поскольку это не входит в наши планы: факторные функции невероятно дорого стоят. И, как мы можем понять из выше приведенного текста, треугольник Паскаля можно получить и без таких ухищрений: попросту начнем с [1], далее [1,1], затем [1,2,1], потом [1,3,3,1] и т.д., с каждым новым рядком, записывая на 1 цифру больше, чем в предыдущем, начиная и заканчивая с единицы, а промежуточные цифры определяя суммируя две над ними в предыдущем рядке.

    -

    Такой ряд можно сгенерировать предельно быстро, и с этим нам не придется компилировать биноминальные термины, поскольку можно пользоваться значениями из таблицы:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    - -

    Итак, что же здесь происходит? Сначала мы декларируем таблицу достаточного размера для удовлетворения большинства запросов. Далее мы заявляем функцию вывода необходимого значения, вытаскивая его из таблицы, предварительно убедившись, что значения для запрашиваемых n/k присутствуют в наборе и расширяя набор по необходимости (если не присутствуют). Наша базовая функция теперь выглядит типа этого:

    + return lut[n][k] + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + - - - - -
    1 - -
    2
    3
    4
    5
    - -

    Отлично. Конечно, мы можем оптимизировать ее и далее. Для большинства задач компьютерной графики нам не потребуются кривые произвольного порядка (хотя мы приводим код для произвольных кривых в этом пособии); зачастую нам нужны квадратные и кубические кривые, а это значит, мы можем значительно упростить весь наш код:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    И вот теперь мы знаем как программировать базовую функцию. Превосходно.

    -
    + return mt3 + 3*mt2*t + 3*mt*t2 + t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    Итак, зная как выглядят базовые функции, время добавить магию делающую кривые Безье такими особенными: контрольные точки.

    +

    И вот теперь мы знаем как программировать базовую функцию. Превосходно.

    +
    -
    -
    -

    Контроль кривых Безье

    -

    Кривые Безье, как и все отрезки кривых, являются интерполярными функциями. Это значит, что они имеют заданный набор контрольных точек и генерируют значения где-то "между" ними. (Одним следствием этого есть то, что невозможно сгенерировать точки вне этого "каркаса". Полезная информация!). На практике мы можем проиллюстрировать влияние каждой контрольной точки на течение кривой по отдельности и проследить какие из точек имеют наибольший эффект в тех и иных частях кривой.

    -

    Следующие зарисовки демонстрируют интерполяции функций для квадратной и кубической кривых, где "S" представляет степень влияния контрольной точки на положение участка кривой по ходу развития t. Потяните ползунки, чтобы отследить проценты интерполяций каждой контрольной точки на конкретном t.

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - -
    +

    Итак, зная как выглядят базовые функции, время добавить магию делающую кривые Безье такими особенными: контрольные точки.

    +
    +
    +

    + + Контроль кривых Безье +

    +

    + Кривые Безье, как и все отрезки кривых, являются интерполярными функциями. Это значит, что они имеют заданный набор контрольных точек и + генерируют значения где-то "между" ними. (Одним следствием этого есть то, что невозможно сгенерировать точки вне этого "каркаса". Полезная + информация!). На практике мы можем проиллюстрировать влияние каждой контрольной точки на течение кривой по отдельности и проследить какие + из точек имеют наибольший эффект в тех и иных частях кривой. +

    +

    + Следующие зарисовки демонстрируют интерполяции функций для квадратной и кубической кривых, где "S" представляет степень влияния + контрольной точки на положение участка кривой по ходу развития t. Потяните ползунки, чтобы отследить проценты интерполяций каждой + контрольной точки на конкретном t. +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + + + Скрипты отключены. Показываем резервное изображение. + + + + + + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    -

    Также представлена интерполяция кривой Безье 15го порядка. Как вы можете видеть, начальная и конечная точки влияют на положение кривой значительно больше, чем любая другая контрольная точка.

    -

    Желая изменить кривую, нам нужно изменить "вес"-а (* коофициенты) для каждой точки, по сути меняя интерполяцию. Имплементация оного предельно логична: нужно просто перемножить каждую точку на значение, меняющее ее 'силу'. Эти значения, по конвенции называют "весом" и мы можем учесть их запись в нашей оригинальной функции Безье:

    - - -

    Хоть и выглядит заморочено, но, так уж получается, в реальности "веса" просто значения координат на графике, к которым мы бы хотели, чтобы наша функция стремилась. Так, для кривой n-го порядка, w0 есть начальной координатой, wn конечной координатой, а все между ними — контрольными координатами. Например, чтобы кубическая кривая начиналась в (110,150), стремилась к точкам (25,190) и (210,250) заканчиваясь на (210,30), мы запишем это следующим образом:

    - - -

    Это производит кривую, график которой мы видели в первой главе этой статьи.

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    Что еще можно сделать с кривой Безье? На самом деле — вполне не мало. Остальная часть статьи описывает множества доступных операций и задач, которые они решают.

    -
    + +

    Это производит кривую, график которой мы видели в первой главе этой статьи.

    + + + Скрипты отключены. Показываем резервное изображение. + + + +

    + Что еще можно сделать с кривой Безье? На самом деле — вполне не мало. Остальная часть статьи описывает множества доступных операций и + задач, которые они решают. +

    +
    +

    Имплементация весовых функций.

    +

    + Отталкиваясь от того, что мы уже знаем как воплотить базис функции, добавление контрольных точек представляет собой удивительно легкую + задачу: +

    -

    Имплементация весовых функций.

    -

    Отталкиваясь от того, что мы уже знаем как воплотить базис функции, добавление контрольных точек представляет собой удивительно легкую задачу:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    И оптимизированная версия:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    И вот, мы знаем как программировать базис функции с весами.

    -
    + return w[0]*mt3 + 3*w[1]*mt2*t + 3*w[2]*mt*t2 + w[3]*t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -
    -
    -

    Контроль над кривыми Безье, часть 2: Соотносительные Безье

    -

    Мы можем расширить степень влияния на кривые Безье, "соотнеся" их составляющие: подразумевается добавление в формулу коэффициента соотношений (* в оригинале "ratio" — соотношение) в довесок к уже используемым весам.

    -

    Как и прочее, воплощение этого коэффициента не должно составить нам особого труда. Тогда как обычная функция:

    - - -

    Функция для соотносительных кривых Безье имеет два дополнительных термина:

    - - -

    Первый из добавочных терминов, представляет собой дополнительный "вес" для каждой координаты, Например если наши значения соотношений [1, 0.5, 0.5, 1], тогда частица0 = 1, частица1 = 0.5 и т.д., и на практике ничем не отличается от использования дополнительного "вес"-а. Пока ничего особо выдающегося.

    -

    Тем не менее, второй из добавленных терминов, составляет всю важность разницы. В процессе компиляции точек на кривой, мы компилируем "нормальные" значения Безье, а далее делим их на значение Безье с учетом только соотношений (частиц), без учета весов.

    -

    Это создает неожиданный эффект: превращает наш полиноминал во что-то, что полиноминалом более не является. Теперь этот тип кривой есть супер классом полиноминала, что позволяет производить крутые вещи, на которые кривые Безье не способны сами по себе. Например обозначить круг посредством такой кривой (что, как мы убедимся чуть погодя, категорически невозможно используя стандартную кривую Безье).

    -

    Но лучшим объяснением, как и всегда, является наглядная демонстрация: давайте посмотрим на эффект "соотносительности" нашей кривой Безье на примере интерактивного графика соотнесительной кривой. Следующий график показывает кривую Безье из предыдущей секции, "обогащенную" факторами соотношений для каждой координаты. Чем ближе к нулю мы выставляем значения этих факторов — тем меньше влияния оказывают соответственные контрольные точки на начертания кривой, и, соответственно, чем выше эти показатели — тем больше влияния контрольная точка оказывает на изгиб кривой. Попробуйте сами:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - -

    Вы можете думать о значениях соотношений, как о показателе силы притяжения соответствующей точки. Чем выше сила притяжения, тем больше наша кривая будет стремится к этой точке. Вы также можете наблюдать, что одинаковое увеличение или уменьшение всех показателей не оказывает никакого эффекта на результат... схоже с гравитацией: если значения остаются одинаковыми относительно друг-друга, на выводе ничего не меняется. Значения соотношений определяют влияние каждой координаты относительно всех остальных координат.

    -
    + +

    + Первый из добавочных терминов, представляет собой дополнительный "вес" для каждой координаты, Например если наши значения соотношений [1, + 0.5, 0.5, 1], тогда частица0 = 1, частица1 = 0.5 и т.д., и на практике ничем не отличается + от использования дополнительного "вес"-а. Пока ничего особо выдающегося. +

    +

    + Тем не менее, второй из добавленных терминов, составляет всю важность разницы. В процессе компиляции точек на кривой, мы компилируем + "нормальные" значения Безье, а далее делим их на значение Безье с учетом только соотношений (частиц), без учета весов. +

    +

    + Это создает неожиданный эффект: превращает наш полиноминал во что-то, что полиноминалом более не является. Теперь этот тип кривой есть + супер классом полиноминала, что позволяет производить крутые вещи, на которые кривые Безье не способны сами по себе. Например обозначить + круг посредством такой кривой (что, как мы убедимся чуть погодя, категорически невозможно используя стандартную кривую Безье). +

    +

    + Но лучшим объяснением, как и всегда, является наглядная демонстрация: давайте посмотрим на эффект "соотносительности" нашей кривой Безье + на примере интерактивного графика соотнесительной кривой. Следующий график показывает кривую Безье из предыдущей секции, "обогащенную" + факторами соотношений для каждой координаты. Чем ближе к нулю мы выставляем значения этих факторов — тем меньше влияния оказывают + соответственные контрольные точки на начертания кривой, и, соответственно, чем выше эти показатели — тем больше влияния контрольная точка + оказывает на изгиб кривой. Попробуйте сами: +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + + + +

    + Вы можете думать о значениях соотношений, как о показателе силы притяжения соответствующей точки. Чем выше сила притяжения, тем больше + наша кривая будет стремится к этой точке. Вы также можете наблюдать, что одинаковое увеличение или уменьшение всех показателей не + оказывает никакого эффекта на результат... схоже с гравитацией: если значения остаются одинаковыми относительно друг-друга, на выводе + ничего не меняется. Значения соотношений определяют влияние каждой координаты относительно всех остальных координат. +

    +
    +

    Имплементация соотносительных кривых

    +

    Дополнение кода из предыдущей секции для учета этой функциональности фактически тривиально:

    -

    Имплементация соотносительных кривых

    -

    Дополнение кода из предыдущей секции для учета этой функциональности фактически тривиально:

    - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    - -

    Вот и все что требуется.

    -
    + return (f[0] * w[0] + f[1] * w[1] + f[2] * w[2] + f[3] * w[3])/basis + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + -
    -
    -

    Интервал Безье [0,1]

    -

    В математике кривых Безье, вы могли заметить одну любопытную деталь — кривые Безье всегда считают вдоль одного и того же интервала t, t=0 to t=1. Почему же именно этот интервал?

    -

    Последнее обусловленно тем, как мы определяем "начало" и "конец" нашей кривой. Если у нас есть значение, которое представляет собой сочетание двух других значений, тогда общая формула для этого будет:

    - - -

    Очевидно, что начальное и конечное значения a и b должны быть a=1, b=0, чтобы в начале получать вывод 100% первого показателя и 0% второго; и a=0, b=1, чтобы в конце получать 0% value 1 и 100% value 2. В дополнение, мы не хотим чтобы "a" и "b" были независимыми, в коем случае можно было бы присвоить им любые значения и на выводе получить, например, 100% первого показателя и 100% второго. В принципе, с последним все ок, но в случае кривых Безье, мы всегда должны получать значение между двух крайностей, потому нельзя присвоить a и b значения, которые бы вместе составляли суму более 100% на выводе, что можно записать как:

    - - -

    С этим у нас есть гарантия, что мы не получим суму значений пропорций более 100%. Мы ограничиваем значение a интервалом [0,1], потому всегда получаем вывод из пропорционального смешения двух показателей, с сумой смесителей не превышающей 100%.

    -

    Но... что если мы воспользуемся этой формой, предполагающей что мы всегда будем использовать значения а между 0 и 1, для подсчета точек лежащих вне стандартного интервала? Наши вычисления пойдут по откосной? Что ж... не то чтобы, мы попросту увидим больше.

    -

    В случае кривой Безье, продление интервала просто продляет развитие кривой. Кривые Безье всего лишь сегменты полиноминальных кривых. Выбирая более широкий отрезок — мы можем проследить дальнейшее развитие кривой. И как же это выглядит?

    -

    Ниже представлены графики кривых Безье, зарисованных "обычным путем", по верх зарисовки кривых на которых они лежат, полученных путем расширения интервала значений t. Как мы можем наблюдать, кривая Безье есть частью большей формы, скрытой за границами нашего интервала, которую мы так-же можем регулировать путем перемещения контрольных точек.

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    + +

    + С этим у нас есть гарантия, что мы не получим суму значений пропорций более 100%. Мы ограничиваем значение a интервалом + [0,1], потому всегда получаем вывод из пропорционального смешения двух показателей, с сумой смесителей не превышающей 100%. +

    +

    + Но... что если мы воспользуемся этой формой, предполагающей что мы всегда будем использовать значения а между 0 и 1, для подсчета точек + лежащих вне стандартного интервала? Наши вычисления пойдут по откосной? Что ж... не то чтобы, мы попросту увидим больше. +

    +

    + В случае кривой Безье, продление интервала просто продляет развитие кривой. Кривые Безье всего лишь сегменты полиноминальных кривых. + Выбирая более широкий отрезок — мы можем проследить дальнейшее развитие кривой. И как же это выглядит? +

    +

    + Ниже представлены графики кривых Безье, зарисованных "обычным путем", по верх зарисовки кривых на которых они лежат, полученных путем + расширения интервала значений t. Как мы можем наблюдать, кривая Безье есть частью большей формы, скрытой за границами нашего + интервала, которую мы так-же можем регулировать путем перемещения контрольных точек. +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    -

    В области компьютерной графики, существуют множество кривых, которые действуют по противоположному кривым Безье принципу: вместо фиксированного интервала и свободного выбора контрольных точек формирующих развитие искривлений, они фиксируют форму кривой, предоставляя возможность выбора интервала. Отличным примером последней есть кривая "Spiro", которая частично базируется на спирали Корню, также известной как спираль Эйлера (* в оригинале другая ссылка на Корню и Эйлера ). Эту эстетически приятную кривую можно встретить в нескольких графических пакетах: FontForge и Inkscape. Ее даже используют в дизайне шрифтов, например в начертания шрифта Inconsolata.

    - -
    -
    -

    Кривые Безье как матричные уравнения

    -

    Мы также можем представить кривые Безье как матричную операцию, выразив формулу Безье как функцию с полиноминальноминальной основой, матрицу коэффициентов и матрицу конкретных координат. Давайте рассмотрим что это значит для уравнений кубических кривых Безье, используя P... для обозначения координат в "одном или более пространстве".

    - - -

    Обобщив, игнорируя конкретные значения, мы получим:

    - - -

    Это, в свою очередь, может быть записано как:

    - - -

    Последнее мы можем раскрыть, записав как:

    - - -

    Более того можно записать с коэффициентами 1 и 0, включив нивелированные термины:

    - - -

    И уже это можно рассматривать как серию четырех матричных операций:

    - - -

    Скомбинировав в единую матричную операцию, получим:

    - - -

    Такой тип функций полиноминальной основы зачастую записывается с основой в возрастающем порядке, что значит мы должны обернуть нашу t матрицу горизонтально, а нашу "большую" матрицу — вертикально:

    - - -

    Наконец, мы можем добавить оригинальные координаты единой третьей матрицей:

    - - -

    Такой же фокус может быть проделан с квадратной кривой, в коем случае мы получаем:

    - - -

    Подставив t и перемножив матрицы, мы получим такие-же значения, как при подсчете с использованием исходной полиноминальной функции или графического метода интерполяции.

    -

    Итак: зачем нам возится с матрицами? Матричное произведение раскрывает свойства функции кривых, которые в противном случае, было бы сложно обнаружить. Например, мы видим, что наша функция принадлежит к типу треугольных матриц (* в оригинале другая ссылка), определенные количеством контрольных координат и обладают всеми соответствующими свойствами. Также, что они могут быть обернуты, что в свою очередь определяет тонну других свойств (* в оригинале другая ссылка), применимых к нашим кривым. Конечно же, основным вопросом остается: "В чем состоит польза?". Тогда как ответ не становится очевидным немедленно, чуть далее мы увидим определенные случаи, где некоторые свойства кривых могут быть исчислены посредством манипуляции функцией, либо остроумным использованием матриц, и иногда последнее намного быстрее.

    -

    Потому пока давайте запомним, что функции могут быть описаны таким образом, и будем двигаться дальше.

    + +

    + Подставив t и перемножив матрицы, мы получим такие-же значения, как при подсчете с использованием исходной полиноминальной + функции или графического метода интерполяции. +

    +

    + Итак: зачем нам возится с матрицами? Матричное произведение раскрывает свойства функции кривых, которые в противном + случае, было бы сложно обнаружить. Например, мы видим, что наша функция принадлежит к типу + треугольных матриц + (* в оригинале другая ссылка), определенные количеством контрольных + координат и обладают всеми соответствующими свойствами. Также, что они могут быть обернуты, что в свою очередь определяет + тонну других свойств + (* в оригинале другая ссылка), применимых к + нашим кривым. Конечно же, основным вопросом остается: "В чем состоит польза?". Тогда как ответ не становится + очевидным немедленно, чуть далее мы увидим определенные случаи, где некоторые свойства кривых могут быть исчислены посредством + манипуляции функцией, либо остроумным использованием матриц, и иногда последнее намного быстрее. +

    +

    Потому пока давайте запомним, что функции могут быть описаны таким образом, и будем двигаться дальше.

    +
    +
    +

    + + Алгоритм де Кастельжо +

    +

    + Для зарисовки кривой Безье, мы можем пробежаться по всем значениям t от 0 до 1 и скомпилировать вывод базовой функции с + подставленными весами для каждого значения. К сожалению, чем замысловатее кривая, тем дороже обходиться это обчисление. Вместо этого, мы + можем воспользоваться Алгоритмом де Кастельжо для прорисовки кривых. Этот алгоритм позволяет прорисовывать кривые базируясь на + геометрических вычислениях и довольно прост в применении. На деле, настолько прост, что его можно воплотить при помощи карандаша и + линейки. +

    +

    + Вместо использования функции математического анализа для нахождения значений x/y для t, давайте сделаем + следующее: +

    +
      +
    • Примем t за пропорцию(чем оно и является). t=0 будет 0% вдоль линии, t=1, соответсвенно, 100% вдоль линии.
    • +
    • Возьмем все линии между заданными контрольными точками. Для кривой n-го порядка это n линий
    • +
    • + Разместим маркеры вдоль линий, соответсвенно значению t. Так, если t 0.2, разместим маркер на 20% от начала, + и, соответственно, 80% от конца. +
    • +
    • Теперь соединим полученные точки линиями. Это дает нам n-1 линий.
    • +
    • Далее разместим маркеры на новых линиях, соответсвенно тому-же значению t.
    • +
    • + Продолжим повторять два предыдущих шага, пока на выводе у нас останется всего одна линия. Маркер t на этой линии будет + соответствовать точке для t на нашей кривой. +
    • +
    +

    + Чтобы проверить это в действии, ведите ползунок под ниже представленным графиком слева направо и наоборот, задавая разные значения + t для иллюстрации геометрического построения. +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    +

    Имплементация Алгоритма де Кастельжо

    +

    Запишем согласно предложенному алгоритму:

    -
    -
    -

    Алгоритм де Кастельжо

    -

    Для зарисовки кривой Безье, мы можем пробежаться по всем значениям t от 0 до 1 и скомпилировать вывод базовой функции с подставленными весами для каждого значения. К сожалению, чем замысловатее кривая, тем дороже обходиться это обчисление. Вместо этого, мы можем воспользоваться Алгоритмом де Кастельжо для прорисовки кривых. Этот алгоритм позволяет прорисовывать кривые базируясь на геометрических вычислениях и довольно прост в применении. На деле, настолько прост, что его можно воплотить при помощи карандаша и линейки.

    -

    Вместо использования функции математического анализа для нахождения значений x/y для t, давайте сделаем следующее:

    -
      -
    • Примем t за пропорцию(чем оно и является). t=0 будет 0% вдоль линии, t=1, соответсвенно, 100% вдоль линии.
    • -
    • Возьмем все линии между заданными контрольными точками. Для кривой n-го порядка это n линий
    • -
    • Разместим маркеры вдоль линий, соответсвенно значению t. Так, если t 0.2, разместим маркер на 20% от начала, и, соответственно, 80% от конца.
    • -
    • Теперь соединим полученные точки линиями. Это дает нам n-1 линий.
    • -
    • Далее разместим маркеры на новых линиях, соответсвенно тому-же значению t.
    • -
    • Продолжим повторять два предыдущих шага, пока на выводе у нас останется всего одна линия. Маркер t на этой линии будет соответствовать точке для t на нашей кривой.
    • -
    -

    Чтобы проверить это в действии, ведите ползунок под ниже представленным графиком слева направо и наоборот, задавая разные значения t для иллюстрации геометрического построения.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -
    - -

    Имплементация Алгоритма де Кастельжо

    -

    Запишем согласно предложенному алгоритму:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    Готово, алгоритм воплощен. Помимо того факта, что зачастую мы не располагаем роскошью перенагрузки оператора "+", так давайте совместим вычисление для значений x и y:

    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    И что это делает? Это зарисует точку на график, если длинна вводной points составляет всего одну точку. В противном случае — это создает новый список для вводной, составленный из "маркеров", обчисленых пропорционально значению t между поточного списка точек вводной и вызовет саму себя с новой вводной.

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Simplified drawing

    -

    We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines.

    -

    We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or curvature alignment.

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - -
    +

    + И что это делает? Это зарисует точку на график, если длинна вводной points составляет всего одну точку. В противном случае — это создает + новый список для вводной, составленный из "маркеров", обчисленых пропорционально значению t между поточного списка точек вводной + и вызовет саму себя с новой вводной. +

    + +
    +
    +

    + + Simplified drawing +

    +

    + We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, + a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines. +

    +

    + We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the + number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we + can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we + lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or + curvature alignment. +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    -

    Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the cubic curve), a higher number is required to capture the curvature changes properly.

    -
    +

    + Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. + You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the + cubic curve), a higher number is required to capture the curvature changes properly. +

    +
    +

    How to implement curve flattening

    +

    Let's just use the algorithm we just specified, and implement that:

    -

    How to implement curve flattening

    -

    Let's just use the algorithm we just specified, and implement that:

    - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    And done, that's the algorithm implemented. That just leaves drawing the resulting "curve" as a sequence of lines:

    + return coordinates; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

    -
    + coord = _coord + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -
    -
    -

    Splitting curves

    -

    Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -
    +

    We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

    +
    +
    +
    +

    + + Splitting curves +

    +

    + Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken + together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the + points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our + on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point. +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    +

    implementing curve splitting

    +

    We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

    -

    implementing curve splitting

    -

    We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

    - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    - -

    After running this function for some value t, the left and right arrays will contain all the coordinates for two new curves - one to the "left" of our t value, the other on the "right". These new curves will have the same order as the original curve, and can be overlaid exactly on the original curve.

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + -
    -
    -

    Splitting curves using matrices

    -

    Another way to split curves is to exploit the matrix representation of a Bézier curve. In the section on matrices, we saw that we can represent curves as matrix multiplications. Specifically, we saw these two forms for the quadratic and cubic curves respectively: (we'll reverse the Bézier coefficients vector for legibility)

    - - -

    and

    - - -

    Let's say we want to split the curve at some point t = z, forming two new (obviously smaller) Bézier curves. To find the coordinates for these two Bézier curves, we can use the matrix representation and some linear algebra. First, we separate out the actual "point on the curve" information into a new matrix multiplication:

    - - -

    and

    - - -

    If we could compact these matrices back to the form [t values] · [Bézier matrix] · [column matrix], with the first two staying the same, then that column matrix on the right would be the coordinates of a new Bézier curve that describes the first segment, from t = 0 to t = z. As it turns out, we can do this quite easily, by exploiting some simple rules of linear algebra (and if you don't care about the derivations, just skip to the end of the box for the results!).

    -
    - -

    Deriving new hull coordinates

    -

    Deriving the two segments upon splitting a curve takes a few steps, and the higher the curve order, the more work it is, so let's look at the quadratic curve first:

    - - - - - - - - -

    We can do this because [M · M-1] is the identity matrix. It's a bit like multiplying something by x/x in calculus: it doesn't do anything to the function, but it does allow you to rewrite it to something that may be easier to work with, or can be broken up differently. In the same way, multiplying our matrix by [M · M-1] has no effect on the total formula, but it does allow us to change the matrix sequence [something · M] to a sequence [M · something], and that makes a world of difference: if we know what [M-1 · Z · M] is, we can apply that to our coordinates, and be left with a proper matrix representation of a quadratic Bézier curve (which is [T · M · P]), with a new set of coordinates that represent the curve from t = 0 to t = z. So let's get computing:

    - - -

    Excellent! Now we can form our new quadratic curve:

    - - - - - - -

    Brilliant: if we want a subcurve from t = 0 to t = z, we can keep the first coordinate the same (which makes sense), our control point becomes a z-ratio mixture of the original control point and the start point, and the new end point is a mixture that looks oddly similar to a Bernstein polynomial of degree two. These new coordinates are actually really easy to compute directly!

    -

    Of course, that's only one of the two curves. Getting the section from t = z to t = 1 requires doing this again. We first observe that in the previous calculation, we actually evaluated the general interval [0,z]. We were able to write it down in a more simple form because of the zero, but what we actually evaluated, making the zero explicit, was:

    - - - - -

    If we want the interval [z,1], we will be evaluating this instead:

    - - - - -

    We're going to do the same trick of multiplying by the identity matrix, to turn [something · M] into [M · something]:

    - - -

    So, our final second curve looks like:

    - - - - - - -

    Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to compute directly!

    -
    + +

    + Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio + mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein + polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to + compute directly! +

    + -

    So, using linear algebra rather than de Casteljau's algorithm, we have determined that, for any quadratic curve split at some value t = z, we get two subcurves that are described as Bézier curves with simple-to-derive coordinates:

    - - -

    and

    - - -

    We can do the same for cubic curves. However, I'll spare you the actual derivation (don't let that stop you from writing that out yourself, though) and simply show you the resulting new coordinate sets:

    - - -

    and

    - - -

    So, looking at our matrices, did we really need to compute the second segment matrix? No, we didn't. Actually having one segment's matrix means we implicitly have the other: push the values of each row in the matrix Q to the right, with zeroes getting pushed off the right edge and appearing back on the left, and then flip the matrix vertically. Presto, you just "calculated" Q'.

    -

    Implementing curve splitting this way requires less recursion, and is just straight arithmetic with cached values, so can be cheaper on systems where recursion is expensive. If you're doing computation with devices that are good at matrix multiplication, chopping up a Bézier curve with this method will be a lot faster than applying de Casteljau.

    - -
    -
    -

    Lowering and elevating curve order

    -

    One interesting property of Bézier curves is that an nth order curve can always be perfectly represented by an (n+1)th order curve, by giving the higher-order curve specific control points.

    -

    If we have a curve with three points, then we can create a curve with four points that exactly reproduces the original curve. First, we give it the same start and end points, and for its two control points we pick "1/3rd start + 2/3rd control" and "2/3rd control + 1/3rd end". Now we have exactly the same curve as before, except represented as a cubic curve rather than a quadratic curve.

    -

    The general rule for raising an nth order curve to an (n+1)th order curve is as follows (observing that the start and end weights are the same as the start and end weights for the old curve):

    - - -

    However, this rule also has as direct consequence that you cannot generally safely lower a curve from nth order to (n-1)th order, because the control points cannot be "pulled apart" cleanly. We can try to, but the resulting curve will not be identical to the original, and may in fact look completely different.

    -

    However, there is a surprisingly good way to ensure that a lower order curve looks "as close as reasonably possible" to the original curve: we can optimise the "least-squares distance" between the original curve and the lower order curve, in a single operation (also explained over on Sirver's Castle). However, to use it, we'll need to do some calculus work and then switch over to linear algebra. As mentioned in the section on matrix representations, some things can be done much more easily with matrices than with calculus functions, and this is one of those things. So... let's go!

    -

    We start by taking the standard Bézier function, and condensing it a little:

    - - -

    Then, we apply one of those silly (actually, super useful) calculus tricks: since our t value is always between zero and one (inclusive), we know that (1-t) plus t always sums to 1. As such, we can express any value as a sum of t and 1-t:

    - - -

    So, with that seemingly trivial observation, we rewrite that Bézier function by splitting it up into a sum of a (1-t) and t component:

    - - -

    So far so good. Now, to see why we did this, let's write out the (1-t) and t parts, and see what that gives us. I promise, it's about to make sense. We start with (1-t):

    - - -

    So by using this seemingly silly trick, we can suddenly express part of our nth order Bézier function in terms of an (n+1)th order Bézier function. And that sounds a lot like raising the curve order! Of course we need to be able to repeat that trick for the t part, but that's not a problem:

    - - -

    So, with both of those changed from an order n expression to an order (n+1) expression, we can put them back together again. Now, where the order n function had a summation from 0 to n, the order n+1 function uses a summation from 0 to n+1, but this shouldn't be a problem as long as we can add some new terms that "contribute nothing". In the next section on derivatives, there is a discussion about why "higher terms than there is a binomial for" and "lower than zero terms" both "contribute nothing". So as long as we can add terms that have the same form as the terms we need, we can just include them in the summation, they'll sit there and do nothing, and the resulting function stays identical to the lower order curve.

    -

    Let's do this:

    - - -

    And this is where we switch over from calculus to linear algebra, and matrices: we can now express this relation between Bézier(n,t) and Bézier(n+1,t) as a very simple matrix multiplication:

    - - -

    where the matrix M is an n+1 by n matrix, and looks like:

    - - -

    That might look unwieldy, but it's really just a mostly-zeroes matrix, with a very simply fraction on the diagonal, and an even simpler fraction to the left of it. Multiplying a list of coordinates with this matrix means we can plug the resulting transformed coordinates into the one-order-higher function and get an identical looking curve.

    -

    Not too bad!

    -

    Equally interesting, though, is that with this matrix operation established, we can now use an incredibly powerful and ridiculously simple way to find out a "best fit" way to reverse the operation, called the normal equation. What it does is minimize the sum of the square differences between one set of values and another set of values. Specifically, if we can express that as some function A x = b, we can use it. And as it so happens, that's exactly what we're dealing with, so:

    - - -

    The steps taken here are:

    -
      -
    1. We have a function in a form that the normal equation can be used with, so
    2. -
    3. apply the normal equation!
    4. -
    5. Then, we want to end up with just Bn on the left, so we start by left-multiply both sides such that we'll end up with lots of stuff on the left that simplified to "a factor 1", which in matrix maths is the identity matrix.
    6. -
    7. In fact, by left-multiplying with the inverse of what was already there, we've effectively "nullified" (but really, one-inified) that big, unwieldy block into the identity matrix I. So we substitute the mess with I, and then
    8. -
    9. because multiplication with the identity matrix does nothing (like multiplying by 1 does nothing in regular algebra), we just drop it.
    10. -
    -

    And we're done: we now have an expression that lets us approximate an n+1th order curve with a lower nth order curve. It won't be an exact fit, but it's definitely a best approximation. So, let's implement these rules for raising and lowering curve order to a (semi) random curve, using the following graphic. Select the sketch, which has movable control points, and press your up and down arrow keys to raise or lower the curve order.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - -
    -
    -

    Производные кривых Безье

    -

    Есть целый ряд вещей, которые мы можем сотворить с кривыми Безье базируясь на их производных, и одним восхитительным наблюдением на счет первых, является то, что их производные по сути тоже являются кривыми Безье. На практике, дифференциация кривых Безье относительно логична, хотя нам и потребуется немного математики.

    -

    Для начала рассмотрим правило производных для кривых Безье:

    - - -

    которое мы так же можем записать (отметив что b в этой формуле, то-же что наш w "вес", а n умноженная на функцию сумы — то-же что функция сумы, с каждой составляющей умноженной на n) как:

    - - -

    Другими словами, производная кривой Безье nго порядка равна кривой Безье на порядок ниже (n-1), соответственно имея на один термин составляющих меньше, где новые веса w'0...w'n-1 произведены из оригинальных по принципу n(wi+1 - wi). Так, для кривой 3го порядка, с 4мя весами, производная имеет 3 веса: w'0 = 3(w1-w0), w'1 = 3(w2-w1) и w'2 = 3(w3-w2).

    -
    - -

    "Погодите, а почему это работает?"

    -

    Порой, когда говорят "вот она производная", наше счастье приходит немедленно и остается неизменным вне зависимости от следующих аргументов. Но, возможно, вам захочется покопаться в доказательствах производной. Если так, то для начала оговоримся: веса независимы от общей формулы функции, потому производная кривой затрагивает только производные полиноминалов базовой функции. Найдем сперва это:

    - - -

    Приминение правил продукта и цепного правила (* в оригинале другие ссылки) дает нам следующее:

    - - -

    В таком виде с этой формулой работать сложно, потому раскроем:

    - - -

    Теперь фокус заключается в трансформации этой функции во что-то, что бы имело биноминальные коэффициенты, потому как мы хотим получить выражение в форме вроде "x! деленное на y!(x-y)!". Если нам удастся сделать это так, чтобы учесть в запись термины n-1 и k-1 — мы на верном пути.

    - - -

    Первая часть готова: компоненты в скобках на самом то деле обычные выражения кривых Безье на порядок ниже:

    - - -

    Теперь применим это к записям наших формул с "весами". Начнем с формулы кривой Безье приведенной выше и пройдемся по ее производным.

    - - -

    Раскрыв это (немного раскрасив, чтобы подчеркнуть соотношение терминов), и выстроив термины по возрастанию значений, мы увидим следующее:

    - - -

    Два из приведенных терминов отпадают. Первый — потому что степень -1 не присутствует в суме, оттого каждый раз приводя "ничего" в результат. Можем смело игнорировать это значение в нахождении производной функции. Второй — самый последний термин этой записи, затрагивающий Bn-1,n. Этот термин имел-бы биноминальный коэффициент [i указывая на i+1], который не существуюет. И опять — термин ничего не привносит в результат, затем также может быть проигнорирован. Это значит, нам остается:

    - - -

    И это по сути формула функции сумы на 1 порядок ниже:

    - - -

    Можно переписать по стандартной форме сумы, и готово:

    - - -
    + + -

    Давайте перепишем это по форме схожей с нашей исходной формулой, чтобы легче было разглядеть разницу. Сначала оригинальная формула, за ней производная:

    - - - - -

    И в чем же разница? В терминах формулы кривой Безье, по сути, никакой! Мы уменьшили порядок (вместо порядка n, он теперь n-1), но это все та же функция Безье. Единственное отличие в подсчете изменений в "весах" при нахождении производной. К примеру, исходя из 4-х контрольных точек A, B, C и D, первая производная получит 3 точки, вторая — 2, третья — 1:

    - - -

    Можно продолжать производить этот фокус, до тех пор пока у нас имеется более одного веса. Когда же остается один вес, следующим шагом будет k = 0, и результат сложения "функции сумы Безье" будет равен 0, поскольку мы ничего ни с чем не слагаем. По этому у квадратной функций нету второй производной, у кубической — третей, и, обобщая, кривая Безье nго порядка, имеет n-1 (внятных) производных, с каждой следующей производной равной нулю.

    - -
    -
    -

    Tangents and normals

    -

    If you want to move objects along a curve, or "away from" a curve, the two vectors you're most interested in are the tangent vector and normal vector for curve points. These are actually really easy to find. For moving and orienting along a curve, we use the tangent, which indicates the direction of travel at specific points, and is literally just the first derivative of our curve:

    - - -

    This gives us the directional vector we want. We can normalize it to give us uniform directional vectors (having a length of 1.0) at each point, and then do whatever it is we want to do based on those directions:

    - - -

    The tangent is very useful for moving along a line, but what if we want to move away from the curve instead, perpendicular to the curve at some point t? In that case we want the normal vector. This vector runs at a right angle to the direction of the curve, and is typically of length 1.0, so all we have to do is rotate the normalized directional vector and we're done:

    - - -
    - -

    Rotating coordinates is actually very easy, if you know the rule for it. You might find it explained as "applying a rotation matrix, which is what we'll look at here, too. Essentially, the idea is to take the circles over which we can rotate, and simply "sliding the coordinates" over these circles by the desired -angle. If we want a quarter circle turn, we take the coordinate, slide it along the circle by a quarter turn, and done.

    -

    To turn any point (x,y) into a rotated point (x',y') (over 0,0) by some angle φ, we apply this nice and easy computation:

    - - -

    Which is the "long" version of the following matrix transformation:

    - - -

    And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1.

    -

    But why does this work? Why this matrix multiplication? Wikipedia (technically, Thomas Herter and Klaus Lott) tells us that a rotation matrix can be -treated as a sequence of three (elementary) shear operations. When we combine this into a single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. DataGenetics have an excellent article about this very thing: it's really quite cool, and I strongly recommend taking a quick break from this primer to read that article.

    -
    + +

    + And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even + easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1. +

    +

    + But why does this work? Why this matrix multiplication? + Wikipedia (technically, Thomas Herter and Klaus + Lott) tells us that a rotation matrix can be treated as a sequence of three (elementary) shear operations. When we combine this into a + single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. + DataGenetics have an excellent article about this very thing: it's + really quite cool, and I strongly recommend taking a quick break from this primer to read that article. +

    + -

    The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant).

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    +

    + The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the + normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant). +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    +
    +
    +

    + + Working with 3D normals +

    +

    + Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things + this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but + for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, + it's not "super hard", but there are more steps involved and we should have a look at those. +

    +

    + Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. + However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, + not a single vector, so we basically need to define what "the" normal is in the 3D case. +

    +

    + The "naïve" approach is to construct what is known as the + Frenet normal, where we follow a simple recipe that works in + many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are + perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each + point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the + local tangent, as well as the tangents "right next to it". +

    +

    + Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the + vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know + lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same + logic we used for the 2D case: "just rotate it 90 degrees". +

    +

    So let's do that! And in a twist surprise, we can do this in four lines:

    +
      +
    • a = normalize(B'(t))
    • +
    • b = normalize(a + B''(t))
    • +
    • r = normalize(b × a)
    • +
    • normal = normalize(r × a)
    • +
    +

    Let's unpack that a little:

    +
      +
    • + We start by taking the normalized vector for the derivative at some point on the + curve. We normalize it so the maths is less work. Less work is good. +
    • +
    • + Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and + just had the same derivative and second derivative from that point on. +
    • +
    • + This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which + means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the + cross product. (Note that while that operation uses the × operator, it's most + definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent + a quarter circle to get our normal, just like we did in the 2D case. +
    • +
    • + Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal + vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second + time, and immediately get our normal vector. +
    • +
    +

    + And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can + move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor + position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc: +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    + However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the + curve" between t=0.65 and t=0.75... Why is it doing that? +

    +

    + As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are + "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute + normals that just... look good. +

    +

    Thankfully, Frenet normals are not our only option.

    +

    + Another option is to take a slightly more algorithmic approach and compute a form of + Rotation Minimising Frame + (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational + axis, and the normal vector, centered on an on-curve point. +

    +

    + These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we + could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at + the same time that you're building lookup tables for your curve. +

    +

    + The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by + applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are: +

    +
      +
    • Take a point on the curve for which we know the RM frame already,
    • +
    • take a next point on the curve for which we don't know the RM frame yet, and
    • +
    • + reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous + points as a "mirror". +
    • +
    • + This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal + that's slightly off-kilter, so: +
    • +
    • + reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror". +
    • +
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • +
    +

    So, let's write some code for that!

    +
    +

    Implementing Rotation Minimising Frames

    +

    + We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it + yields a frame with properties: +

    -
    -
    -

    Working with 3D normals

    -

    Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, it's not "super hard", but there are more steps involved and we should have a look at those.

    -

    Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, not a single vector, so we basically need to define what "the" normal is in the 3D case.

    -

    The "naïve" approach is to construct what is known as the Frenet normal, where we follow a simple recipe that works in many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the local tangent, as well as the tangents "right next to it".

    -

    Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same logic we used for the 2D case: "just rotate it 90 degrees".

    -

    So let's do that! And in a twist surprise, we can do this in four lines:

    -
      -
    • a = normalize(B'(t))
    • -
    • b = normalize(a + B''(t))
    • -
    • r = normalize(b × a)
    • -
    • normal = normalize(r × a)
    • -
    -

    Let's unpack that a little:

    -
      -
    • We start by taking the normalized vector for the derivative at some point on the curve. We normalize it so the maths is less work. Less work is good.
    • -
    • Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and just had the same derivative and second derivative from that point on.
    • -
    • This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the cross product. (Note that while that operation uses the × operator, it's most definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent a quarter circle to get our normal, just like we did in the 2D case.
    • -
    • Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second time, and immediately get our normal vector.
    • -
    -

    And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the curve" between t=0.65 and t=0.75... Why is it doing that?

    -

    As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute normals that just... look good.

    -

    Thankfully, Frenet normals are not our only option.

    -

    Another option is to take a slightly more algorithmic approach and compute a form of Rotation Minimising Frame (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational axis, and the normal vector, centered on an on-curve point.

    -

    These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at the same time that you're building lookup tables for your curve.

    -

    The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are:

    -
      -
    • Take a point on the curve for which we know the RM frame already,
    • -
    • take a next point on the curve for which we don't know the RM frame yet, and
    • -
    • reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous points as a "mirror".
    • -
    • This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal that's slightly off-kilter, so:
    • -
    • reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror".
    • -
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • -
    -

    So, let's write some code for that!

    -
    - -

    Implementing Rotation Minimising Frames

    -

    We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it yields a frame with properties:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    - -

    Then, we can write a function that generates a sequence of RM frames in the following manner:

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    - -

    Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more code to get much better looking normals.

    -
    + frames.add(x1) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + -

    Speaking of better looking, what does this actually look like? Let's revisit that earlier curve, but this time use rotation minimising frames rather than Frenet frames:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    That looks so much better!

    -

    For those reading along with the code: we don't even strictly speaking need a Frenet frame to start with: we could, for instance, treat the z-axis as our initial axis of rotation, so that our initial normal is (0,0,1) × tangent, and then take things from there, but having that initial "mathematically correct" frame so that the initial normal seems to line up based on the curve's orientation in 3D space is just nice.

    +

    + Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more + code to get much better looking normals. +

    + -
    -
    -

    Component functions

    -

    One of the first things people run into when they start using Bézier curves in their own programs is "I know how to draw the curve, but how do I determine the bounding box?". It's actually reasonably straightforward to do so, but it requires having some knowledge on exploiting math to get the values we need. For bounding boxes, we aren't actually interested in the curve itself, but only in its "extremities": the minimum and maximum values the curve has for its x- and y-axis values. If you remember your calculus (provided you ever took calculus, otherwise it's going to be hard to remember) we can determine function extremities using the first derivative of that function, but this poses a problem, since our function is parametric: every axis has its own function.

    -

    The solution: compute the derivative for each axis separately, and then fit them back together in the same way we do for the original.

    -

    Let's look at how a parametric Bézier curve "splits up" into two normal functions, one for the x-axis and one for the y-axis. Note the leftmost figure is again an interactive curve, without labeled axes (you get coordinates in the graph instead). The center and rightmost figures are the component functions for computing the x-axis value, given a value for t (between 0 and 1 inclusive), and the y-axis value, respectively.

    -

    If you move points in a curve sideways, you should only see the middle graph change; likewise, moving points vertically should only show a change in the right graph.

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

     

    - - - Скрипты отключены. Показываем резервное изображение. - - - - -
    -
    -

    Finding extremities: root finding

    -

    Now that we understand (well, superficially anyway) the component functions, we can find the extremities of our Bézier curve by finding maxima and minima on the component functions, by solving the equation B'(t) = 0. We've already seen that the derivative of a Bézier curve is a simpler Bézier curve, but how do we solve the equality? Fairly easily, actually, until our derivatives are 4th order or higher... then things get really hard. But let's start simple:

    -

    Quadratic curves: linear derivatives.

    -

    The derivative of a quadratic Bézier curve is a linear Bézier curve, interpolating between just two terms, which means finding the solution for "where is this line 0" is effectively trivial by rewriting it to a function of t and solving. First we turn our quadratic Bézier function into a linear one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then we turn this into our solution for t using basic arithmetics:

    - - -

    Done.

    -

    Although with the caveat that if b-a is zero, there is no solution and we probably shouldn't try to perform that division.

    -

    Cubic curves: the quadratic formula.

    -

    The derivative of a cubic Bézier curve is a quadratic Bézier curve, and finding the roots for a quadratic polynomial means we can apply the Quadratic formula. If you've seen it before, you'll remember it, and if you haven't, it looks like this:

    - - -

    So, if we can rewrite the Bézier component function as a plain polynomial, we're done: we just plug in the values into the quadratic formula, check if that square root is negative or not (if it is, there are no roots) and then just compute the two values that come out (because of that plus/minus sign we get two). Any value between 0 and 1 is a root that matters for Bézier curves, anything below or above that is irrelevant (because Bézier curves are only defined over the interval [0,1]). So, how do we convert?

    -

    First we turn our cubic Bézier function into a quadratic one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then, using these v values, we can find out what our a, b, and c should be:

    - - -

    This gives us three coefficients {a, b, c} that are expressed in terms of v values, where the v values are expressions of our original coordinate values, so we can do some substitution to get:

    - - -

    Easy-peasy. We can now almost trivially find the roots by plugging those values into the quadratic formula.

    -

    And as a cubic curve, there is also a meaningful second derivative, which we can compute by simple taking the derivative of the derivative.

    -

    Quartic curves: Cardano's algorithm.

    -

    We haven't really looked at them before now, but the next step up would be a Quartic curve, a fourth degree Bézier curve. As expected, these have a derivative that is a cubic function, and now things get much harder. Cubic functions don't have a "simple" rule to find their roots, like the quadratic formula, and instead require quite a bit of rewriting to a form that we can even start to try to solve.

    -

    Back in the 16th century, before Bézier curves were a thing, and even before calculus itself was a thing, Gerolamo Cardano figured out that even if the general cubic function is really hard to solve, it can be rewritten to a form for which finding the roots is "easier" (even if not "easy"):

    - - -

    We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather than three: this makes things considerably easier to solve because it lets us use regular calculus to find the values that satisfy the equation.

    -

    Now, there is one small hitch: as a cubic function, the solutions may be complex numbers rather than plain numbers... And Cardano realised this, centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number.

    -

    So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at Ken J. Ward's page for solving the cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a read-through.

    -
    + +

    + We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather + than three: this makes things considerably easier to solve because it lets us use + regular calculus to find the values that satisfy the equation. +

    +

    + Now, there is one small hitch: as a cubic function, the solutions may be + complex numbers rather than plain numbers... And Cardano realised this, + centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these + numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we + have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're + going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number. +

    +

    + So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at + Ken J. Ward's page for solving the + cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the + complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a + read-through. +

    +
    +

    Implementing Cardano's algorithm for finding all real roots

    +

    + The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" + number space (denoted with ℝ), this is perfectly fine in the + "complex" number space (denoted with ℂ). And, as it so happens, Cardano is + also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm! +

    -

    Implementing Cardano's algorithm for finding all real roots

    -

    The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" number space (denoted with ℝ), this is perfectly fine in the "complex" number space (denoted with ℂ). And, as it so happens, Cardano is also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm!

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    37
    38
    39
    40
    41
    42
    43
    44
    45
    46
    47
    48
    49
    50
    51
    52
    53
    54
    55
    56
    57
    58
    59
    60
    61
    62
    63
    64
    65
    66
    67
    68
    69
    70
    71
    72
    73
    74
    75
    76
    77
    78
    79
    80
    81
    - -
    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + + 37 + + + 38 + + + 39 + + + 40 + + + 41 + + + 42 + + + 43 + + + 44 + + + 45 + + + 46 + + + 47 + + + 48 + + + 49 + + + 50 + + + 51 + + + 52 + + + 53 + + + 54 + + + 55 + + + 56 + + + 57 + + + 58 + + + 59 + + + 60 + + + 61 + + + 62 + + + 63 + + + 64 + + + 65 + + + 66 + + + 67 + + + 68 + + + 69 + + + 70 + + + 71 + + + 72 + + + 73 + + + 74 + + + 75 + + + 76 + + + 77 + + + 78 + + + 79 + + + 80 + + + 81 + + +
    -

    And that's it. The maths is complicated, but the code is pretty much just "follow the maths, while caching as many values as we can to prevent recomputing things as much as possible" and now we have a way to find all roots for a cubic function and can just move on with using that to find extremities of our curves.

    -

    And of course, as a quartic curve also has meaningful second and third derivatives, we can quite easily compute those by using the derivative of the derivative (of the derivative), just as for cubic curves.

    -

    Quintic and higher order curves: finding numerical solutions

    -

    And this is where thing stop, because we cannot find the roots for polynomials of degree 5 or higher using algebra (a fact known as the Abel–Ruffini theorem). Instead, for occasions like these, where algebra simply cannot yield an answer, we turn to numerical analysis.

    -

    That's a fancy term for saying "rather than trying to find exact answers by manipulating symbols, find approximate answers by describing the underlying process as a combination of steps, each of which can be assigned a number via symbolic manipulation". For example, trying to mathematically compute how much water fits in a completely crazy three dimensional shape is very hard, even if it got you the perfect, precise answer. A much easier approach, which would be less perfect but still entirely useful, would be to just grab a buck and start filling the shape until it was full: just count the number of buckets of water you used. And if we want a more precise answer, we can use smaller buckets.

    -

    So that's what we're going to do here, too: we're going to treat the problem as a sequence of steps, and the smaller we can make each step, the closer we'll get to that "perfect, precise" answer. And as it turns out, there is a really nice numerical root-finding algorithm, called the Newton-Raphson root finding method (yes, after that Newton), which we can make use of. The Newton-Raphson approach consists of taking our impossible-to-solve function f(x), picking some initial value x (literally any value will do), and calculating f(x). We can think of that value as the "height" of the function at x. If that height is zero, we're done, we have found a root. If it isn't, we calculate the tangent line at f(x) and calculate at which x value its height is zero (which we've already seen is very easy). That will give us a new x and we repeat the process until we find a root.

    -

    Mathematically, this means that for some x, at step n=1, we perform the following calculation until fy(x) is zero, so that the next t is the same as the one we already have:

    - - -

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    -

    Now, this works well only if we can pick good starting points, and our curve is continuously differentiable and doesn't have oscillations. Glossing over the exact meaning of those terms, the curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: which starting points do we pick?

    -

    As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as high as you'll ever need to go.

    -

    In conclusion:

    -

    So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    - - - Скрипты отключены. Показываем резервное изображение. - - - + +

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    +

    + Now, this works well only if we can pick good starting points, and our curve is + continuously differentiable and doesn't have + oscillations. Glossing over the exact meaning of those terms, the + curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: + which starting points do we pick? +

    +

    + As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from + t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may + pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: + there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order + curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as + high as you'll ever need to go. +

    +

    In conclusion:

    +

    + So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those + roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red: +

    + + + Скрипты отключены. Показываем резервное изображение. + + +

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    + + + Скрипты отключены. Показываем резервное изображение. + + +
    +
    +

    + + Bounding boxes +

    +

    + If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four + values we need to box in our curve: +

    +

    Computing the bounding box for a Bézier curve:

    +
      +
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. +
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. +
    5. + Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original + functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct. +
    6. +
    +

    + Applying this approach to our previous root finding, we get the following + axis-aligned bounding boxes (with all curve extremity points + shown on the curve): +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    -
    -
    -

    Bounding boxes

    -

    If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four values we need to box in our curve:

    -

    Computing the bounding box for a Bézier curve:

    -
      -
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. -
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. -
    5. Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct.
    6. -
    -

    Applying this approach to our previous root finding, we get the following axis-aligned bounding boxes (with all curve extremity points shown on the curve):

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    - -

    We can construct even nicer boxes by aligning them along our curve, rather than along the x- and y-axis, but in order to do so we first need to look at how aligning works.

    - -
    -
    -

    Выравнивание (пересчет) кривых

    -

    В то время как есть бесконечное множество кривых мы можем задать перемещая x и y кооординаты контрольных точек, не все кривые являются различными между собой. К примеру, если мы зададим кривую, затем повернем ее на 90 градусов, это будет все та же кривая, и все свойста, как, к примеру, точки экстримов, будут на тех же ее участках, только зарисованы на полотне координат в другой точке. Таким образом, одним из способов убедится, что мы работаем с "уникальными" кривыми, является выравнивание их с осями координат.

    -

    Выравнивание так же упрощает уравнение кривой. Мы можем транслитерировать (переместить) кривую таким образом, что первая ее точка будет лежать на точке начала координат (0,0), что, в свою очередь, переведет наш полиноминальную функцию n порядка, на порядок ниже, (n-1). Последовательность останется такой же, но у нас будет меньше условий. Далее мы развернем кривую так, чтобы последняя точка тоже лежала на оси координат, переводя ее значение в (..., 0). Это далее упрощает ф-цию для ее формулы по Y переводя ее еще на порядок ниже. Так, если у нас была кубическая ф-ция:

    - - -

    Мы переместим первую точку в начало координат, смещая все значания x на -120 и y на -160:

    - - -

    Затем, повернем кривую до оси X (новые значания x и y округлены в иллюстративных целях):

    - - -

    Итак, опуская все нулевые значения, получаем

    - - -

    Как мы видим, наше изначальное определение кривой было существенно упрощено. Следущие зарисовки демострируют результат выравнивание кривой с осью X. первый рисунок соотвествует приведеному выше примеру кривой.

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

     

    - - - Скрипты отключены. Показываем резервное изображение. - - - + +

    + Как мы видим, наше изначальное определение кривой было существенно упрощено. Следущие зарисовки демострируют результат выравнивание кривой + с осью X. первый рисунок соотвествует приведеному выше примеру кривой. +

    + + + Скрипты отключены. Показываем резервное изображение. + + +

     

    + + + Скрипты отключены. Показываем резервное изображение. + + +
    +
    +

    + + Tight bounding boxes +

    +

    + With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, + recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding + box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T. +

    +

    We now have nice tight bounding boxes for our curves:

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    -
    -
    -

    Tight bounding boxes

    -

    With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T.

    -

    We now have nice tight bounding boxes for our curves:

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    - -

    These are, strictly speaking, not necessarily the tightest possible bounding boxes. It is possible to compute the optimal bounding box by determining which spanning lines we need to effect a minimal box area, but because of the parametric nature of Bézier curves this is actually a rather costly operation, and the gain in bounding precision is often not worth it.

    - -
    -
    -

    Curve inflections

    -

    Now that we know how to align a curve, there's one more thing we can calculate: inflection points. Imagine we have a variable size circle that we can slide up against our curve. We place it against the curve and adjust its radius so that where it touches the curve, the curvatures of the curve and the circle are the same, and then we start to slide the circle along the curve - for quadratic curves, we can always do this without the circle behaving oddly: we might have to change the radius of the circle as we slide it along, but it'll always sit against the same side of the curve.

    -

    But what happens with cubic curves? Imagine we have an S curve and we place our circle at the start of the curve, and start sliding it along. For a while we can simply adjust the radius and things will be fine, but once we get to the midpoint of that S, something odd happens: the circle "flips" from one side of the curve to the other side, in order for the curvatures to keep matching. This is called an inflection, and we can find out where those happen relatively easily.

    -

    What we need to do is solve a simple equation:

    - - -

    What we're saying here is that given the curvature function C(t), we want to know for which values of t this function is zero, meaning there is no "curvature", which will be exactly at the point between our circle being on one side of the curve, and our circle being on the other side of the curve. So what does C(t) look like? Actually something that seems not too hard:

    - - -

    The function C(t) is the cross product between the first and second derivative functions for the parametric dimensions of our curve. And, as already shown, derivatives of Bézier curves are just simpler Bézier curves, with very easy to compute new coefficients, so this should be pretty easy.

    -

    However as we've seen in the section on aligning, aligning lets us simplify things a lot, by completely removing the contributions of the first coordinate from most mathematical evaluations, and removing the last y coordinate as well by virtue of the last point lying on the x-axis. So, while we can evaluate C(t) = 0 for our curve, it'll be much easier to first axis-align the curve and then evaluating the curvature function.

    -
    - -

    Let's derive the full formula anyway

    -

    Of course, before we do our aligned check, let's see what happens if we compute the curvature function without axis-aligning. We start with the first and second derivatives, given our basis functions:

    - - -

    And of course the same functions for y:

    - - -

    Asking a computer to now compose the C(t) function for us (and to expand it to a readable form of simple terms) gives us this rather overly complicated set of arithmetic expressions:

    - - -

    That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all disappear if we axis-align our curve, which is why aligning is a great idea.

    -
    + +

    + That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all + disappear if we axis-align our curve, which is why aligning is a great idea. +

    + -

    Aligning our curve so that three of the eight coefficients become zero, and observing that scale does not affect finding t values, we end up with the following simple term function for C(t):

    - - -

    That's a lot easier to work with: we see a fair number of terms that we can compute and then cache, giving us the following simplification:

    - - -

    This is a plain quadratic curve, and we know how to solve C(t) = 0; we use the quadratic formula:

    - - -

    We can easily compute this value if the discriminator isn't a negative number (because we only want real roots, not complex roots), and if x is not zero, because divisions by zero are rather useless.

    -

    Taking that into account, we compute t, we disregard any t value that isn't in the Bézier interval [0,1], and we now know at which t value(s) our curve will inflect.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - -
    -
    -

    The canonical form (for cubic curves)

    -

    While quadratic curves are relatively simple curves to analyze, the same cannot be said of the cubic curve. As a curvature is controlled by more than one control point, it exhibits all kinds of features like loops, cusps, odd colinear features, and as many as two inflection points because the curvature can change direction up to three times. Now, knowing what kind of curve we're dealing with means that some algorithms can be run more efficiently than if we have to implement them as generic solvers, so is there a way to determine the curve type without lots of work?

    -

    As it so happens, the answer is yes, and the solution we're going to look at was presented by Maureen C. Stone from Xerox PARC and Tony D. deRose from the University of Washington in their joint paper "A Geometric Characterization of Parametric Cubic curves". It was published in 1989, and defines curves as having a "canonical" form (i.e. a form that all curves can be reduced to) from which we can immediately tell what features a curve will have. So how does it work?

    -

    The first observation that makes things work is that if we have a cubic curve with four points, we can apply a linear transformation to these points such that three of the points end up on (0,0), (0,1) and (1,1), with the last point then being "somewhere". After applying that transformation, the location of that last point can then tell us what kind of curve we're dealing with. Specifically, we see the following breakdown:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    This is a fairly funky image, so let's see what the various parts of it mean...

    -

    We see the three fixed points at (0,0), (0,1) and (1,1). The various regions and boundaries indicate what property the original curve will have, if the fourth point is in/on that region or boundary. Specifically, if the fourth point is...

    -
      -
    1. ...anywhere inside the red zone, but not on its boundaries, the curve will be self-intersecting (yielding a loop). We won't know where it self-intersects (in terms of t values), but we are guaranteed that it does.

      -
    2. -
    3. ...on the left (red) edge of the red zone, the curve will have a cusp. We again don't know where, but we know there is one. This edge is described by the function:

      - - -
    4. -
    5. ...on the almost circular, lower right (pink) edge, the curve's end point touches the curve, forming a loop. This edge is described by the function:

      - - -
    6. -
    7. ...on the top (blue) edge, the curve's start point touches the curve, forming a loop. This edge is described by the function:

      - - -
    8. -
    9. ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

      -
    10. -
    11. ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two inflections (switching from concave to convex and then back again, or from convex to concave and then back again).

      -
    12. -
    13. ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

      -
    14. -
    -

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    -
    + + +
  • +

    ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    +
  • +
  • +

    + ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two + inflections (switching from concave to convex and then back again, or from convex to concave and then back again). +

    +
  • +
  • ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

  • + +

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    +
    +

    Wait, where do those lines come from?

    +

    + Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, + and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield + formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic + expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the + properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval. +

    +

    + The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general + cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to + it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general + curve does over the interval t=0 to t=1. +

    +

    + For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you + can probably follow along with this paper, although you might have to read it a few times before all the bits "click". +

    +
    -

    Wait, where do those lines come from?

    -

    Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval.

    -

    The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general curve does over the interval t=0 to t=1.

    -

    For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you can probably follow along with this paper, although you might have to read it a few times before all the bits "click".

    -
    - -

    So now the question becomes: how do we manipulate our curve so that it fits this canonical form, with three fixed points, and one "free" point? Enter linear algebra. Don't worry, I'll be doing all the math for you, as well as show you what the effect is on our curves, but basically we're going to be using linear algebra, rather than calculus, because "it's way easier". Sometimes a calculus approach is very hard to work with, when the equivalent geometrical solution is super obvious.

    -

    The approach is going to start with a curve that doesn't have all-colinear points (so we need to make sure the points don't all fall on a straight line), and then applying three graphics operations that you will probably have heard of: translation (moving all points by some fixed x- and y-distance), scaling (multiplying all points by some x and y scale factor), and shearing (an operation that turns rectangles into parallelograms).

    -

    Step 1: we translate any curve by -p1.x and -p1.y, so that the curve starts at (0,0). We're going to make use of an interesting trick here, by pretending our 2D coordinates are 3D, with the z coordinate simply always being 1. This is an old trick in graphics to overcome the limitations of 2D transformations: without it, we can only turn (x,y) coordinates into new coordinates of the form (ax + by, cx + dy), which means we can't do translation, since that requires we end up with some kind of (x + a, y + b). If we add a bogus z coordinate that is always 1, then we can suddenly add arbitrary values. For example:

    - - -

    Sweet! z stays 1, so we can effectively ignore it entirely, but we added some plain values to our x and y coordinates. So, if we want to subtract p1.x and p1.y, we use:

    - - -

    Running all our coordinates through this transformation gives a new set of coordinates, let's call those U, where the first coordinate lies on (0,0), and the rest is still somewhat free. Our next job is to make sure point 2 ends up lying on the x=0 line, so what we want is a transformation matrix that, when we run it, subtracts x from whatever x we currently have. This is called shearing, and the typical x-shear matrix and its transformation looks like this:

    - - -

    So we want some shearing value that, when multiplied by y, yields -x, so our x coordinate becomes zero. That value is simply -x/y, because *-x/y * y = -x*. Done:

    - - -

    Now, running this on all our points generates a new set of coordinates, let's call those V, which now have point 1 on (0,0) and point 2 on (0, some-value), and we wanted it at (0,1), so we need to do some scaling to make sure it ends up at (0,1). Additionally, we want point 3 to end up on (1,1), so we can also scale x to make sure its x-coordinate will be 1 after we run the transform. That means we'll be x-scaling by 1/point3x, and y-scaling by point2y. This is really easy:

    - - -

    Then, finally, this generates a new set of coordinates, let's call those W, of which point 1 lies on (0,0), point 2 lies on (0,1), and point three lies on (1, ...) so all that's left is to make sure point 3 ends up at (1,1) - but we can't scale! Point 2 is already in the right place, and y-scaling would move it out of (0,1) again, so our only option is to y-shear point three, just like how we x-sheared point 2 earlier. In this case, we do the same trick, but with y/x rather than x/y because we're not x-shearing but y-shearing. Additionally, we don't actually want to end up at zero (which is what we did before) so we need to shear towards an offset, in this case 1:

    - - -

    And this generates our final set of four coordinates. Of these, we already know that points 1 through 3 are (0,0), (0,1) and (1,1), and only the last coordinate is "free". In fact, given any four starting coordinates, the resulting "transformation mapped" coordinate will be:

    - - -

    Okay, well, that looks plain ridiculous, but: notice that every coordinate value is being offset by the initial translation, and also notice that a lot of terms in that expression are repeated. Even though the maths looks crazy as a single expression, we can just pull this apart a little and end up with an easy-to-calculate bit of code!

    -

    First, let's just do that translation step as a "preprocessing" operation so we don't have to subtract the values all the time. What does that leave?

    - - -

    Suddenly things look a lot simpler: the mapped x is fairly straight forward to compute, and we see that the mapped y actually contains the mapped x in its entirety, so we'll have that part already available when we need to evaluate it. In fact, let's pull out all those common factors to see just how simple this is:

    - - -

    That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it look!

    -
    + +

    + That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it + look! +

    +
    +

    How do you track all that?

    +

    + Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply + use Mathematica. Tracking all this math by hand is insane, and we invented computers, + literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the + program do the math for me. And real math, too, with symbols, not with numbers. In fact, + here's the Mathematica notebook if you want to see how + this works for yourself. +

    +

    + Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's + $344 for home use, up from $295 when I original wrote this, but it's + also free when you buy a $35 raspberry pi. Obviously, I bought a + raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do + it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for. +

    +
    -

    How do you track all that?

    -

    Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply use Mathematica. Tracking all this math by hand is insane, and we invented computers, literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the program do the math for me. And real math, too, with symbols, not with numbers. In fact, here's the Mathematica notebook if you want to see how this works for yourself.

    -

    Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's $344 for home use, up from $295 when I original wrote this, but it's also free when you buy a $35 raspberry pi. Obviously, I bought a raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for.

    -
    - -

    So, let's write up a sketch that'll show us the canonical form for any curve drawn in blue, overlaid on our canonical map, so that we can immediately tell which features our curve must have, based on where the fourth coordinate is located on the map:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - -
    -
    -

    Finding Y, given X

    -

    One common task that pops up in things like CSS work, or parametric equalizers, or image leveling, or any other number of applications where Bézier curves are used as control curves in a way that there is really only ever one "y" value associated with one "x" value, you might want to cut out the middle man, as it were, and compute "y" directly based on "x". After all, the function looks simple enough, finding the "y" value should be simple too, right? Unfortunately, not really. However, it is possible and as long as you have some code in place to help, it's not a lot of a work either.

    -

    We'll be tackling this problem in two stages: the first, which is the hard part, is figuring out which "t" value belongs to any given "x" value. For instance, have a look at the following graphic. On the left we have a Bézier curve that looks for all intents and purposes like it fits our criteria: every "x" has one and only one associated "y" value. On the right we see the function for just the "x" values: that's a cubic curve, but not a really crazy cubic curve. If you move the graphic's slider, you will see a red line drawn that corresponds to the x coordinate: this is a vertical line in the left graphic, and a horizontal line on the right.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Now, if you look more closely at that right graphic, you'll notice something interesting: if we treat the red line as "the x axis", then the point where the function crosses our line is really just a root for the cubic function x(t) through a shifted "x-axis"... and we've already seen how to calculate roots, so let's just run cubic root finding - and not even the complicated cubic case either: because of the kind of curve we're starting with, we know there is only root, simplifying the code we need!

    -

    First, let's look at the function for x(t):

    - - -

    We can rewrite this to a plain polynomial form, by just fully writing out the expansion and then collecting the polynomial factors, as:

    - - -

    Nothing special here: that's a standard cubic polynomial in "power" form (i.e. all the terms are ordered by their power of t). So, given that a, b, c, d, and x(t) are all known constants, we can trivially rewrite this (by moving the x(t) across the equal sign) as:

    - - -

    You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using Cardano's algorithm, and we're left with some rather short code:

    + +

    + You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they + all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using + Cardano's algorithm, and we're left with some rather short code: +

    - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So the procedure is fairly straight forward: pick an x, find the associated t value, evaluate our curve for that t value, which gives us the curve's {x,y} coordinate, which means we know y for this x. Move the slider for the following graphic to see this in action:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - +y = curve.get(t).y + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Arc length

    -

    How long is a Bézier curve? As it turns out, that's not actually an easy question, because the answer requires maths that —much like root finding— cannot generally be solved the traditional way. If we have a parametric curve with fx(t) and fy(t), then the length of the curve, measured from start point to some point t = z, is computed using the following seemingly straight forward (if a bit overwhelming) formula:

    - - -

    or, more commonly written using Leibnitz notation as:

    - - -

    This formula says that the length of a parametric curve is in fact equal to the area underneath a function that looks a remarkable amount like Pythagoras' rule for computing the diagonal of a straight angled triangle. This sounds pretty simple, right? Sadly, it's far from simple... cutting straight to after the chase is over: for quadratic curves, this formula generates an unwieldy computation, and we're simply not going to implement things that way. For cubic Bézier curves, things get even more fun, because there is no "closed form" solution, meaning that due to the way calculus works, there is no generic formula that allows you to calculate the arc length. Let me just repeat this, because it's fairly crucial: for cubic and higher Bézier curves, there is no way to solve this function if you want to use it "for all possible coordinates".

    -

    Seriously: It cannot be done.

    -

    So we turn to numerical approaches again. The method we'll look at here is the Gauss quadrature. This approximation is a really neat trick, because for any nth degree polynomial it finds approximated values for an integral really efficiently. Explaining this procedure in length is way beyond the scope of this page, so if you're interested in finding out why it works, I can recommend the University of South Florida video lecture on the procedure, linked in this very paragraph. The general solution we're looking for is the following:

    - - -

    In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the better the approximation:

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    + +

    + In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips + sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The + more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the + better the approximation: +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    -

    Now, infinitely many terms to sum and infinitely thin rectangles are not something that computers can work with, so instead we're going to approximate the infinite summation by using a sum of a finite number of "just thin" rectangular strips. As long as we use a high enough number of thin enough rectangular strips, this will give us an approximation that is pretty close to what the real value is.

    -

    So, the trick is to come up with useful rectangular strips. A naive way is to simply create n strips, all with the same width, but there is a far better way using special values for C and f(t) depending on the value of n, which indicates how many strips we'll use, and it's called the Legendre-Gauss quadrature.

    -

    This approach uses strips that are not spaced evenly, but instead spaces them in a special way based on describing the function as a polynomial (the more strips, the more accurate the polynomial), and then computing the exact integral for that polynomial. We're essentially performing arc length computation on a flattened curve, but flattening it based on the intervals dictated by the Legendre-Gauss solution.

    -
    - -

    Note that one requirement for the approach we'll use is that the integral must run from -1 to 1. That's no good, because we're dealing with Bézier curves, and the length of a section of curve applies to values which run from 0 to "some value smaller than or equal to 1" (let's call that value z). Thankfully, we can quite easily transform any integral interval to any other integral interval, by shifting and scaling the inputs. Doing so, we get the following:

    - - -

    That may look a bit more complicated, but the fraction involving z is a fixed number, so the summation, and the evaluation of the f(t) values are still pretty simple.

    -

    So, what do we need to perform this calculation? For one, we'll need an explicit formula for f(t), because that derivative notation is handy on paper, but not when we have to implement it. We'll also need to know what these Ci and ti values should be. Luckily, that's less work because there are actually many tables available that give these values, for any n, so if we want to approximate our integral with only two terms (which is a bit low, really) then these tables would tell us that for n=2 we must use the following values:

    - - -

    Which means that in order for us to approximate the integral, we must plug these values into the approximate function, which gives us:

    - - -

    We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the Bézier curve functions Bx(t) and By(t).

    -
    + +

    + We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the + Bézier curve functions Bx(t) and By(t). +

    + -

    If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the start/end points on the inside?

    - - - Скрипты отключены. Показываем резервное изображение. - - - +

    + If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), + we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, + with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make + a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the + start/end points on the inside? +

    + + + Скрипты отключены. Показываем резервное изображение. + + + +
    +
    +

    + + Approximated arc length +

    +

    + Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length + instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This + will come with an error, but this can be made arbitrarily small by increasing the segment count. +

    +

    + If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal + effort: +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    -
    -
    -

    Approximated arc length

    -

    Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This will come with an error, but this can be made arbitrarily small by increasing the segment count.

    -

    If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal effort:

    -
    - - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - -
    - -

    You may notice that even though the error in length is actually pretty significant in absolute terms, even at a low number of segments we get a length that agrees with the true length when it comes to just the integer part of the arc length. Quite often, approximations can drastically speed things up!

    - -
    -
    -

    Curvature of a curve

    -

    If we have two curves, and we want to line them in up in a way that "looks right", what would we use as metric to let a computer decide what "looks right" means?

    -

    For instance, we can start by ensuring that the two curves share an end coordinate, so that there is no "gap" between the end of one and the start of the next curve, but that won't guarantee that things look right: both curves can be going in wildly different directions, and the resulting joined geometry will have a corner in it, rather than a smooth transition from one curve to the next.

    -

    What we want is to ensure that the curvature at the transition from one curve to the next "looks good". So, we start with a shared coordinate, and then also require that derivatives for both curves match at that coordinate. That way, we're assured that their tangents line up, which must mean the curve transition is perfectly smooth. We can even make the second, third, etc. derivatives match up for better and better transitions.

    -

    Problem solved!

    -

    However, there's a problem with this approach: if we think about this a little more, we realise that "what a curve looks like" and its derivative values are pretty much entirely unrelated. After all, the section on reordering curves showed us that the same looking curve can have an infinite number of curve expressions of arbitrarily high Bézier degree, and each of those will have wildly different derivative values.

    -

    So what we really want is some kind of expression that's not based on any particular expression of t, but is based on something that is invariant to the kind of function(s) we use to draw our curve. And the prime candidate for this is our curve expression, reparameterised for distance: no matter what order of Bézier curve we use, if we were able to rewrite it as a function of distance-along-the-curve, all those different degree Bézier functions would end up being the same function for "coordinate at some distance D along the curve".

    -

    We've seen this before... that's the arc length function.

    -

    So you might think that in order to find the curvature of a curve, we now need to solve the arc length function itself, and that this would be quite a problem because we just saw that there is no way to actually do that. Thankfully, we don't. We only need to know the form of the arc length function, which we saw above and is fairly simple, rather than needing to solve the arc length function. If we start with the arc length expression and the run through the steps necessary to determine its derivative (with an alternative, shorter demonstration of how to do this found over on Stackexchange), then the integral that was giving us so much problems in solving the arc length function disappears entirely (because of the fundamental theorem of calculus), and what we're left with us some surprisingly simple maths that relates curvature (denoted as κ, "kappa") to—and this is the truly surprising bit—a specific combination of derivatives of our original function.

    -

    Let me highlight what just happened, because it's pretty special:

    -
      -
    1. we wanted to make curves line up, and initially thought to match the curves' derivatives, but
    2. -
    3. that turned out to be a really bad choice, so instead
    4. -
    5. we picked a function that is basically impossible to work with, and then worked with that, which
    6. -
    7. gives us a simple formula that is and expression using the curves' derivatives.
    8. -
    -

    That's crazy!

    -

    But that's also one of the things that makes maths so powerful: even if your initial ideas are off the mark, you might be much closer than you thought you were, and the journey from "thinking we're completely wrong" to "actually being remarkably close to being right" is where we can find a lot of insight.

    -

    So, what does the function look like? This:

    - - -

    Which is really just a "short form" that glosses over the fact that we're dealing with functions of t, so let's expand that a tiny bit:

    - - -

    And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so evaluating this function for any t value is just a matter of basic arithematics.

    -

    In fact, let's just implement it right now:

    + +

    + And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any + (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that + looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier + curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so + evaluating this function for any t value is just a matter of basic arithematics. +

    +

    In fact, let's just implement it right now:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    That was easy! (Well okay, that "not a number" value will need to be taken into account by downstream code, but that's a reality of programming anyway)

    -

    With all of that covered, let's line up some curves! The following graphic gives you two curves that look identical, but use quadratic and cubic functions, respectively. As you can see, despite their derivatives being necessarily different, their curvature (thanks to being derived based on maths that "ignores" specific function derivative, and instead gives a formula that smooths out any differences) is exactly the same. And because of that, we can put them together such that the point where they overlap has the same curvature for both curves, giving us the smoothest transition.

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    One thing you may have noticed in this sketch is that sometimes the curvature looks fine, but seems to be pointing in the wrong direction, making it hard to line up the curves properly. A way around that, of course, is to show the curvature on both sides of the curve, so let's just do that. But let's take it one step further: we can also compute the associated "radius of curvature", which gives us the implicit circle that "fits" the curve's curvature at any point, using what is possibly the simplest bit of maths found in this entire primer:

    - - -

    So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" our curve at some point that we can control by using a slider:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - + +

    + So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" + our curve at some point that we can control by using a slider: +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    +
    +

    + + Tracing a curve at fixed distance intervals +

    +

    + Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance + intervals over time, like a train along a track, and you want to use Bézier curves. +

    +

    Now you have a problem.

    +

    + The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a + track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick + t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In + fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another. +

    +

    + The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be + straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To + make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for + this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length + function, which can have more inflection points. +

    + + + Скрипты отключены. Показываем резервное изображение. + + +

    + So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through + the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and + then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of + points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two + points, but if we have a high enough number of samples, we don't even need to bother. +

    +

    + So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t + curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks + like, by coloring each section of curve between two distance markers differently: +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    +

    + However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to + constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT). +

    +
    +
    +

    + + Intersections +

    +

    + Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to + work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to + perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box + overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the + actual curve. +

    +

    + We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by + implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both + lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution. +

    +

    Line-line intersections

    +

    + If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are + an intervals on by linear algebra, using the procedure outlined in this + top coder + article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line + segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments. +

    +

    + The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on + (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection + point). +

    + + + Скрипты отключены. Показываем резервное изображение. + + + +
    +

    Implementing line-line intersections

    +

    + Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, + but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe + you're using point structs for the line. Let's get coding: +

    -
    -
    -

    Tracing a curve at fixed distance intervals

    -

    Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance intervals over time, like a train along a track, and you want to use Bézier curves.

    -

    Now you have a problem.

    -

    The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another.

    -

    The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length function, which can have more inflection points.

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two points, but if we have a high enough number of samples, we don't even need to bother.

    -

    So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks like, by coloring each section of curve between two distance markers differently:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    -

    However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT).

    - -
    -
    -

    Intersections

    -

    Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the actual curve.

    -

    We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution.

    -

    Line-line intersections

    -

    If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are an intervals on by linear algebra, using the procedure outlined in this top coder article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments.

    -

    The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection point).

    - - - Скрипты отключены. Показываем резервное изображение. - - - -
    - -

    Implementing line-line intersections

    -

    Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe you're using point structs for the line. Let's get coding:

    - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    - -
    + return lli4(line1.p1, line1.p2, line2.p1, line2.p2) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + -

    What about curve-line intersections?

    -

    Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the section on finding extremities.

    -
    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -
    +

    What about curve-line intersections?

    +

    + Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we + translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in + a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a + curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the + section on finding extremities. +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + Скрипты отключены. Показываем резервное изображение. + + + +
    -

    Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and curve splitting.

    +

    + Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the + curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and + curve splitting. +

    +
    +
    +

    + + Curve/curve intersection +

    +

    + Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" + technique: +

    +
      +
    1. + Take two curves C1 and C2, and treat them as a pair. +
    2. +
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. +
    5. + With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2). +
    6. +
    7. + For each pair, check whether their bounding boxes overlap. +
        +
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. +
      3. If there is overlap, rerun all steps for this pair.
      4. +
      +
    8. +
    9. + Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we + might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value + (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative). +
    10. +
    +

    + This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then + taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections. +

    +

    + The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click + the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value + that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the + algorithm, so you can try this with lots of different curves. +

    +

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + +

    + Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into + two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at + t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each + iteration, and each successive step homing in on the curve's self-intersection points. +

    +
    +
    +

    + + The projection identity +

    +

    + De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to + draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. + Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point + around to change the curve's shape. +

    +

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    +

    + In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want + to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know + has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for + reasons that will become obvious. +

    +

    + So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following + graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which + taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what + happens to that value? +

    +
    + + + Скрипты отключены. Показываем резервное изображение. + + + + + + + + Скрипты отключены. Показываем резервное изображение. + + + + + +
    -
    -
    -

    Curve/curve intersection

    -

    Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" technique:

    -
      -
    1. Take two curves C1 and C2, and treat them as a pair.
    2. -
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. -
    5. With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2).
    6. -
    7. For each pair, check whether their bounding boxes overlap.
        -
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. -
      3. If there is overlap, rerun all steps for this pair.
      4. -
      -
    8. -
    9. Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative).
    10. -
    -

    This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections.

    -

    The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the algorithm, so you can try this with lots of different curves.

    -

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - -

    Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each iteration, and each successive step homing in on the curve's self-intersection points.

    - -
    -
    -

    The projection identity

    -

    De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point around to change the curve's shape.

    -

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    -

    In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for reasons that will become obvious.

    -

    So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what happens to that value?

    -
    - - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - -
    - -

    So these graphics show us several things:

    -
      -
    1. a point at the tip of the curve construction's "hat": let's call that A, as well as
    2. -
    3. our on-curve point give our chosen t value: let's call that B, and finally,
    4. -
    5. a point that we get by projecting A, through B, onto the line between the curve's start and end points: let's call that C.
    6. -
    7. for both quadratic and cubic curves, two points e1 and e2, which represent the single-to-last step in de Casteljau's algorithm: in the last step, we find B at (1-t) * e1 + t * e2.
    8. -
    9. for cubic curves, also the points v1 and v2, which together with A represent the first step in de Casteljau's algorithm: in the next step, we find e1 and e2.
    10. -
    -

    These three values A, B, and C allow us to derive an important identity formula for quadratic and cubic Bézier curves: for any point on the curve with some t value, the ratio of distances from A to B and B to C is fixed: if some t value sets up a C that is 20% away from the start and 80% away from the end, then it doesn't matter where the start, end, or control points are; for that t value, C will always lie at 20% from the start and 80% from the end point. Go ahead, pick an on-curve point in either graphic and then move all the other points around: if you only move the control points, start and end won't move, and so neither will C, and if you move either start or end point, C will move but its relative position will not change.

    -

    So, how can we compute C? We start with our observation that C always lies somewhere between the start and end points, so logically C will have a function that interpolates between those two coordinates:

    - - -

    If we can figure out what the function u(t) looks like, we'll be done. Although we do need to remember that this u(t) will have a different form depending on whether we're working with quadratic or cubic curves. Running through the maths (with thanks to Boris Zbarsky) shows us the following two formulae:

    - - -

    And

    - - -

    So, if we know the start and end coordinates and the t value, we know C without having to calculate the A or even B coordinates. In fact, we can do the same for the ratio function. As another function of t, we technically don't need to know what A or B or C are. It, too, can be expressed as a pure function of t.

    -

    We start by observing that, given A, B, and C, the following always holds:

    - - -

    Working out the maths for this, we see the following two formulae for quadratic and cubic curves:

    - - -

    And

    - - -

    Which now leaves us with some powerful tools: given three points (start, end, and "some point on the curve"), as well as a t value, we can construct curves. We can compute C using the start and end points and our u(t) function, and once we have C, we can use our on-curve point (B) and the ratio(t) function to find A:

    - - -

    With A found, finding e1 and e2 for quadratic curves is a matter of running the linear interpolation with t between start and A to yield e1, and between A and end to yield e2. For cubic curves, there is no single pair of points that can act as e1 and e2 (there are infinitely many, because the tangent at B is a free parameter for cubic curves) so as long as the distance ratio between e1 to B and B to e2 is the Bézier ratio (1-t):t, we are free to pick any pair, after which we can reverse engineer v1 and v2:

    - - -

    And then reverse engineer the curve's control points:

    - - -

    So: if we have a curve's start and end points, as well as some third point B that we want the curve to pass through, then for any t value we implicitly know all the ABC values, which (combined with an educated guess on appropriate e1 and e2 coordinates for cubic curves) gives us the necessary information to reconstruct a curve's "de Casteljau skeleton". Which means that we can now do several things: we can "fit" curves using only three points, which means we can also "mold" curves by moving an on-curve point but leaving its start and end points, and then reconstruct the curve based on where we moved the on-curve point to. These are very useful things, and we'll look at both in the next few sections.

    - -
    -
    -

    Creating a curve from three points

    -

    Given the preceding section, you might be wondering if we can use that knowledge to just "create" curves by placing some points and having the computer do the rest, to which the answer is: that's exactly what we can now do!

    -

    For quadratic curves, things are pretty easy. Technically, we'll need a t value in order to compute the ratio function used in computing the ABC coordinates, but we can just as easily approximate one by treating the distance between the start and B point, and B and end point as a ratio, using

    - - -

    With this code in place, creating a quadratic curve from three points is literally just computing the ABC values, and using A as our curve's control point:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    For cubic curves we need to do a little more work, but really only just a little. We're first going to assume that a decent curve through the three points should approximate a circular arc, which first requires knowing how to fit a circle to three points. You may remember (if you ever learned it!) that a line between two points on a circle is called a chord, and that one property of chords is that the line from the center of any chord, perpendicular to that chord, passes through the center of the circle.

    -

    That means that if we have three points on a circle, we have three (different) chords, and consequently, three (different) lines that go from those chords through the center of the circle: if we find two of those lines, then their intersection will be our circle's center, and the circle's radius will—by definition!—be the distance from the center to any of our three points:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    With that covered, we now also know the tangent line to our point B, because the tangent to any point on the circle is a line through that point, perpendicular to the line from that point to the center. That just leaves marking appropriate points e1 and e2 on that tangent, so that we can construct a new cubic curve hull. We use the approach as we did for quadratic curves to automatically determine a reasonable t value, and then our e1 and e2 coordinates must obey the standard de Casteljau rule for linear interpolation:

    - - -

    Where d is the total length of the line segment from e1 to e2. So how long do we make that? There are again all kinds of approaches we can take, and a simple-but-effective one is to set the length of that segment to "one third the length of the baseline". This forces e1 and e2 to always be the "linear curve" distance apart, which means if we place our three points on a line, it will actually look like a line. Nice! The last thing we'll need to do is make sure to flip the sign of d depending on which side of the baseline our B is located, so we don't end up creating a funky curve with a loop in it. To do this, we can use the atan2 function:

    - - -

    This angle φ will be between 0 and π if B is "above" the baseline (rotating all three points so that the start is on the left and the end is the right), so we can use a relatively straight forward check to make sure we're using the correct sign for our value d:

    - - -

    The result of this approach looks as follows:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; more on that later), we're only trying to come up with some reasonable e1 and e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives us:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    That looks perfectly serviceable!

    -

    Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at implementing curve molding in the section after that, so read on!

    + +

    The result of this approach looks as follows:

    + + + Скрипты отключены. Показываем резервное изображение. + + + +

    + It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, + we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; + more on that later), we're only trying to come up with some reasonable e1 and + e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives + us: +

    + + + Скрипты отключены. Показываем резервное изображение. + + + +

    That looks perfectly serviceable!

    +

    + Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" + curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding + the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at + implementing curve molding in the section after that, so read on! +

    +
    +
    +

    + + Projecting a point onto a Bézier curve +

    +

    + Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're + trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the + curve our cursor will be closest to. So, how do we project points onto a curve? +

    +

    + If the Bézier curve is of low enough order, we might be able to + work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a + numerical approach again. We'll be finding our ideal t value using a + binary search. First, we do a coarse distance-check based on + t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast: +

    -
    -
    -

    Projecting a point onto a Bézier curve

    -

    Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the curve our cursor will be closest to. So, how do we project points onto a curve?

    -

    If the Bézier curve is of low enough order, we might be able to work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a numerical approach again. We'll be finding our ideal t value using a binary search. First, we do a coarse distance-check based on t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    After this runs, we know that LUT[i] is the coordinate on the curve in our LUT that is closest to the point we want to project, so that's a pretty good initial guess as to what the best projection onto our curve is. To refine it, we note that LUT[i] is a better guess than both LUT[i-1] and LUT[i+1], but there might be an even better projection somewhere else between those two values, so that's what we're going to be testing for, using a variation of the binary search.

    -
      -
    1. we start with our point p, and the t values t1=LUT[i-1].t and t2=LUT[i+1].t, which span an interval v = t2-t1.
    2. -
    3. we test this interval in five spots: the start, middle, and end (which we already have), and the two points in between the middle and start/end points
    4. -
    5. we then check which of these five points is the closest to our original point p, and then repeat step 1 with the points before and after the closest point we just found.
    6. -
    -

    This makes the interval we check smaller and smaller at each iteration, and we can keep running the three steps until the interval becomes so small as to lead to distances that are, for all intents and purposes, the same for all points.

    -

    So, let's see that in action: in this case, I'm going to arbitrarily say that if we're going to run the loop until the interval is smaller than 0.001, and show you what that means for projecting your mouse cursor or finger tip onto a rather complex Bézier curve (which, of course, you can reshape as you like). Also shown are the original three points that our coarse check finds.

    - - - Скрипты отключены. Показываем резервное изображение. - - - + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + -
    -
    -

    Intersections with a circle

    -

    It might seem odd to cover this subject so much later than the line/line, line/curve, and curve/curve intersection topics from several sections earlier, but the reason we can't cover circle/curve intersections is that we can't really discuss circle/curve intersection until we've covered the kind of lookup table (LUT) walking that the section on projecting a point onto a curve uses. To see why, let's look at what we would have to do if we wanted to find the intersections between a curve and a circle using calculus.

    -

    First, we observe that "finding intersections" in this case means that, given a circle defined by a center point c = (x,y) and a radius r, we want to find all points on the Bezier curve for which the distance to the circle's center point is equal to the circle radius, which by definition means those points lie on the circle, and so count as intersections. In maths, that means we're trying to solve:

    - - -

    Which seems simple enough. Unfortunately, when we expand that dist function, things get a lot more problematic:

    - - -

    And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the Abel-Ruffini theorem that we saw before, we can't solve this by just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left with a monstrous function to solve.

    -

    So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a bit more code to make sure we find all of them, but let's look at the steps involved:

    + +

    + And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the + Abel-Ruffini theorem that we saw before, we can't solve this by + just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to + actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and + all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left + with a monstrous function to solve. +

    +

    + So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the + on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance + r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a + bit more code to make sure we find all of them, but let's look at the steps involved: +

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    - -

    This is very similar to the code in the previous section, with an extra input r for the circle radius, and a minor change in the "distance for this coordinate": rather than just distance(coordinate, p) we want to know the difference between that distance and the circle radius. After all, if that difference is zero, then the distance from the coordinate to the circle center is exactly the radius, so the coordinate lies on both the curve and the circle.

    -

    So far so good.

    -

    However, we also want to make sure we find all the points, not just a single one, so we need a little more code for that:

    + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    After running this code, values will be the list of all LUT coordinates that are closest to the distance r: we can use those values to run the same kind of refinement lookup we used for point projection (with the caveat that we're now not checking for smallest distance, but for "distance closest to r"), and we'll have all our intersection points. Of course, that does require explaining what findClosest does: rather than looking for a global minimum, we're now interested in finding a local minimum, so instead of checking a single point and looking at its distance value, we check three points ("current", "previous" and "before previous") and then check whether they form a local minimum:

    + start = i + 2; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    In words: given a start index, the circle center and radius, and our LUT, we check where (closest to our start index) we can find a local minimum for the difference between "the distance from the curve to the circle center", and the circle's radius. We track this by looking at three values (associated with the indices index-2, index-1, and index), and we know we've found a local minimum if the three values show that the middle value (pd1) is less than either value beside it. When we do, we can set our "best guess, relative to start" as index-1. Of course, since we're now checking values relative to some start value, we might not find another candidate value at all, in which case we return start - 1, so that a simple "is the result less than start?" lets us determine that there are no more intersections to find.

    -

    Finally, while not necessary for point projection, there is one more step we need to perform when we run the binary refinement function on our candidate LUT indices, because we've so far only been testing using distances "closest to the radius of the circle", and that's actually not good enough... we need distances that are the radius of the circle. So, after running the refinement for each of these indices, we need to discard any final value that isn't the circle radius. And because we're working with floating point numbers, what this really means is that we need to discard any value that's a pixel or more "off". Or, if we want to get really fancy, "some small epsilon value".

    -

    Based on all of that, the following graphic shows this off for the standard cubic curve (which you can move the coordinates around for, of course) and a circle with a controllable radius centered on the graphic's center, using the code approach described above.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    And of course, for the full details, click that "view source" link.

    + return start + i + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + -
    -
    -

    Molding a curve

    -

    Armed with knowledge of the "ABC" relation, point-on-curve projection, and guestimating reasonable looking helper values for cubic curve construction, we can finally cover curve molding: updating a curve's shape interactively, by dragging points on the curve around.

    -

    For quadratic curve, this is a really simple trick: we project our cursor onto the curve, which gives us a t value and initial B coordinate. We don't even need the latter: with our t value and "wherever the cursor is" as target B, we can compute the associated C:

    - - -

    And then the associated A:

    - - -

    And we're done, because that's our new quadratic control point!

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    As before, cubic curves are a bit more work, because while it's easy to find our initial t value and ABC values, getting those all-important e1 and e2 coordinates is going to pose a bit of a problem... in the section on curve creation, we were free to pick an appropriate t value ourselves, which allowed us to find appropriate e1 and e2 coordinates. That's great, but when we're curve molding we don't have that luxury: whatever point we decide to start moving around already has its own t value, and its own e1 and e2 values, and those may not make sense for the rest of the curve.

    -

    For example, let's see what happens if we just "go with what we get" when we pick a point and start moving it around, preserving its t value and e1/e2 coordinates:

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    That looks reasonable, close to the original point, but the further we drag our point, the less "useful" things become. Especially if we drag our point across the baseline, rather than turning into a nice curve.

    -

    One way to combat this might be to combine the above approach with the approach from the creating curves section: generate both the "unchanged t/e1/e2" curve, as well as the "idealized" curve through the start/cursor/end points, with idealized t value, and then interpolating between those two curves:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    The slide controls the "falloff distance" relative to where the original point on the curve is, so that as we drag our point around, it interpolates with a bias towards "preserving t/e1/e2" closer to the original point, and bias towards "idealized" form the further away we move our point, with anything that's further than our falloff distance simply being the idealized curve. We don't even try to interpolate at that point.

    -

    A more advanced way to try to smooth things out is to implement continuous molding, where we constantly update the curve as we move around, and constantly change what our B point is, based on constantly projecting the cursor on the curve as we're updating it - this is, you won't be surprised to learn, tricky, and beyond the scope of this section: interpolation (with a reasonable distance) will do for now!

    - -
    -
    -

    Curve fitting

    -

    Given the previous section, one question you might have is "what if I don't want to guess t values?". After all, plenty of graphics packages do automated curve fitting, so how can we implement that in a way that just finds us reasonable t values all on its own?

    -

    And really this is just a variation on the question "how do I get the curve through these X points?", so let's look at that. Specifically, let's look at the answer: "curve fitting". This is in fact a rather rich field in geometry, applying to anything from data modelling to path abstraction to "drawing", so there's a fair number of ways to do curve fitting, but we'll look at one of the most common approaches: something called a least squares polynomial regression. In this approach, we look at the number of points we have in our data set, roughly determine what would be an appropriate order for a curve that would fit these points, and then tackle the question "given that we want an nth order curve, what are the coordinates we can find such that our curve is "off" by the least amount?".

    -

    Now, there are many ways to determine how "off" points are from the curve, which is where that "least squares" term comes in. The most common tool in the toolbox is to minimise the squared distance between each point we have, and the corresponding point on the curve we end up "inventing". A curve with a snug fit will have zero distance between those two, and a bad fit will have non-zero distances between every such pair. It's a workable metric. You might wonder why we'd need to square, rather than just ensure that distance is a positive value (so that the total error is easy to compute by just summing distances) and the answer really is "because it tends to be a little better". There's lots of literature on the web if you want to deep-dive the specific merits of least squared error metrics versus least absolute error metrics, but those are well beyond the scope of this material.

    -

    So let's look at what we end up with in terms of curve fitting if we start with the idea of performing least squares Bézier fitting. We're going to follow a procedure similar to the one described by Jim Herold over on his "Least Squares Bézier Fit" article, and end with some nice interactive graphics for doing some curve fitting.

    -

    Before we begin, we're going to use the curve in matrix form. In the section on matrices, I mentioned that some things are easier if we use the matrix representation of a Bézier curve rather than its calculus form, and this is one of those things.

    -

    As such, the first step in the process is expressing our Bézier curve as powers/coefficients/coordinate matrix T x M x C, by expanding the Bézier functions.

    -
    - -

    Revisiting the matrix representation

    -

    Rewriting Bézier functions to matrix form is fairly easy, if you first expand the function, and then arrange them into a multiple line form, where each line corresponds to a power of t, and each column is for a specific coefficient. First, we expand the function:

    - - -

    And then we (trivially) rearrange the terms across multiple lines:

    - - -

    This rearrangement has "factors of t" at each row (the first row is t⁰, i.e. "1", the second row is t¹, i.e. "t", the third row is t²) and "coefficient" at each column (the first column is all terms involving "a", the second all terms involving "b", the third all terms involving "c").

    -

    With that arrangement, we can easily decompose this as a matrix multiplication:

    - - -

    We can do the same for the cubic curve, of course. We know the base function for cubics:

    - - -

    So we write out the expansion and rearrange:

    - - -

    Which we can then decompose:

    - - -

    And, of course, we can do this for quartic curves too (skipping the expansion step):

    - - -

    And so and on so on. Now, let's see how to use these T, M, and C, to do some curve fitting.

    -
    + +

    + And so and on so on. Now, let's see how to use these T, M, and C, to do some curve + fitting. +

    + -

    Let's get started: we're going to assume we picked the right order curve: for n points we're fitting an n-1th order curve, so we "start" with a vector P that represents the coordinates we already know, and for which we want to do curve fitting:

    - - -

    Next, we need to figure out appropriate t values for each point in the curve, because we need something that lets us tie "the actual coordinate" to "some point on the curve". There's a fair number of different ways to do this (and a large part of optimizing "the perfect fit" is about picking appropriate t values), but in this case let's look at two "obvious" choices:

    -
      -
    1. equally spaced t values, and
    2. -
    3. t values that align with distance along the polygon.
    4. -
    -

    The first one is really simple: if we have n points, then we'll just assign each point i a t value of (i-1)/(n-1). So if we have four points, the first point will have t=(1-1)/(4-1)=0/3, the second point will have t=(2-1)/(4-1)=1/3, the third point will have t=2/3, and the last point will be t=1. We're just straight up spacing the t values to match the number of points we have.

    -

    The second one is a little more interesting: since we're doing polynomial regression, we might as well exploit the fact that our base coordinates just constitute a collection of line segments. At the first point, we're fixing t=0, and the last point, we want t=1, and anywhere in between we're simply going to say that t is equal to the distance along the polygon, scaled to the [0,1] domain.

    -

    To get these values, we first compute the general "distance along the polygon" matrix:

    - - -

    Where length() is literally just that: the length of the line segment between the point we're looking at, and the previous point. This isn't quite enough, of course: we still need to make sure that all the values between i=1 and i=n fall in the [0,1] interval, so we need to scale all values down by whatever the total length of the polygon is:

    - - -

    And now we can move on to the actual "curve fitting" part: what we want is a function that lets us compute "ideal" control point values such that if we build a Bézier curve with them, that curve passes through all our original points. Or, failing that, have an overall error distance that is as close to zero as we can get it. So, let's write out what the error distance looks like.

    -

    As mentioned before, this function is really just "the distance between the actual coordinate, and the coordinate that the curve evaluates to for the associated t value", which we'll square to get rid of any pesky negative signs:

    - - -

    Since this function only deals with individual coordinates, we'll need to sum over all coordinates in order to get the full error function. So, we literally just do that; the total error function is simply the sum of all these individual errors:

    - - -

    And here's the trick that justifies using matrices: while we can work with individual values using calculus, with matrices we can compute as many values as we make our matrices big, all at the "same time", We can replace the individual terms pi with the full P coordinate matrix, and we can replace Bézier(si) with the matrix representation T x M x C we talked about before, which gives us:

    - - -

    In which we can replace the rather cumbersome "squaring" operation with a more conventional matrix equivalent:

    - - -

    Here, the letter T is used instead of the number 2, to represent the matrix transpose; each row in the original matrix becomes a column in the transposed matrix instead (row one becomes column one, row two becomes column two, and so on).

    -

    This leaves one problem: T isn't actually the matrix we want: we don't want symbolic t values, we want the actual numerical values that we computed for S, so we need to form a new matrix, which we'll call 𝕋, that makes use of those, and then use that 𝕋 instead of T in our error function:

    - - -

    Which, because of the first and last values in S, means:

    - - -

    Now we can properly write out the error function as matrix operations:

    - - -

    So, we have our error function: we now need to figure out the expression for where that function has minimal value, e.g. where the error between the true coordinates and the coordinates generated by the curve fitting is smallest. Like in standard calculus, this requires taking the derivative, and determining where that derivative is zero:

    - - -
    + +
    +

    Where did this derivative come from?

    +

    + That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I + straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know. +

    +

    + So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific + case, I + posted a question + to Math.stackexchange, and received a answer that goes into way more detail than I had + hoped to receive. +

    +

    + Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean + maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that + true?". There are answers. They might just require some time to come to understand. +

    +
    -

    Where did this derivative come from?

    -

    That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know.

    -

    So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific case, I posted a question to Math.stackexchange, and received a answer that goes into way more detail than I had hoped to receive.

    -

    Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that true?". There are answers. They might just require some time to come to understand.

    -
    - -

    Now, given the above derivative, we can rearrange the terms (following the rules of matrix algebra) so that we end up with an expression for C:

    - - -

    Here, the "to the power negative one" is the notation for the matrix inverse. But that's all we have to do: we're done. Starting with P and inventing some t values based on the polygon the coordinates in P define, we can compute the corresponding Bézier coordinates C that specify a curve that goes through our points. Or, if it can't go through them exactly, as near as possible.

    -

    So before we try that out, how much code is involved in implementing this? Honestly, that answer depends on how much you're going to be writing yourself. If you already have a matrix maths library available, then really not that much code at all. On the other hand, if you are writing this from scratch, you're going to have to write some utility functions for doing your matrix work for you, so it's really anywhere from 50 lines of code to maybe 200 lines of code. Not a bad price to pay for being able to fit curves to pre-specified coordinates.

    -

    So let's try it out! The following graphic lets you place points, and will start computing exact-fit curves once you've placed at least three. You can click for more points, and the code will simply try to compute an exact fit using a Bézier curve of the appropriate order. Four points? Cubic Bézier. Five points? Quartic. And so on. Of course, this does break down at some point: depending on where you place your points, it might become mighty hard for the fitter to find an exact fit, and things might actually start looking horribly off once there's enough points for compound floating point rounding errors to start making a difference (which is around 10~11 points).

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - -

    You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be able to freely manipulate them and see what the effect on your curve is.

    - -
    -
    -

    Bézier curves and Catmull-Rom curves

    -

    Taking an excursion to different splines, the other common design curve is the Catmull-Rom spline, which unlike Bézier curves pass through each control point, so they offer a kind of "built-in" curve fitting.

    -

    In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what looks like a single Catmull-Rom curve is actually a spline: a single curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is defined by the point's coordinates, and the tangent for those points, the latter of which can trivially be derived from knowing the previous and next point:

    - + +

    + You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio + along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get + direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be + able to freely manipulate them and see what the effect on your curve is. +

    +
    +
    +

    + + Bézier curves and Catmull-Rom curves +

    +

    + Taking an excursion to different splines, the other common design curve is the + Catmull-Rom spline, which unlike Bézier curves + pass through each control point, so they offer a kind of "built-in" curve fitting. +

    +

    + In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets + you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the + tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next. +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    + Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what + looks like a single Catmull-Rom curve is actually a spline: a single + curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, + and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is + defined by the point's coordinates, and the tangent for those points, the latter of which + can trivially be derived from knowing the + previous and next point: +

    + - -

    One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on that in the next section.

    -

    In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of variations that make things considerable more complex):

    + +

    + One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up + with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent + should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on + that in the next section. +

    +

    + In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of + variations that make things + considerable more + complex): +

    - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Now, since a Catmull-Rom curve is a form of cubic Hermite spline, and as cubic Bézier curves are also a form of cubic Hermite spline, we run into an interesting bit of maths programming: we can convert one to the other and back, and the maths for doing so is surprisingly simple!

    -

    The main difference between Catmull-Rom curves and Bézier curves is "what the points mean":

    -
      -
    • A cubic Bézier curve is defined by a start point, a control point that implies the tangent at the start, a control point that implies the tangent at the end, and an end point, plus a characterizing matrix that we can multiply by that point vector to get on-curve coordinates.
    • -
    • A Catmull-Rom curve is defined by a start point, a tangent that for that starting point, an end point, and a tangent for that end point, plus a characteristic matrix that we can multiple by the point vector to get on-curve coordinates.
    • -
    -

    Those are very similar, so let's see exactly how similar they are. We've already see the matrix form for Bézier curves, so how different is the matrix form for Catmull-Rom curves?:

    - - -

    That's pretty dang similar. So the question is: how can we convert that expression with Catmull-Rom matrix and vector into an expression of the Bézier matrix and vector? The short answer is of course "by using linear algebra", but the longer answer is the rest of this section, and involves some maths that you may not even care for: if you just want to know the (incredibly simple) conversions between the two curve forms, feel free to skip to the end of the following explanation, but if you want to how we can get one from the other... let's get mathing!

    -
    - -

    Deriving the conversion formulae

    -

    In order to convert between Catmull-Rom curves and Bézier curves, we need to know two things. Firstly, how to express the Catmull-Rom curve using a "set of four coordinates", rather than a mix of coordinates and tangents, and secondly, how to convert those Catmull-Rom coordinates to and from Bézier form.

    -

    We start with the first part, to figure out how we can go from Catmull-Rom V coordinates to Bézier P coordinates, by applying "some matrix T". We don't know what that T is yet, but we'll get to that:

    - - -

    So, this mapping says that in order to map a Catmull-Rom "point + tangent" vector to something based on an "all coordinates" vector, we need to determine the mapping matrix such that applying T yields P2 as start point, P3 as end point, and two tangents based on the lines between P1 and P3, and P2 nd P4, respectively.

    -

    Computing T is really more "arranging the numbers":

    - - -

    Thus:

    - - -

    However, we're not quite done, because Catmull-Rom curves have that "tension" parameter, written as τ (a lowercase"tau"), which is a scaling factor for the tangent vectors: the bigger the tension, the smaller the tangents, and the smaller the tension, the bigger the tangents. As such, the tension factor goes in the denominator for the tangents, and before we continue, let's add that tension factor into both our coordinate vector representation, and mapping matrix T:

    - - -

    With the mapping matrix properly done, let's rewrite the "point + tangent" Catmull-Rom matrix form to a matrix form in terms of four coordinates, and see what we end up with:

    - - -

    Replace point/tangent vector with the expression for all-coordinates:

    - - -

    and merge the matrices:

    - - -

    This looks a lot like the Bézier matrix form, which as we saw in the chapter on Bézier curves, should look like this:

    - - -

    So, if we want to express a Catmull-Rom curve using a Bézier curve, we'll need to turn this Catmull-Rom bit:

    - - -

    Into something that looks like this:

    - - -

    And the way we do that is with a fairly straight forward bit of matrix rewriting. We start with the equality we need to ensure:

    - - -

    Then we remove the coordinate vector from both sides without affecting the equality:

    - - -

    Then we can "get rid of" the Bézier matrix on the right by left-multiply both with the inverse of the Bézier matrix:

    - - -

    A matrix times its inverse is the matrix equivalent of 1, and because "something times 1" is the same as "something", so we can just outright remove any matrix/inverse pair:

    - - -

    And now we're basically done. We just multiply those two matrices and we know what V is:

    - - -

    We now have the final piece of our function puzzle. Let's run through each step.

    -
      -
    1. Start with the Catmull-Rom function:
    2. -
    - - -
      -
    1. rewrite to pure coordinate form:
    2. -
    - - -
      -
    1. rewrite for "normal" coordinate vector:
    2. -
    - - -
      -
    1. merge the inner matrices:
    2. -
    - - -
      -
    1. rewrite for Bézier matrix form:
    2. -
    - - -
      -
    1. and transform the coordinates so we have a "pure" Bézier expression:
    2. -
    - - -

    And we're done: we finally know how to convert these two curves!

    -
    + +

    And we're done: we finally know how to convert these two curves!

    + -

    If we have a Catmull-Rom curve defined by four coordinates P1 through P4, then we can draw that curve using a Bézier curve that has the vector:

    - - -

    Similarly, if we have a Bézier curve defined by four coordinates P1 through P4, we can draw that using a standard tension Catmull-Rom curve with the following coordinate values:

    - - -

    Or, if your API allows you to specify Catmull-Rom curves using plain coordinates:

    - - - -
    -
    -

    Creating a Catmull-Rom curve from three points

    -

    Much shorter than the previous section: we saw that Catmull-Rom curves need at least 4 points to draw anything sensible, so how do we create a Catmull-Rom curve from three points?

    -

    Short and sweet: we don't.

    -

    We run through the maths that lets us create a cubic Bézier curve, and then convert its coordinates to Catmull-Rom form using the conversion formulae we saw above.

    - -
    -
    -

    Forming poly-Bézier curves

    -

    Much like lines can be chained together to form polygons, Bézier curves can be chained together to form poly-Béziers, and the only trick required is to make sure that:

    -
      -
    1. the end point of each section is the starting point of the following section, and
    2. -
    3. the derivatives across that dual point line up.
    4. -
    -

    Unless you want sharp corners, of course. Then you don't even need 2.

    -

    We'll cover three forms of poly-Bézier curves in this section. First, we'll look at the kind that just follows point 1. where the end point of a segment is the same point as the start point of the next segment. This leads to poly-Béziers that are pretty hard to work with, but they're the easiest to implement:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -

    Dragging the control points around only affects the curve segments that the control point belongs to, and moving an on-curve point leaves the control points where they are, which is not the most useful for practical modelling purposes. So, let's add in the logic we need to make things a little better. We'll start by linking up control points by ensuring that the "incoming" derivative at an on-curve point is the same as it's "outgoing" derivative:

    - - -

    We can effect this quite easily, because we know that the vector from a curve's last control point to its last on-curve point is equal to the derivative vector. If we want to ensure that the first control point of the next curve matches that, all we have to do is mirror that last control point through the last on-curve point. And mirroring any point A through any point B is really simple:

    - - -

    So let's implement that and see what it gets us. The following two graphics show a quadratic and a cubic poly-Bézier curve again, but this time moving the control points around moves others, too. However, you might see something unexpected going on for quadratic curves...

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -

    As you can see, quadratic curves are particularly ill-suited for poly-Bézier curves, as all the control points are effectively linked. Move one of them, and you move all of them. Not only that, but if we move the on-curve points, it's possible to get a situation where a control point cannot satisfy the constraint that it's the reflection of its two neighbouring control points... This means that we cannot use quadratic poly-Béziers for anything other than really, really simple shapes. And even then, they're probably the wrong choice. Cubic curves are pretty decent, but the fact that the derivatives are linked means we can't manipulate curves as well as we might if we relaxed the constraints a little.

    -

    So: let's relax the requirement a little.

    -

    We can change the constraint so that we still preserve the angle of the derivatives across sections (so transitions from one section to the next will still look natural), but give up the requirement that they should also have the same vector length. Doing so will give us a much more useful kind of poly-Bézier curve:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -

    Cubic curves are now better behaved when it comes to dragging control points around, but the quadratic poly-Bézier still has the problem that moving one control points will move the control points and may ending up defining "the next" control point in a way that doesn't work. Quadratic curves really aren't very useful to work with...

    -

    Finally, we also want to make sure that moving the on-curve coordinates preserves the relative positions of the associated control points. With that, we get to the kind of curve control that you might be familiar with from applications like Photoshop, Inkscape, Blender, etc.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - Скрипты отключены. Показываем резервное изображение. - - - -

    Again, we see that cubic curves are now rather nice to work with, but quadratic curves have a new, very serious problem: we can move an on-curve point in such a way that we can't compute what needs to "happen next". Move the top point down, below the left and right points, for instance. There is no way to preserve correct control points without a kink at the bottom point. Quadratic curves: just not that good...

    -

    A final improvement is to offer fine-level control over which points behave which, so that you can have "kinks" or individually controlled segments when you need them, with nicely well-behaved curves for the rest of the path. Implementing that, is left as an exercise for the reader.

    - -
    -
    -

    Curve offsetting

    -

    Perhaps you're like me, and you've been writing various small programs that use Bézier curves in some way or another, and at some point you make the step to implementing path extrusion. But you don't want to do it pixel based; you want to stay in the vector world. You find that extruding lines is relatively easy, and tracing outlines is coming along nicely (although junction caps and fillets are a bit of a hassle), and then you decide to do things properly and add Bézier curves to the mix. Now you have a problem.

    -

    Unlike lines, you can't simply extrude a Bézier curve by taking a copy and moving it around, because of the curvatures; rather than a uniform thickness, you get an extrusion that looks too thin in places, if you're lucky, but more likely will self-intersect. The trick, then, is to scale the curve, rather than simply copying it. But how do you scale a Bézier curve?

    -

    Bottom line: you can't. So you cheat. We're not going to do true curve scaling, or rather curve offsetting, because that's impossible. Instead we're going to try to generate 'looks good enough' offset curves.

    -
    - -

    "What do you mean, you can't? Prove it."

    -

    First off, when I say "you can't," what I really mean is "you can't offset a Bézier curve with another Bézier curve", not even by using a really high order curve. You can find the function that describes the offset curve, but it won't be a polynomial, and as such it cannot be represented as a Bézier curve, which has to be a polynomial. Let's look at why this is:

    -

    From a mathematical point of view, an offset curve O(t) is a curve such that, given our original curve B(t), any point on O(t) is a fixed distance d away from coordinate B(t). So let's math that:

    - - -

    However, we're working in 2D, and d is a single value, so we want to turn it into a vector. If we want a point distance d "away" from the curve B(t) then what we really mean is that we want a point at d times the "normal vector" from point B(t), where the "normal" is a vector that runs perpendicular ("at a right angle") to the tangent at B(t). Easy enough:

    - - -

    Now this still isn't very useful unless we know what the formula for N(t) is, so let's find out. N(t) runs perpendicular to the original curve tangent, and we know that the tangent is simply B'(t), so we could just rotate that 90 degrees and be done with it. However, we need to ensure that N(t) has the same magnitude for every t, or the offset curve won't be at a uniform distance, thus not being an offset curve at all. The easiest way to guarantee this is to make sure N(t) always has length 1, which we can achieve by dividing B'(t) by its magnitude:

    - - -

    Determining the length requires computing an arc length, and this is where things get Tricky with a capital T. First off, to compute arc length from some start a to end b, we must use the formula we saw earlier. Noting that "length" is usually denoted with double vertical bars:

    - - -

    So if we want the length of the tangent, we plug in B'(t), with t = 0 as start and -t = 1 as end:

    - - -

    And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is screwing everything up, because it turns our nice polynomials into things that are no longer polynomials.

    -

    There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine because they do, but they don't; remember that only the base function has binomial coefficients. That's before we factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call them circles, and they have much simpler functions than Bézier curves.

    -

    So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve.

    -

    And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the real offset curve would look like, if we could compute it.

    -
    + +

    + And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is + screwing everything up, because it turns our nice polynomials into things that are no longer polynomials. +

    +

    + There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with + unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine + because they do, but they don't; remember that only the base function has binomial coefficients. That's before we + factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make + all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call + them circles, and they have much simpler functions than Bézier curves. +

    +

    + So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means + N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means + that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for + O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve. +

    +

    + And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier + curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, + much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the + real offset curve would look like, if we could compute it. +

    + -

    So, you cannot offset a Bézier curve perfectly with another Bézier curve, no matter how high-order you make that other Bézier curve. However, we can chop up a curve into "safe" sub-curves (where "safe" means that all the control points are always on a single side of the baseline, and the midpoint of the curve at t=0.5 is roughly in the center of the polygon defined by the curve coordinates) and then point-scale each sub-curve with respect to its scaling origin (which is the intersection of the point normals at the start and end points).

    -

    A good way to do this reduction is to first find the curve's extreme points, as explained in the earlier section on curve extremities, and use these as initial splitting points. After this initial split, we can check each individual segment to see if it's "safe enough" based on where the center of the curve is. If the on-curve point for t=0.5 is too far off from the center, we simply split the segment down the middle. Generally this is more than enough to end up with safe segments.

    -

    The following graphics show off curve offsetting, and you can use the slider to control the distance at which the curve gets offset. The curve first gets reduced to safe segments, each of which is then offset at the desired distance. Especially for simple curves, particularly easily set up for quadratic curves, no reduction is necessary, but the more twisty the curve gets, the more the curve needs to be reduced in order to get segments that can safely be scaled.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    You may notice that this may still lead to small 'jumps' in the sub-curves when moving the curve around. This is caused by the fact that we're still performing a naive form of offsetting, moving the control points the same distance as the start and end points. If the curve is large enough, this may still lead to incorrect offsets.

    - -
    -
    -

    Graduated curve offsetting

    -

    What if we want to do graduated offsetting, starting at some distance s but ending at some other distance e? Well, if we can compute the length of a curve (which we can if we use the Legendre-Gauss quadrature approach) then we can also determine how far "along the line" any point on the curve is. With that knowledge, we can offset a curve so that its offset curve is not uniformly wide, but graduated between with two different offset widths at the start and end.

    -

    Like normal offsetting we cut up our curve in sub-curves, and then check at which distance along the original curve each sub-curve starts and ends, as well as to which point on the curve each of the control points map. This gives us the distance-along-the-curve for each interesting point in the sub-curve. If we call the total length of all sub-curves seen prior to seeing "the current" sub-curve S (and if the current sub-curve is the first one, S is zero), and we call the full length of our original curve L, then we get the following graduation values:

    -
      -
    • start: map S from interval (0,L) to interval (s,e)
    • -
    • c1: map(<strong>S+d1</strong>, 0,L, s,e), d1 = distance along curve to projection of c1
    • -
    • c2: map(<strong>S+d2</strong>, 0,L, s,e), d2 = distance along curve to projection of c2
    • -
    • ...
    • -
    • end: map(<strong>S+length(subcurve)</strong>, 0,L, s,e)
    • -
    -

    At each of the relevant points (start, end, and the projections of the control points onto the curve) we know the curve's normal, so offsetting is simply a matter of taking our original point, and moving it along the normal vector by the offset distance for each point. Doing so will give us the following result (these have with a starting width of 0, and an end width of 40 pixels, but can be controlled with your up and down arrow keys):

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - - - - Скрипты отключены. Показываем резервное изображение. - - - - - - -
    -
    -

    Circles and quadratic Bézier curves

    -

    Circles and Bézier curves are very different beasts, and circles are infinitely easier to work with than Bézier curves. Their formula is much simpler, and they can be drawn more efficiently. But, sometimes you don't have the luxury of using circles, or ellipses, or arcs. Sometimes, all you have are Bézier curves. For instance, if you're doing font design, fonts have no concept of geometric shapes, they only know straight lines, and Bézier curves. OpenType fonts with TrueType outlines only know quadratic Bézier curves, and OpenType fonts with Type 2 outlines only know cubic Bézier curves. So how do you draw a circle, or an ellipse, or an arc?

    -

    You approximate.

    -

    We already know that Bézier curves cannot model all curves that we can think of, and this includes perfect circles, as well as ellipses, and their arc counterparts. However, we can certainly approximate them to a degree that is visually acceptable. Quadratic and cubic curves offer us different curvature control, so in order to approximate a circle we will first need to figure out what the error is if we try to approximate arcs of increasing degree with quadratic and cubic curves, and where the coordinates even lie.

    -

    Since arcs are mid-point-symmetrical, we need the control points to set up a symmetrical curve. For quadratic curves this means that the control point will be somewhere on a line that intersects the baseline at a right angle. And we don't get any choice on where that will be, since the derivatives at the start and end point have to line up, so our control point will lie at the intersection of the tangents at the start and end point.

    -

    First, let's try to fit the quadratic curve onto a circular arc. In the following sketch you can move the mouse around over a unit circle, to see how well, or poorly, a quadratic curve can approximate the arc from (1,0) to where your mouse cursor is:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    As you can see, things go horribly wrong quite quickly; even trying to approximate a quarter circle using a quadratic curve is a bad idea. An eighth of a turns might look okay, but how okay is okay? Let's apply some maths and find out. What we're interested in is how far off our on-curve coordinates are with respect to a circular arc, given a specific start and end angle. We'll be looking at how much space there is between the circular arc, and the quadratic curve's midpoint.

    -

    We start out with our start and end point, and for convenience we will place them on a unit circle (a circle around 0,0 with radius 1), at some angle φ:

    - - -

    What we want to find is the intersection of the tangents, so we want a point C such that:

    - - -

    i.e. we want a point that lies on the vertical line through S (at some distance a from S) and also lies on the tangent line through E (at some distance b from E). Solving this gives us:

    - - -

    First we solve for b:

    - - -

    which yields:

    - - -

    which we can then substitute in the expression for a:

    - - -

    A quick check shows that plugging these values for a and b into the expressions for Cx and Cy give the same x/y coordinates for both "a away from A" and "b away from B", so let's continue: now that we know the coordinate values for C, we know where our on-curve point T for t=0.5 (or angle φ/2) is, because we can just evaluate the Bézier polynomial, and we know where the circle arc's actual point P is for angle φ/2:

    - - -

    We compute T, observing that if t=0.5, the polynomial values (1-t)², 2(1-t)t, and t² are 0.25, 0.5, and 0.25 respectively:

    - - -

    Which, worked out for the x and y components, gives:

    - - -

    And the distance between these two is the standard Euclidean distance:

    - - -

    So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter circle and eighth circle?

    -
    - - plotted for 0 ≤ φ ≤ π: - - - plotted for 0 ≤ φ ≤ ½π: - - - - - plotted for 0 ≤ φ ≤ ¼π: -
    + +

    + So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter + circle and eighth circle? +

    + + + + + + + + +
    + + plotted for 0 ≤ φ ≤ π: + + + plotted for 0 ≤ φ ≤ ½π: + + + + + plotted for 0 ≤ φ ≤ ¼π: +
    -

    We now see why the eighth circle arc looks decent, but the quarter circle arc doesn't: an error of roughly 0.06 at t=0.5 means we're 6% off the mark... we will already be off by one pixel on a circle with pixel radius 17. Any decent sized quarter circle arc, say with radius 100px, will be way off if approximated by a quadratic curve! For the eighth circle arc, however, the error is only roughly 0.003, or 0.3%, which explains why it looks so close to the actual eighth circle arc. In fact, if we want a truly tiny error, like 0.001, we'll have to contend with an angle of (rounded) 0.593667, which equates to roughly 34 degrees. We'd need 11 quadratic curves to form a full circle with that precision! (technically, 10 and ten seventeenth, but we can't do partial curves, so we have to round up). That's a whole lot of curves just to get a shape that can be drawn using a simple function!

    -

    In fact, let's flip the function around, so that if we plug in the precision error, labelled ε, we get back the maximum angle for that precision:

    - - -

    And frankly, things are starting to look a bit ridiculous at this point, we're doing way more maths than we've ever done, but thankfully this is as far as we need the maths to take us: If we plug in the precisions 0.1, 0.01, 0.001 and 0.0001 we get the radians values 1.748, 1.038, 0.594 and 0.3356; in degrees, that means we can cover roughly 100 degrees (requiring four curves), 59.5 degrees (requiring six curves), 34 degrees (requiring 11 curves), and 19.2 degrees (requiring a whopping nineteen curves).

    -

    The bottom line? Quadratic curves are kind of lousy if you want circular (or elliptical, which are circles that have been squashed in one dimension) curves. We can do better, even if it's just by raising the order of our curve once. So let's try the same thing for cubic curves.

    - -
    -
    -

    Circular arcs and cubic Béziers

    -

    Let's look at approximating circles and circular arcs using cubic Béziers. How much better is that?

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    At cursory glance, a fair bit better, but let's find out how much better by looking at how to construct the Bézier curve.

    -

    A construction diagram for a cubic approximation of a circular arc

    -

    The start and end points are trivial, but the mid point requires a bit of work, but it's mostly basic trigonometry once we know the angle θ for our circular arc: if we scale our circular arc to a unit circle, we can always start our arc, with radius 1, at (1,0) and then given our arc angle θ, we also know that the circular arc has length θ (because unit circles are nice that way). We also know our end point, because that's just (cos(θ), sin(θ)), and so the challenge is to figure out what control points we need in order for the curve at t=0.5 to exactly touch the circular arc at the angle θ/2:

    -

    So let's again formally describe this:

    - - -

    Only P3 isn't quite straight-forward here, and its description is based on the fact that the triangle (origin, P4, P3) is a right angled triangle, with the distance between the origin and P4 being 1 (because we're working with a unit circle), and the distance between P4 and P3 being k, so that we can represent P3 as "The point P4 plus the vector from the origin to P4 but then rotated a quarter circle, counter-clockwise, and scaled by k".

    -

    With that, we can determine the y-coordinates for A, B, e1, and e2, after which we have all the information we need to determine what the value of k is. We can find these values by using (no surprise here) linear interpolation between known points, as A is midway between P2 and P3, e1 is between A and "midway between P1 and P2" (which is "half height" P2), and so forth:

    - - -

    Which now gives us two identities for B, because in addition to determining B through linear interpolation, we also know that B's y coordinate is just sin(θ/2): we started this exercise by saying we were going to approximate the circular arc using a Bézier curve that had its midpoint, which is point B, touching the unit circle at the arc's half-angle, by definition making B the point at (cos(θ/2), sin(θ/2)).

    -

    This means we can equate the two identities we now have for By and solve for k.

    -
    - -

    Deriving k

    -

    Solving for k is fairly straight forward, but it's a fair few steps, and if you just the immediate result: using a tool like Wolfram Alpha is definitely the way to go. That said, let's get going:

    - - -

    And finally, we can take further advantage of several trigonometric identities to drastically simplify our formula for k:

    - - -

    And we're done.

    -
    + +

    And we're done.

    + -

    So, the distance of our control points to the start/end points can be expressed as a number that we get from an almost trivial expression involving the circular arc's angle:

    - - -

    Which means that for any circular arc with angle θ and radius r, our Bézier approximation based on three points of incidence is:

    - - -

    Which also gives us the commonly found value of 0.55228 for quarter circles, based on them having an angle of half π:

    - - -

    And thus giving us the following Bézier coordinates for a quarter circle of radius r:

    - - -
    + +
    +

    So, how accurate is this?

    +

    + Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points + that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum + deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but + rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we + get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff. +

    +

    + So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, + finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around + that t value, repeating as many times as necessary to get us an arbitrarily precise value of t: +

    -

    So, how accurate is this?

    -

    Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff.

    -

    So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around that t value, repeating as many times as necessary to get us an arbitrarily precise value of t:

    - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    Plus, how often do you get to write a function with that name?

    -

    Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity first, and then coming back down as we approach 2π)

    - - -
    - - - - - -
    - error plotted for 0 ≤ φ ≤ 2π - - error plotted for 0 ≤ φ ≤ π - - error plotted for 0 ≤ φ ≤ ½π -
    + return getMostWrongT(radius, bezier, worst_t - stepsize, worst_t + stepsize) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again:

    -

    +

    Plus, how often do you get to write a function with that name?

    +

    + Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see + what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity + first, and then coming back down as we approach 2π) +

    + + + + + + + + + + + + + +
    + + + + + +
    + error plotted for 0 ≤ φ ≤ 2π + + error plotted for 0 ≤ φ ≤ π + + error plotted for 0 ≤ φ ≤ ½π +
    -

    Yeah... the error of a cubic approximation for a quarter circle turns out to be two orders of magnitude better. At approximately 0.00027 (or: just shy of being 2.7 pixels off for a circle with a radius of 10,000 pixels) the increase in precision over quadratic curves is quite spectacular - certainly good enough that no one in their right mind should ever use quadratic curves.

    -
    +

    + That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we + can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again: +

    +

    -

    So that's it, kappa is 4/3 · tan(θ/4) , we're done! ...or are we?

    -

    Can we do better?

    -

    Technically: yes, we can. But I'm going to prefix this section with "we can, and we should investigate that possibility, but let me warn you up front that the result is only better if we're going to hard-code the values". We're about to get into the weeds and the standard three-points-of-incidence value is so good already that for most applications, trying to do better won't make any sense at all.

    -

    So with that said: what we calculated above is an upper bound for a best fit Bézier curve for a circular arc: anywhere we don't touch the circular arc in our approximation, we've "overshot" the arc. What if we dropped our value for k just a little, so that the curve starts out as an over-estimation, but then crosses the circular arc, yielding an region of underestimation, and then crosses the circular arc again, with another region of overestimation. This might give us a lower overall error, so let's see what we can do.

    -

    First, let's express the total error (given circular arc angle θ, and some k) using standard calculus notation:

    - - -

    This says that the error function for a given angle and value of k is equal to the "infinite" sum of differences between our curve and the circular arc, as we run t from 0 to 1, using an infinitely small step size. between subsequent t values.

    -

    Now, since we want to find the minimal error, that means we want to know where along this function things go from "error is getting progressively less" to "error is increasing again", which means we want to know where its derivative is zero, which as mathematical expression looks like:

    - - -

    And here we have the most direct application of the Fundamental Theorem of Calculus: the derivative and integral are each other's inverse operations, so they cancel out, leaving us with our original function:

    - - -

    And now we just solve for that... oh wait. We've seen this before. In order to solve this, we'd end up needing to solve this:

    - - -

    And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section on arc lengths—there is no symbolic solution for this equasion. Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer!

    -
    + +

    + And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section + on arc lengths—there is no symbolic solution for this equasion. + Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer! +

    +
    +

    Iterating on a solution

    +

    + By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the + value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's + center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just + binary search our way to the most accurate value for c that gets us that middle case. +

    +

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    -

    Iterating on a solution

    -

    By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just binary search our way to the most accurate value for c that gets us that middle case.

    -

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    - - - - -
    1 - -
    2
    3
    4
    - -

    And then the binary search algorithm, which can be found in pretty much any CS textbook, as well as more online articles, tutorials, and blog posts than you can ever read in a life time:

    + return binarySearch(radius, angle, points, lowerBound, upperBound) + + + + 2 + + + 3 + + + 4 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Using the following radialError function, which samples the curve's approximation of the circular arc over several points (although the first and last point will never contribute anything, so we skip them):

    + return binarySearch(radius, angle, points, value, upperBound) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a vector, we can get its length to the origin using a magnitude call.

    -

    Examining the result

    -

    Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, for each angle, plot what the difference between the circular arc and the Bézier approximation looks like:

    -

    image-20210419085430711

    -

    Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from 0 to θ:

    -

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    - - -
    - - - - - -
    - max deflection using unit scale - - max deflection at 10x scale - - max deflection at 100x scale -
    + return err + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    -

    As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference.

    -
    +

    + In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a + vector, we can get its length to the origin using a magnitude call. +

    +

    Examining the result

    +

    + Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, + for each angle, plot what the difference between the circular arc and the Bézier approximation looks like: +

    +

    image-20210419085430711

    +

    + Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the + plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from + 0 to θ: +

    +

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    + + + + + + + + + + + + + +
    + + + + + +
    + max deflection using unit scale + + max deflection at 10x scale + + max deflection at 100x scale +
    -

    At this point it should be clear that while, yes, there are improvement to be had, they're essentially insignificant while also being much more computationally expensive.

    -

    TL;DR: just tell me which value I should be using

    -

    It depends on what we need to do. If we just want the best value for quarter circles, and we're going to hard code the value for k, then there is no reason to hard-code the constant k=4/3*tan(pi/8) when you can just as easily hard-code the constant as k=0.551784777779014 instead.

    -

    If you need "the" value for quarter circles, use 0.551785 instead of 0.55228

    -

    However, for dynamic arc approximation, in code that tries to fit circular paths using Bézier paths instead, it should be fairly obvious that the simple function involving a tangent computation, two divisions, and one multiplication, is vastly more performant than running all the code we ended writing just to get a 25% lower error value, and most certainly worth preferring over getting the "more accurate" value.

    -

    If you need to fit Béziers to circular arcs on the fly, use 4/3 * tan(θ/4)

    -

    However, always remember that if you're writing for humans, you can typically use the best of both worlds: as the user interacts with their curves, you should draw their curves instead of drawing approximations of them. If they need to draw circles or circular arcs, draw those, and only approximate them with a Bézier curve when the data needs to be exported to a format that doesn't support those. Ideally with a preview mechanism that highlights where the errors will be, and how large they will be.

    -

    If you're writing code for graphics design by humans, use circular arcs for circular arcs

    -

    And that's it. We have pretty well exhausted this subject. There are different metrics we could use to find "different best k values", like trying to match arc length (e.g. when we're optimizing for material cost), or minimizing the area between the circular arc and the Bézier curve (e.g. when we're optimizing for inking), or minimizing the rate of change of the Bézier's curvature (e.g. when we're optimizing for curve traversal) and they all yield values that are so similar that it's almost certainly not worth it. (For instance, for quarter circle approximations those values are 0.551777, 0.5533344, and 0.552184 respectively. Much like the 0.551785 we get from minimizing the maximum deflection, none of these values are significantly better enough to prefer them over the upper bound value).

    +

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    +

    + As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by + 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an + almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference. +

    +
    -
    -
    -

    Approximating Bézier curves with circular arcs

    -

    Let's look at doing the exact opposite of the previous section: rather than approximating circular arc using Bézier curves, let's approximate Bézier curves using circular arcs.

    -

    We already saw in the section on circle approximation that this will never yield a perfect equivalent, but sometimes you need circular arcs, such as when you're working with fabrication machinery, or simple vector languages that understand lines and circles, but not much else.

    -

    The approach is fairly simple: pick a starting point on the curve, and pick two points that are further along the curve. Determine the circle that goes through those three points, and see if it fits the part of the curve we're trying to approximate. Decent fit? Try spacing the points further apart. Bad fit? Try spacing the points closer together. Keep doing this until you've found the "good approximation/bad approximation" boundary, record the "good" arc, and then move the starting point up to overlap the end point we previously found. Rinse and repeat until we've covered the entire curve.

    -

    We already saw how to fit a circle through three points in the section on creating a curve from three points, and finding the arc through those points is straight-forward: pick one of the three points as start point, pick another as an end point, and the arc has to necessarily go from the start point, to the end point, over the remaining point.

    -

    So, how can we convert a Bézier curve into a (sequence of) circular arc(s)?

    -
      -
    • Start at t=0
    • -
    • Pick two points further down the curve at some value m = t + n and e = t + 2n
    • -
    • Find the arc that these points define
    • -
    • Determine how close the found arc is to the curve:
        -
      • Pick two additional points e1 = t + n/2 and e2 = t + n + n/2.
      • -
      • These points, if the arc is a good approximation of the curve interval chosen, should - lie on the circle, so their distance to the center of the circle should be the - same as the distance from any of the three other points to the center.
      • -
      • For point points, determine the (absolute) error between the radius of the circle, and the -actual distance from the center of the circle to the point on the curve.
      • -
      • If this error is too high, we consider the arc bad, and try a smaller interval.
      • -
      -
    • -
    -

    The result of this is shown in the next graphic: we start at a guaranteed failure: s=0, e=1. That's the entire curve. The midpoint is simply at t=0.5, and then we start performing a binary search.

    -
      -
    1. We start with low=0, mid=0.5 and high=1
    2. -
    3. That'll fail, so we retry with the interval halved: {0, 0.25, 0.5}
        -
      • If that arc's good, we move back up by half distance: {0, 0.375, 0.75}.
      • -
      • However, if the arc was still bad, we move down by half the distance: {0, 0.125, 0.25}.
      • -
      -
    4. -
    5. We keep doing this over and over until we have two arcs, in sequence, of which the first arc is good, and the second arc is bad. When we find that pair, we've found the boundary between a good approximation and a bad approximation, and we pick the good arc.
    6. -
    -

    The following graphic shows the result of this approach, with a default error threshold of 0.5, meaning that if an arc is off by a combined half pixel over both verification points, then we treat the arc as bad. This is an extremely simple error policy, but already works really well. Note that the graphic is still interactive, and you can use your up and down arrow keys keys to increase or decrease the error threshold, to see what the effect of a smaller or larger error threshold is.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    With that in place, all that's left now is to "restart" the procedure by treating the found arc's end point as the new to-be-determined arc's starting point, and using points further down the curve. We keep trying this until the found end point is for t=1, at which point we are done. Again, the following graphic allows for up and down arrow key input to increase or decrease the error threshold, so you can see how picking a different threshold changes the number of arcs that are necessary to reasonably approximate a curve:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    So... what is this good for? Obviously, if you're working with technologies that can't do curves, but can do lines and circles, then the answer is pretty straightforward, but what else? There are some reasons why you might need this technique: using circular arcs means you can determine whether a coordinate lies "on" your curve really easily (simply compute the distance to each circular arc center, and if any of those are close to the arc radii, at an angle between the arc start and end, bingo, this point can be treated as lying "on the curve"). Another benefit is that this approximation is "linear": you can almost trivially travel along the arcs at fixed speed. You can also trivially compute the arc length of the approximated curve (it's a bit like curve flattening). The only thing to bear in mind is that this is a lossy equivalence: things that you compute based on the approximation are guaranteed "off" by some small value, and depending on how much precision you need, arc approximation is either going to be super useful, or completely useless. It's up to you to decide which, based on your application!

    - -
    -
    -

    B-Splines

    -

    No discussion on Bézier curves is complete without also giving mention of that other beast in the curve design space: B-Splines. Easily confused to mean Bézier splines, that's not actually what they are; they are "basis function" splines, which makes a lot of difference, and we'll be looking at those differences in this section. We're not going to dive as deep into B-Splines as we have for Bézier curves (that would be an entire primer on its own) but we'll be looking at how B-Splines work, what kind of maths is involved in computing them, and how to draw them based on a number of parameters that you can pick for individual B-Splines.

    -

    First off: B-Splines are piecewise, polynomial interpolation curves, where the "single curve" is built by performing polynomial interpolation over a set of points, using a sliding window of a fixed number of points. For instance, a "cubic" B-Spline defined by twelve points will have its curve built by evaluating the polynomial interpolation of four points, and the curve can be treated as a lot of different sections, each controlled by four points at a time, such that the full curve consists of smoothly connected sections defined by points {1,2,3,4}, {2,3,4,5}, ..., {8,9,10,11}, and finally {9,10,11,12}, for eight sections.

    -

    What do they look like? They look like this! Tap on the graphic to add more points, and move points around to see how they map to the spline curve drawn.

    - - - Скрипты отключены. Показываем резервное изображение. - - - -

    The important part to notice here is that we are not doing the same thing with B-Splines that we do for poly-Béziers or Catmull-Rom curves: both of the latter simply define new sections as literally "new sections based on new points", so a 12 point cubic poly-Bézier curve is actually impossible, because we start with a four point curve, and then add three more points for each section that follows, so we can only have 4, 7, 10, 13, 16, etc. point Poly-Béziers. Similarly, while Catmull-Rom curves can grow by adding single points, this addition of a single point introduces three implicit Bézier points. Cubic B-Splines, on the other hand, are smooth interpolations of each possible curve involving four consecutive points, such that at any point along the curve except for our start and end points, our on-curve coordinate is defined by four control points.

    -

    Consider the difference to be this:

    -
      -
    • for Bézier curves, the curve is defined as an interpolation of points, but:
    • -
    • for B-Splines, the curve is defined as an interpolation of curves.
    • -
    -

    In fact, let's look at that again, but this time with the base curves shown, too. Each consecutive four points define one curve:

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have a look at how things work.

    -

    How to compute a B-Spline curve: some maths

    -

    Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the end, just like for Bézier curves), by evaluating the following function:

    - + +

    + In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have + a look at how things work. +

    +

    How to compute a B-Spline curve: some maths

    +

    + Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic + B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the + end, just like for Bézier curves), by evaluating the following function: +

    + - -

    Which, honestly, doesn't tell us all that much. All we can see is that a point on a B-Spline curve is defined as "a mix of all the control points, weighted somehow", where the weighting is achieved through the N(...) function, subscripted with an obvious parameter i, which comes from our summation, and some magical parameter k. So we need to know two things: 1. what does N(t) do, and 2. what is that k? Let's cover both, in reverse order.

    -

    The parameter k represents the "knot interval" over which a section of curve is defined. As we learned earlier, a B-Spline curve is itself an interpolation of curves, and we can treat each transition where a control point starts or stops influencing the total curvature as a "knot on the curve". -Doing so for a degree d B-Spline with n control point gives us d + n + 1 knots, defining d + n intervals along the curve, and it is these intervals that the above k subscript to the N() function applies to.

    -

    Then the N() function itself. What does it look like?

    - - -

    So this is where we see the interpolation: N(t) for an (i,k) pair (that is, for a step in the above summation, on a specific knot interval) is a mix between N(t) for (i,k-1) and N(t) for (i+1,k-1), so we see that this is a recursive iteration where i goes up, and k goes down, so it seem reasonable to expect that this recursion has to stop at some point; obviously, it does, and specifically it does so for the following i/k values:

    - - -

    And this function finally has a straight up evaluation: if a t value lies within a knot-specific interval once we reach a k=1 value, it "counts", otherwise it doesn't. We did cheat a little, though, because for all these values we need to scale our t value first, so that it lies in the interval bounded by knots[d] and knots[n], which are the start point and end point where curvature is controlled by exactly order control points. For instance, for degree 3 (=order 4) and 7 control points, with knot vector [1,2,3,4,5,6,7,8,9,10,11], we map t from [the interval 0,1] to the interval [4,8], and then use that value in the functions above, instead.

    -

    Can we simplify that?

    -

    We can, yes.

    -

    People far smarter than us have looked at this work, and two in particular — Maurice Cox and Carl de Boor — came to a mathematically pleasing solution: to compute a point P(t), we can compute this point by evaluating d(t) on a curve section between knots i and i+1:

    - - -

    This is another recursive function, with k values decreasing from the curve order to 1, and the value α (alpha) defined by:

    - - -

    That looks complicated, but it's not. Computing alpha is just a fraction involving known, plain numbers. And, once we have our alpha value, we also have (1-alpha) because it's a trivial subtraction. Computing the d() function is thus mostly a matter of computing pretty simple arithmetical statements, with some caching of results so we can refer to them as we recurve. While the recursion might see computationally expensive, the total algorithm is cheap, as each step only involves very simple maths.

    -

    Of course, the recursion does need a stop condition:

    - - -

    So, we actually see two stopping conditions: either i becomes 0, in which case d() is zero, or k becomes zero, in which case we get the same "either 1 or 0" that we saw in the N() function above.

    -

    Thanks to Cox and de Boor, we can compute points on a B-Spline pretty easily using the same kind of linear interpolation we saw in de Casteljau's algorithm. For instance, if we write out d() for i=3 and k=3, we get the following recursion diagram:

    - - -

    That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. It's really quite elegant.

    -

    One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then summing back up "to the left". We can just start on the right and work our way left immediately.

    -

    Running the computation

    -

    Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on this Michigan Tech page, but an easier to read version is implemented by b-spline.js, so we'll look at its code.

    -

    Given an input value t, we first map the input to a value from the domain [0,1] to the domain [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped t value lies on:

    + +

    + That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a + mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are + themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. + It's really quite elegant. +

    +

    + One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the + d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so + in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point + vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then + summing back up "to the left". We can just start on the right and work our way left immediately. +

    +

    Running the computation

    +

    + Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, + and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on + this Michigan Tech page, but an easier to read version + is implemented by b-spline.js, so we'll look at its code. +

    +

    + Given an input value t, we first map the input to a value from the domain [0,1] to the domain + [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped + t value lies on: +

    - - -
    1 - -
    2
    3
    - -

    after running this code, s is the index for the section the point will lie on. We then run the algorithm mentioned on the MU page (updated to use this description's variable names):

    +} + + + + 2 + + + 3 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those v[i-1] don't try to use an array index that doesn't exist)

    -

    Open vs. closed paths

    -

    Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need more than just a single point.

    -

    Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than separate coordinate objects.

    -

    Manipulating the curve through the knot vector

    -

    The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at:

    -
      -
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. -
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. -
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. -
    7. we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a uniform vector in between.
    8. -
    -

    Uniform B-Splines

    -

    The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to the curvature.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve.

    -

    The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get rid of these, by being clever about how we apply the following uniformity-breaking approach instead...

    -

    Reducing local curve complexity by collapsing intervals

    -

    Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost and the curve "kinks".

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Open-Uniform B-Splines

    -

    By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the curve not starting and ending where we'd kind of like it to:

    -

    For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences that determine the curve shape.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Non-uniform B-Splines

    -

    This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than that any value knots[k+1] should be greater than or equal to knots[k].

    -

    One last thing: Rational B-Splines

    -

    While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like for rational Bézier curves.

    - - - Скрипты отключены. Показываем резервное изображение. - - - - - -

    Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in arbitrary-precision 2D design due to the level of control a NURBS curve offers designers.

    -

    While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the last.

    -

    Extending our implementation to cover rational splines

    -

    The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the extended dimension.

    -

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    -

    We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how many dimensions it needs to interpolate.

    -

    In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight information.

    -

    Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing (x'/w', y'/w'). And that's it, we're done!

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    - +

    + (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the + interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those + v[i-1] don't try to use an array index that doesn't exist) +

    +

    Open vs. closed paths

    +

    + Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last + point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first + and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As + such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly + more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need + more than just a single point. +

    +

    + Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for + points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same + coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing + references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than + separate coordinate objects. +

    +

    Manipulating the curve through the knot vector

    +

    + The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As + mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the + values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on + intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting + things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at: +

    +
      +
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. +
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. +
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. +
    7. + we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a + uniform vector in between. +
    8. +
    +

    Uniform B-Splines

    +

    + The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire + curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or + [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines + the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to + the curvature. +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    + This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every + interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve. +

    +

    + The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can + perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get + rid of these, by being clever about how we apply the following uniformity-breaking approach instead... +

    +

    Reducing local curve complexity by collapsing intervals

    +

    + Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the + sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, + and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost + and the curve "kinks". +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    Open-Uniform B-Splines

    +

    + By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the + curve not starting and ending where we'd kind of like it to: +

    +

    + For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in + which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values + N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector + [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences + that determine the curve shape. +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    Non-uniform B-Splines

    +

    + This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to + pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than + that any value knots[k+1] should be greater than or equal to knots[k]. +

    +

    One last thing: Rational B-Splines

    +

    + While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's + "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a + ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a + control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like + for rational Bézier curves. +

    + + + Скрипты отключены. Показываем резервное изображение. + + + + + +

    + Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the + NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, + the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an + important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in + arbitrary-precision 2D design due to the level of control a NURBS curve offers designers. +

    +

    + While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform + Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, + they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the + last. +

    +

    Extending our implementation to cover rational splines

    +

    + The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the + control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to + one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the + extended dimension. +

    +

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    +

    + We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate + interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how + many dimensions it needs to interpolate. +

    +

    + In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the + final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight + information. +

    +

    + Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing + (x'/w', y'/w'). And that's it, we're done! +

    +
    +
    + -

    Comments and questions

    -

    First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing.

    -

    With that said, on to the comments!

    -
    +

    + + Comments and questions +

    +

    + First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how + to let me know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing. +

    +

    With that said, on to the comments!

    +
    +
    +
    +
    +
    -
    - -
    + -
    - - - - - - - - - \ No newline at end of file + + + diff --git a/docs/uk-UA/index.html b/docs/uk-UA/index.html index d2628228..c1419e23 100644 --- a/docs/uk-UA/index.html +++ b/docs/uk-UA/index.html @@ -1,161 +1,168 @@ + + + + Підручник з кривих Безьє - - - - Підручник з кривих Безьє + - + - - - - + - - - - @font-face { - font-family: Roboto; - src: url(./fonts/roboto-v20-latin-regular.woff) format("WOFF"); - } + - :root[lang="uk-UA"] { - font-family: Roboto, "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 17.4px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    - + +
    +

    + Підручник з кривих Безьє +

    +

    Безкоштовна онлайн-книга, яка навчить вас всьому необхідному, щоб працювати з кривими Безьє.

    +
    + Читати рідною мовою: + +

    + (Не бачите своєї мови у списку або хочете, щоб вона досягла 100%? + Допоможіть перекласти цей контент!) +

    +
    -
    - -

    Підручник з кривих Безьє

    -

    Безкоштовна онлайн-книга, яка навчить вас всьому необхідному, щоб працювати з кривими Безьє.

    -
    - Читати рідною мовою: - -

    (Не бачите своєї мови у списку або хочете, щоб вона досягла 100%? Допоможіть перекласти цей контент!)

    -
    - -

    - Ласкаво прошу до Підручника з кривих Безьє. Це безкоштовний вебсайт/електронна книга, що пояснює і математичні, - і програмувальні аспекти кривих Безьє, покриваючи широкий спектр областей, які стосуються малювання та роботи з - цими кривими. Криві Безьє застосовуються всюди, починаючи з кривих у Photoshop, і закінчуючи функціями пом'якшення - (easing function) CSS та описом контурів популярних шрифтів. -

    -

    - Якщо ви тут вперше: ласкаво прошу! Напишіть мені сюди, - якщо вас цікавить щось, пов'язане з кривими Безьє, чого немає у підручнику. -

    - -

    Donations and sponsorship

    +

    + Ласкаво прошу до Підручника з кривих Безьє. Це безкоштовний вебсайт/електронна книга, що пояснює і математичні, і програмувальні аспекти + кривих Безьє, покриваючи широкий спектр областей, які стосуються малювання та роботи з цими кривими. Криві Безьє застосовуються всюди, + починаючи з кривих у Photoshop, і закінчуючи функціями пом'якшення (easing function) CSS та описом контурів популярних шрифтів. +

    +

    + Якщо ви тут вперше: ласкаво прошу! Напишіть мені сюди, якщо вас цікавить щось, + пов'язане з кривими Безьє, чого немає у підручнику. +

    +

    Donations and sponsorship

    -

    - Якщо ви використовуєте цей ресурс для наукових досліджень, або пишете власне програмне забезпечення, будь ласка, - розгляньте можливість фінансової - підтримки (будь-яка сума вітається), або станьте патроном на Patreon. - Мені не платять за цю роботу, тому якщо ви вважаєте цей сайт корисним, і хотіли б, щоб він підтримувався протягом - тривалого часу, знайте: багато кави потрібно було за ці роки, і ще багато буде потрібно. Тому, якщо можете допомогти - з кавою, то можете бути впевнені, що цей ресурс буде підтримуватися ще довго! -

    - +

    + Якщо ви використовуєте цей ресурс для наукових досліджень, або пишете власне програмне забезпечення, будь ласка, розгляньте можливість + фінансової підтримки (будь-яка сума + вітається), або станьте патроном на Patreon. Мені не платять за цю роботу, тому якщо ви + вважаєте цей сайт корисним, і хотіли б, щоб він підтримувався протягом тривалого часу, знайте: багато кави потрібно було за ці роки, і ще + багато буде потрібно. Тому, якщо можете допомогти з кавою, то можете бути впевнені, що цей ресурс буде підтримуватися ще довго! +

    - -
    +
    -

    - — Pomax -

    - + +
    - -
    +
    +
    +

    Preface

    +

    + In order to draw things in 2D, we usually rely on lines, which typically get classified into two categories: straight lines, and curves. The + first of these are as easy to draw as they are easy to make a computer draw. Give a computer the first and last point in the line, and BAM! + straight line. No questions asked. +

    +

    + Curves, however, are a much bigger problem. While we can draw curves with ridiculous ease freehand, computers are a bit handicapped in that + they can't draw curves unless there is a mathematical function that describes how it should be drawn. In fact, they even need this for + straight lines, but the function is ridiculously easy, so we tend to ignore that as far as computers are concerned; all lines are + "functions", regardless of whether they're straight or curves. However, that does mean that we need to come up with fast-to-compute + functions that lead to nice looking curves on a computer. There are a number of these, and in this article we'll focus on a particular + function that has received quite a bit of attention and is used in pretty much anything that can draw curves: Bézier curves. +

    +

    + They're named after Pierre Bézier, who is principally responsible for making + them known to the world as a curve well-suited for design work (publishing his investigations in 1962 while working for Renault), although + he was not the first, or only one, to "invent" these type of curves. One might be tempted to say that the mathematician + Paul de Casteljau was first, as he began investigating the nature of these + curves in 1959 while working at Citroën, and came up with a really elegant way of figuring out how to draw them. However, de Casteljau did + not publish his work, making the question "who was first" hard to answer in any absolute sense. Or is it? Bézier curves are, at their core, + "Bernstein polynomials", a family of mathematical functions investigated by + Sergei Natanovich Bernstein, whose publications on them date back at + least as far as 1912. +

    +

    + Anyway, that's mostly trivia, what you are more likely to care about is that these curves are handy: you can link up multiple Bézier curves + so that the combination looks like a single curve. If you've ever drawn Photoshop "paths" or worked with vector drawing programs like Flash, + Illustrator or Inkscape, those curves you've been drawing are Bézier curves. +

    +

    + But what if you need to program them yourself? What are the pitfalls? How do you draw them? What are the bounding boxes, how do you + determine intersections, how can you extrude a curve, in short: how do you do everything that you might want to do with these curves? That's + what this page is for. Prepare to be mathed! +

    +
    +

    Virtually all Bézier graphics are interactive.

    +

    + This page uses interactive examples, relying heavily on Bezier.js, as well as maths + formulae which are typeset into SVG using the XeLaTeX typesetting system and + pdf2svg by David Barton. +

    +

    This book is open source.

    +

    + This book is an open source software project, and lives on two github repositories. The first is + https://github.com/pomax/bezierinfo and is the purely-for-presentation version you are + viewing right now. The other repository is https://github.com/pomax/BezierInfo-2, + which is the development version, housing all the code that gets turned into the web version, and is also where you should file + issues if you find bugs or have ideas on what to change or add to the primer. +

    +

    How complicated is the maths going to be?

    +

    + Most of the mathematics in this Primer are early high school maths. If you understand basic arithmetic, and you know how to read English, + you should be able to get by just fine. There will at times be far more complicated maths, but if you don't feel like digesting + them, you can safely skip over them by either skipping over the "detail boxes" in section or by just jumping to the end of a section with + maths that looks too involving. The end of sections typically simply list the conclusions so you can just work with those values directly. +

    +

    What language is all this example code in?

    +

    + There are way too many programming languages to favour one of all others, soo all the example code in this Primer uses a form of + pseudo-code that uses a syntax that's close enough to, but not actually, modern scripting languages like JS, Python, etc. That means you + won't be able to copy-paste any of it without giving it any thought, but that's intentional: if you're reading this primer, presumably you + want to learn, and you don't learn by copy-pasting. You learn by doing things yourself, making mistakes, and then fixing + those mistakes. Now, of course, I didn't intentionally add errors in the example code just to trick you into making mistakes (that would + be horrible!) but I did intentionally keep the code from favouring one programming language over another. Don't worry though, if + you know even a single procedural programming language, you should be able to read the examples without any difficulties. +

    +

    Questions, comments:

    +

    + If you have suggestions for new sections, hit up the Github issue tracker (also + reachable from the repo linked to in the upper right). If you have questions about the material, there's currently no comment section + while I'm doing the rewrite, but you can use the issue tracker for that as well. Once the rewrite is done, I'll add a general comment + section back in, and maybe a more topical "select this section of text and hit the 'question' button to ask a question about it" system. + We'll see. +

    +

    Help support the book!

    +

    + If you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me + know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 100-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing! +

    +
    +
    +
    +

    Що нового?

    +

    + Цей підручник постійно розвививається, тож залежно від того, коли ви востаннє його переглядали, тут можуть бути оновлення. Перейдіть за + цим посиланням, щоб побачити, що було додано. (Також доступний RSS-канал) +

    + + + +
    +

    November 2020

    +
      +
    • Added a section on finding curve/circle intersections

    • +
    +

    October 2020

    +
      +
    • Added the Ukranian locale! Help out in getting its localization to 100%!

    • +
    +

    August-September 2020

    +
      +
    • +

      + Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS + turned on. +

      +
    • +
    +

    June 2020

    +
      +
    • Added automatic CI/CD using Github Actions

    • +
    +

    January 2020

    +
      +
    • Added reset buttons to all graphics

    • +
    • Updated to preface to correctly describe the on-page maths

    • +
    • Fixed the Catmull-Rom section because it had glaring maths errors

    • +
    +

    August 2019

    +
      +
    • Added a section on (plain) rational Bezier curves

    • +
    • Improved the Graphic component to allow for sliders

    • +
    +

    December 2018

    +
      +
    • Added a section on curvature and calculating kappa.

    • +
    • +

      + Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this + site! +

      +
    • +
    +

    August 2018

    +
      +
    • Added a section on finding a curve's y, if all you have is the x coordinate.

    • +
    +

    July 2018

    +
      +
    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

    • +
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

    • +
    • +

      (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      +
    • +
    +

    June 2018

    +
      +
    • Added a section on direct curve fitting.

    • +
    • Added source links for all graphics.

    • +
    • Added this "What's new?" section.

    • +
    +

    April 2017

    +
      +
    • Added a section on 3d normals.

    • +
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

    • +
    +

    February 2017

    +
      +
    • Finished rewriting the entire codebase for localization.

    • +
    +

    January 2016

    +
      +
    • Added a section to explain the Bezier interval.

    • +
    • Rewrote the Primer as a React application.

    • +
    +

    December 2015

    +
      +
    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

    • +
    • +

      + Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics. +

      +
    • +
    +

    May 2015

    +
      +
    • Switched over to pure JS rather than Processing-through-Processing.js

    • +
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

    • +
    +

    April 2015

    +
      +
    • Added a section on arc length approximations.

    • +
    +

    February 2015

    +
      +
    • Added a section on the canonical cubic Bezier form.

    • +
    +

    November 2014

    +
      +
    • Switched to HTTPS.

    • +
    +

    July 2014

    +
      +
    • Added the section on arc approximation.

    • +
    +

    April 2014

    +
      +
    • Added the section on Catmull-Rom fitting.

    • +
    +

    November 2013

    +
      +
    • Added the section on Catmull-Rom / Bezier conversion.

    • +
    • Added the section on Bezier cuves as matrices.

    • +
    +

    April 2013

    +
      +
    • Added a section on poly-Beziers.

    • +
    • Added a section on boolean shape operations.

    • +
    +

    March 2013

    +
      +
    • First drastic rewrite.

    • +
    • Added sections on circle approximations.

    • +
    • Added a section on projecting a point onto a curve.

    • +
    • Added a section on tangents and normals.

    • +
    • Added Legendre-Gauss numerical data tables.

    • +
    +

    October 2011

    +
      +
    • +

      + First commit for the bezierinfo site, based on the pre-Primer webpage that covered + the basics of Bezier curves in HTML with Processing.js examples. +

      +
    • +
    +
    +
    +
    +
    +

    + + Короткий вступ +

    +

    + Давайте розпочнемо з добрих новин: криві Безьє, про які ми говоритимемо, ви зможете побачити далі на графіках. Ці криві розпочинаються у + якійсь певній точці, і закінчуються у якійсь певній точці. Їх кривизна залежить від однієї або кількох "проміжних" контрольних точок. + Зараз, оскільки всі графіки на цій сторінці інтерактивні, поекспериментуйте трохи з цими кривими. Клікніть на точку мишкою й потягніть - + так ви зможете відчути, як форма кривої змінюється в залежності від ваших дій. +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    -
    - -

    Preface

    -

    In order to draw things in 2D, we usually rely on lines, which typically get classified into two categories: straight lines, and curves. The first of these are as easy to draw as they are easy to make a computer draw. Give a computer the first and last point in the line, and BAM! straight line. No questions asked.

    -

    Curves, however, are a much bigger problem. While we can draw curves with ridiculous ease freehand, computers are a bit handicapped in that they can't draw curves unless there is a mathematical function that describes how it should be drawn. In fact, they even need this for straight lines, but the function is ridiculously easy, so we tend to ignore that as far as computers are concerned; all lines are "functions", regardless of whether they're straight or curves. However, that does mean that we need to come up with fast-to-compute functions that lead to nice looking curves on a computer. There are a number of these, and in this article we'll focus on a particular function that has received quite a bit of attention and is used in pretty much anything that can draw curves: Bézier curves.

    -

    They're named after Pierre Bézier, who is principally responsible for making them known to the world as a curve well-suited for design work (publishing his investigations in 1962 while working for Renault), although he was not the first, or only one, to "invent" these type of curves. One might be tempted to say that the mathematician Paul de Casteljau was first, as he began investigating the nature of these curves in 1959 while working at Citroën, and came up with a really elegant way of figuring out how to draw them. However, de Casteljau did not publish his work, making the question "who was first" hard to answer in any absolute sense. Or is it? Bézier curves are, at their core, "Bernstein polynomials", a family of mathematical functions investigated by Sergei Natanovich Bernstein, whose publications on them date back at least as far as 1912.

    -

    Anyway, that's mostly trivia, what you are more likely to care about is that these curves are handy: you can link up multiple Bézier curves so that the combination looks like a single curve. If you've ever drawn Photoshop "paths" or worked with vector drawing programs like Flash, Illustrator or Inkscape, those curves you've been drawing are Bézier curves.

    -

    But what if you need to program them yourself? What are the pitfalls? How do you draw them? What are the bounding boxes, how do you determine intersections, how can you extrude a curve, in short: how do you do everything that you might want to do with these curves? That's what this page is for. Prepare to be mathed!

    -
    - -

    Virtually all Bézier graphics are interactive.

    -

    This page uses interactive examples, relying heavily on Bezier.js, as well as maths formulae which are typeset into SVG using the XeLaTeX typesetting system and pdf2svg by David Barton.

    -

    This book is open source.

    -

    This book is an open source software project, and lives on two github repositories. The first is https://github.com/pomax/bezierinfo and is the purely-for-presentation version you are viewing right now. The other repository is https://github.com/pomax/BezierInfo-2, which is the development version, housing all the code that gets turned into the web version, and is also where you should file issues if you find bugs or have ideas on what to change or add to the primer.

    -

    How complicated is the maths going to be?

    -

    Most of the mathematics in this Primer are early high school maths. If you understand basic arithmetic, and you know how to read English, you should be able to get by just fine. There will at times be far more complicated maths, but if you don't feel like digesting them, you can safely skip over them by either skipping over the "detail boxes" in section or by just jumping to the end of a section with maths that looks too involving. The end of sections typically simply list the conclusions so you can just work with those values directly.

    -

    What language is all this example code in?

    -

    There are way too many programming languages to favour one of all others, soo all the example code in this Primer uses a form of pseudo-code that uses a syntax that's close enough to, but not actually, modern scripting languages like JS, Python, etc. That means you won't be able to copy-paste any of it without giving it any thought, but that's intentional: if you're reading this primer, presumably you want to learn, and you don't learn by copy-pasting. You learn by doing things yourself, making mistakes, and then fixing those mistakes. Now, of course, I didn't intentionally add errors in the example code just to trick you into making mistakes (that would be horrible!) but I did intentionally keep the code from favouring one programming language over another. Don't worry though, if you know even a single procedural programming language, you should be able to read the examples without any difficulties.

    -

    Questions, comments:

    -

    If you have suggestions for new sections, hit up the Github issue tracker (also reachable from the repo linked to in the upper right). If you have questions about the material, there's currently no comment section while I'm doing the rewrite, but you can use the issue tracker for that as well. Once the rewrite is done, I'll add a general comment section back in, and maybe a more topical "select this section of text and hit the 'question' button to ask a question about it" system. We'll see.

    -

    Help support the book!

    -

    If you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 100-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing!

    -
    -
    -
    -

    Що нового?

    -

    Цей підручник постійно розвививається, тож залежно від того, коли ви востаннє його переглядали, тут можуть бути оновлення. Перейдіть за цим посиланням, щоб побачити, що було додано. (Також доступний RSS-канал)

    - - - -

    November 2020

    • Added a section on finding curve/circle intersections

      -
    -

    October 2020

    • Added the Ukranian locale! Help out in getting its localization to 100%!

      -
    -

    August-September 2020

    • Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS turned on.

      -
    -

    June 2020

    • Added automatic CI/CD using Github Actions

      -
    -

    January 2020

    • Added reset buttons to all graphics

      -
    • -
    • Updated to preface to correctly describe the on-page maths

      -
    • -
    • Fixed the Catmull-Rom section because it had glaring maths errors

      -
    -

    August 2019

    • Added a section on (plain) rational Bezier curves

      -
    • -
    • Improved the Graphic component to allow for sliders

      -
    -

    December 2018

    • Added a section on curvature and calculating kappa.

      -
    • -
    • Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this site!

      -
    -

    August 2018

    • Added a section on finding a curve's y, if all you have is the x coordinate.

      -
    -

    July 2018

    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

      -
    • -
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

      -
    • -
    • (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      -
    -

    June 2018

    • Added a section on direct curve fitting.

      -
    • -
    • Added source links for all graphics.

      -
    • -
    • Added this "What's new?" section.

      -
    -

    April 2017

    • Added a section on 3d normals.

      -
    • -
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

      -
    -

    February 2017

    • Finished rewriting the entire codebase for localization.

      -
    -

    January 2016

    • Added a section to explain the Bezier interval.

      -
    • -
    • Rewrote the Primer as a React application.

      -
    -

    December 2015

    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

      -
    • -
    • Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics.

      -
    -

    May 2015

    • Switched over to pure JS rather than Processing-through-Processing.js

      -
    • -
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

      -
    -

    April 2015

    • Added a section on arc length approximations.

      -
    -

    February 2015

    • Added a section on the canonical cubic Bezier form.

      -
    -

    November 2014

    • Switched to HTTPS.

      -
    -

    July 2014

    • Added the section on arc approximation.

      -
    -

    April 2014

    • Added the section on Catmull-Rom fitting.

      -
    -

    November 2013

    • Added the section on Catmull-Rom / Bezier conversion.

      -
    • -
    • Added the section on Bezier cuves as matrices.

      -
    -

    April 2013

    • Added a section on poly-Beziers.

      -
    • -
    • Added a section on boolean shape operations.

      -
    -

    March 2013

    • First drastic rewrite.

      -
    • -
    • Added sections on circle approximations.

      -
    • -
    • Added a section on projecting a point onto a curve.

      -
    • -
    • Added a section on tangents and normals.

      -
    • -
    • Added Legendre-Gauss numerical data tables.

      -
    -

    October 2011

    • First commit for the bezierinfo site, based on the pre-Primer webpage that covered the basics of Bezier curves in HTML with Processing.js examples.

      -
    -
    -
    -

    Короткий вступ

    -

    Давайте розпочнемо з добрих новин: криві Безьє, про які ми говоритимемо, ви зможете побачити далі на графіках. Ці криві розпочинаються у якійсь певній точці, і закінчуються у якійсь певній точці. Їх кривизна залежить від однієї або кількох "проміжних" контрольних точок. Зараз, оскільки всі графіки на цій сторінці інтерактивні, поекспериментуйте трохи з цими кривими. Клікніть на точку мишкою й потягніть - так ви зможете відчути, як форма кривої змінюється в залежності від ваших дій.

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    - -

    Ці криві інтенсивно використовуються у системах автоматизованого проектування та виробництва (CAD/CAM), а також у програмах для графічного дизайну, таких як Adobe Illustrator, Photoshop, Inkscape, GIMP, тощо. Також криві Безьє використовуються у графічних технологіях, таких як масштабована векторна графіка (SVG) та шрифти OpenType (TTF/OTF). Криві Безьє використовуються багато де, тому якщо хочете дізнатись про них більше, приготуйтесь трохи повчитися!

    - -
    -
    -

    So what makes a Bézier Curve?

    -

    Playing with the points for curves may have given you a feel for how Bézier curves behave, but what are Bézier curves, really? There are two ways to explain what a Bézier curve is, and they turn out to be the entirely equivalent, but one of them uses complicated maths, and the other uses really simple maths. So... let's start with the simple explanation:

    -

    Bézier curves are the result of linear interpolations. That sounds complicated but you've been doing linear interpolation since you were very young: any time you had to point at something between two other things, you've been applying linear interpolation. It's simply "picking a point between two points".

    -

    If we know the distance between those two points, and we want a new point that is, say, 20% the distance away from the first point (and thus 80% the distance away from the second point) then we can compute that really easily:

    - - -

    So let's look at that in action: the following graphic is interactive in that you can use your up and down arrow keys to increase or decrease the interpolation ratio, to see what happens. We start with three points, which gives us two lines. Linear interpolation over those lines gives us two points, between which we can again perform linear interpolation, yielding a single point. And that point —and all points we can form in this way for all ratios taken together— form our Bézier curve:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    And that brings us to the complicated maths: calculus.

    -

    While it doesn't look like that's what we've just done, we actually just drew a quadratic curve, in steps, rather than in a single go. One of the fascinating parts about Bézier curves is that they can both be described in terms of polynomial functions, as well as in terms of very simple interpolations of interpolations of [...]. That, in turn, means we can look at what these curves can do based on both "real maths" (by examining the functions, their derivatives, and all that stuff), as well as by looking at the "mechanical" composition (which tells us, for instance, that a curve will never extend beyond the points we used to construct it).

    -

    So let's start looking at Bézier curves a bit more in depth: their mathematical expressions, the properties we can derive from them, and the various things we can do to, and with, Bézier curves.

    - -
    -
    -

    The mathematics of Bézier curves

    -

    Bézier curves are a form of "parametric" function. Mathematically speaking, parametric functions are cheats: a "function" is actually a well defined term representing a mapping from any number of inputs to a single output. Numbers go in, a single number comes out. Change the numbers that go in, and the number that comes out is still a single number.

    -

    Parametric functions cheat. They basically say "alright, well, we want multiple values coming out, so we'll just use more than one function". An illustration: Let's say we have a function that maps some value, let's call it x, to some other value, using some kind of number manipulation:

    - - -

    The notation f(x) is the standard way to show that it's a function (by convention called f if we're only listing one) and its output changes based on one variable (in this case, x). Change x, and the output for f(x) changes.

    -

    So far, so good. Now, let's look at parametric functions, and how they cheat. Let's take the following two functions:

    - - -

    There's nothing really remarkable about them, they're just a sine and cosine function, but you'll notice the inputs have different names. If we change the value for a, we're not going to change the output value for f(b), since a isn't used in that function. Parametric functions cheat by changing that. In a parametric function all the different functions share a variable, like this:

    - - -

    Multiple functions, but only one variable. If we change the value for t, we change the outcome of both fa(t) and fb(t). You might wonder how that's useful, and the answer is actually pretty simple: if we change the labels fa(t) and fb(t) with what we usually mean with them for parametric curves, things might be a lot more obvious:

    - - -

    There we go. x/y coordinates, linked through some mystery value t.

    -

    So, parametric curves don't define a y coordinate in terms of an x coordinate, like normal functions do, but they instead link the values to a "control" variable. If we vary the value of t, then with every change we get two values, which we can use as (x,y) coordinates in a graph. The above set of functions, for instance, generates points on a circle: We can range t from negative to positive infinity, and the resulting (x,y) coordinates will always lie on a circle with radius 1 around the origin (0,0). If we plot it for t from 0 to 5, we get this:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Bézier curves are just one out of the many classes of parametric functions, and are characterised by using the same base function for all of the output values. In the example we saw above, the x and y values were generated by different functions (one uses a sine, the other a cosine); but Bézier curves use the "binomial polynomial" for both the x and y outputs. So what are binomial polynomials?

    -

    You may remember polynomials from high school. They're those sums that look like this:

    - - -

    If the highest order term they have is , they're called "cubic" polynomials; if it's , it's a "square" polynomial; if it's just x, it's a line (and if there aren't even any terms with x it's not a polynomial!)

    -

    Bézier curves are polynomials of t, rather than x, with the value for t being fixed between 0 and 1, with coefficients a, b etc. taking the "binomial" form, which sounds fancy but is actually a pretty simple description for mixing values:

    - - -

    I know what you're thinking: that doesn't look too simple! But if we remove t and add in "times one", things suddenly look pretty easy. Check out these binomial terms:

    - - -

    Notice that 2 is the same as 1+1, and 3 is 2+1 and 1+2, and 6 is 3+3... As you can see, each time we go up a dimension, we simply start and end with 1, and everything in between is just "the two numbers above it, added together", giving us a simple number sequence known as Pascal's triangle. Now that's easy to remember.

    -

    There's an equally simple way to figure out how the polynomial terms work: if we rename (1-t) to a and t to b, and remove the weights for a moment, we get this:

    - - -

    It's basically just a sum of "every combination of a and b", progressively replacing a's with b's after every + sign. So that's actually pretty simple too. So now you know binomial polynomials, and just for completeness I'm going to show you the generic function for this:

    - - -

    And that's the full description for Bézier curves. Σ in this function indicates that this is a series of additions (using the variable listed below the Σ, starting at ...=<value> and ending at the value listed on top of the Σ).

    -
    + +

    + And that's the full description for Bézier curves. Σ in this function indicates that this is a series of additions (using the variable + listed below the Σ, starting at ...=<value> and ending at the value listed on top of the Σ). +

    +
    +

    How to implement the basis function

    +

    We could naively implement the basis function as a mathematical construct, using the function as our guide, like this:

    -

    How to implement the basis function

    -

    We could naively implement the basis function as a mathematical construct, using the function as our guide, like this:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    I say we could, because we're not going to: the factorial function is incredibly expensive. And, as we can see from the above explanation, we can actually create Pascal's triangle quite easily without it: just start at [1], then [1,1], then [1,2,1], then [1,3,3,1], and so on, with each next row fitting 1 more number than the previous row, starting and ending with "1", with all the numbers in between being the sum of the previous row's elements on either side "above" the one we're computing.

    -

    We can generate this as a list of lists lightning fast, and then never have to compute the binomial terms because we have a lookup table:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    - -

    So what's going on here? First, we declare a lookup table with a size that's reasonably large enough to accommodate most lookups. Then, we declare a function to get us the values we need, and we make sure that if an n/k pair is requested that isn't in the LUT yet, we expand it first. Our basis function now looks like this:

    + return lut[n][k] + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + - - - - -
    1 - -
    2
    3
    4
    5
    - -

    Perfect. Of course, we can optimize further. For most computer graphics purposes, we don't need arbitrary curves (although we will also provide code for arbitrary curves in this primer); we need quadratic and cubic curves, and that means we can drastically simplify the code:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    And now we know how to program the basis function. Excellent.

    -
    + return mt3 + 3*mt2*t + 3*mt*t2 + t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    So, now we know what the basis function looks like, time to add in the magic that makes Bézier curves so special: control points.

    +

    And now we know how to program the basis function. Excellent.

    +
    -
    -
    -

    Controlling Bézier curvatures

    -

    Bézier curves are, like all "splines", interpolation functions. This means that they take a set of points, and generate values somewhere "between" those points. (One of the consequences of this is that you'll never be able to generate a point that lies outside the outline for the control points, commonly called the "hull" for the curve. Useful information!). In fact, we can visualize how each point contributes to the value generated by the function, so we can see which points are important, where, in the curve.

    -

    The following graphs show the interpolation functions for quadratic and cubic curves, with "S" being the strength of a point's contribution to the total sum of the Bézier function. Click-and-drag to see the interpolation percentages for each curve-defining point at a specific t value.

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - -
    +

    So, now we know what the basis function looks like, time to add in the magic that makes Bézier curves so special: control points.

    +
    +
    +

    + + Controlling Bézier curvatures +

    +

    + Bézier curves are, like all "splines", interpolation functions. This means that they take a set of points, and generate values somewhere + "between" those points. (One of the consequences of this is that you'll never be able to generate a point that lies outside the outline + for the control points, commonly called the "hull" for the curve. Useful information!). In fact, we can visualize how each point + contributes to the value generated by the function, so we can see which points are important, where, in the curve. +

    +

    + The following graphs show the interpolation functions for quadratic and cubic curves, with "S" being the strength of a point's + contribution to the total sum of the Bézier function. Click-and-drag to see the interpolation percentages for each curve-defining point at + a specific t value. +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + + + Скрипти вимкнено. показує резервний. + + + + + + + + Скрипти вимкнено. показує резервний. + + + + + +
    -

    Also shown is the interpolation function for a 15th order Bézier function. As you can see, the start and end point contribute considerably more to the curve's shape than any other point in the control point set.

    -

    If we want to change the curve, we need to change the weights of each point, effectively changing the interpolations. The way to do this is about as straightforward as possible: just multiply each point with a value that changes its strength. These values are conventionally called "weights", and we can add them to our original Bézier function:

    - - -

    That looks complicated, but as it so happens, the "weights" are actually just the coordinate values we want our curve to have: for an nth order curve, w0 is our start coordinate, wn is our last coordinate, and everything in between is a controlling coordinate. Say we want a cubic curve that starts at (110,150), is controlled by (25,190) and (210,250) and ends at (210,30), we use this Bézier curve:

    - - -

    Which gives us the curve we saw at the top of the article:

    - - - Скрипти вимкнено. показує резервний. - - - -

    What else can we do with Bézier curves? Quite a lot, actually. The rest of this article covers a multitude of possible operations and algorithms that we can apply, and the tasks they achieve.

    -
    + +

    Which gives us the curve we saw at the top of the article:

    + + + Скрипти вимкнено. показує резервний. + + + +

    + What else can we do with Bézier curves? Quite a lot, actually. The rest of this article covers a multitude of possible operations and + algorithms that we can apply, and the tasks they achieve. +

    +
    +

    How to implement the weighted basis function

    +

    Given that we already know how to implement basis function, adding in the control points is remarkably easy:

    -

    How to implement the weighted basis function

    -

    Given that we already know how to implement basis function, adding in the control points is remarkably easy:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    And now for the optimized versions:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    And now we know how to program the weighted basis function.

    -
    + return w[0]*mt3 + 3*w[1]*mt2*t + 3*w[2]*mt*t2 + w[3]*t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -
    -
    -

    Controlling Bézier curvatures, part 2: Rational Béziers

    -

    We can further control Bézier curves by "rationalising" them: that is, adding a "ratio" value in addition to the weight value discussed in the previous section, thereby gaining control over "how strongly" each coordinate influences the curve.

    -

    Adding these ratio values to the regular Bézier curve function is fairly easy. Where the regular function is the following:

    - - -

    The function for rational Bézier curves has two more terms:

    - - -

    In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just using different weight. So far, nothing too special.

    -

    However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights.

    -

    This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly describing circles (which we'll see in a later section is literally impossible using standard Bézier curves).

    -

    But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects what gets drawn:

    - - - Скрипти вимкнено. показує резервний. - - - - - - - - -

    You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence relative to all other points.

    -
    + +

    + In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] + then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just + using different weight. So far, nothing too special. +

    +

    + However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a + fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the + "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights. +

    +

    + This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve + that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly + describing circles (which we'll see in a later section is literally impossible using standard Bézier curves). +

    +

    + But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an + interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with + ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate + exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects + what gets drawn: +

    + + + Скрипти вимкнено. показує резервний. + + + + + + + + +

    + You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will + want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like + with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence + relative to all other points. +

    +
    +

    How to implement rational curves

    +

    Extending the code of the previous section to include ratios is almost trivial:

    -

    How to implement rational curves

    -

    Extending the code of the previous section to include ratios is almost trivial:

    - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    - -

    And that's all we have to do.

    -
    + return (f[0] * w[0] + f[1] * w[1] + f[2] * w[2] + f[3] * w[3])/basis + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + -
    -
    -

    The Bézier interval [0,1]

    -

    Now that we know the mathematics behind Bézier curves, there's one curious thing that you may have noticed: they always run from t=0 to t=1. Why that particular interval?

    -

    It all has to do with how we run from "the start" of our curve to "the end" of our curve. If we have a value that is a mixture of two other values, then the general formula for this is:

    - - -

    The obvious start and end values here need to be a=1, b=0, so that the mixed value is 100% value 1, and 0% value 2, and a=0, b=1, so that the mixed value is 0% value 1 and 100% value 2. Additionally, we don't want "a" and "b" to be independent: if they are, then we could just pick whatever values we like, and end up with a mixed value that is, for example, 100% value 1 and 100% value 2. In principle that's fine, but for Bézier curves we always want mixed values between the start and end point, so we need to make sure we can never set "a" and "b" to some values that lead to a mix value that sums to more than 100%. And that's easy:

    - - -

    With this we can guarantee that we never sum above 100%. By restricting a to values in the interval [0,1], we will always be somewhere between our two values (inclusively), and we will always sum to a 100% mix.

    -

    But... what if we use this form, which is based on the assumption that we will only ever use values between 0 and 1, and instead use values outside of that interval? Do things go horribly wrong? Well... not really, but we get to "see more".

    -

    In the case of Bézier curves, extending the interval simply makes our curve "keep going". Bézier curves are simply segments of some polynomial curve, so if we pick a wider interval we simply get to see more of the curve. So what do they look like?

    -

    The following two graphics show you Bézier curves rendered "the usual way", as well as the curves they "lie on" if we were to extend the t values much further. As you can see, there's a lot more "shape" hidden in the rest of the curve, and we can model those parts by moving the curve points around.

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    + +

    + With this we can guarantee that we never sum above 100%. By restricting a to values in the interval [0,1], we will always be + somewhere between our two values (inclusively), and we will always sum to a 100% mix. +

    +

    + But... what if we use this form, which is based on the assumption that we will only ever use values between 0 and 1, and instead use + values outside of that interval? Do things go horribly wrong? Well... not really, but we get to "see more". +

    +

    + In the case of Bézier curves, extending the interval simply makes our curve "keep going". Bézier curves are simply segments of some + polynomial curve, so if we pick a wider interval we simply get to see more of the curve. So what do they look like? +

    +

    + The following two graphics show you Bézier curves rendered "the usual way", as well as the curves they "lie on" if we were to extend the + t values much further. As you can see, there's a lot more "shape" hidden in the rest of the curve, and we can model those + parts by moving the curve points around. +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    -

    In fact, there are curves used in graphics design and computer modelling that do the opposite of Bézier curves; rather than fixing the interval, and giving you freedom to choose the coordinates, they fix the coordinates, but give you freedom over the interval. A great example of this is the "Spiro" curve, which is a curve based on part of a Cornu Spiral, also known as Euler's Spiral. It's a very aesthetically pleasing curve and you'll find it in quite a few graphics packages like FontForge and Inkscape. It has even been used in font design, for example for the Inconsolata typeface.

    - -
    -
    -

    Bézier curvatures as matrix operations

    -

    We can also represent Bézier curves as matrix operations, by expressing the Bézier formula as a polynomial basis function and a coefficients matrix, and the actual coordinates as a matrix. Let's look at what this means for the cubic curve, using P... to refer to coordinate values "in one or more dimensions":

    - - -

    Disregarding our actual coordinates for a moment, we have:

    - - -

    We can write this as a sum of four expressions:

    - - -

    And we can expand these expressions:

    - - -

    Furthermore, we can make all the 1 and 0 factors explicit:

    - - -

    And that, we can view as a series of four matrix operations:

    - - -

    If we compact this into a single matrix operation, we get:

    - - -

    This kind of polynomial basis representation is generally written with the bases in increasing order, which means we need to flip our t matrix horizontally, and our big "mixing" matrix upside down:

    - - -

    And then finally, we can add in our original coordinates as a single third matrix:

    - - -

    We can perform the same trick for the quadratic curve, in which case we end up with:

    - - -

    If we plug in a t value, and then multiply the matrices, we will get exactly the same values as when we evaluate the original polynomial function, or as when we evaluate the curve using progressive linear interpolation.

    -

    So: why would we bother with matrices? Matrix representations allow us to discover things about functions that would otherwise be hard to tell. It turns out that the curves form triangular matrices, and they have a determinant equal to the product of the actual coordinates we use for our curve. It's also invertible, which means there's a ton of properties that are all satisfied. Of course, the main question is "why is this useful to us now?", and the answer to that is that it's not immediately useful, but you'll be seeing some instances where certain curve properties can be either computed via function manipulation, or via clever use of matrices, and sometimes the matrix approach can be (drastically) faster.

    -

    So for now, just remember that we can represent curves this way, and let's move on.

    + +

    + If we plug in a t value, and then multiply the matrices, we will get exactly the same values as when we evaluate the original + polynomial function, or as when we evaluate the curve using progressive linear interpolation. +

    +

    + So: why would we bother with matrices? Matrix representations allow us to discover things about functions that would + otherwise be hard to tell. It turns out that the curves form + triangular matrices, and they have a determinant equal to the product of the + actual coordinates we use for our curve. It's also invertible, which means there's + a ton of properties that are all satisfied. Of + course, the main question is "why is this useful to us now?", and the answer to that is that it's not immediately useful, but + you'll be seeing some instances where certain curve properties can be either computed via function manipulation, or via clever use of + matrices, and sometimes the matrix approach can be (drastically) faster. +

    +

    So for now, just remember that we can represent curves this way, and let's move on.

    +
    +
    +

    + + de Casteljau's algorithm +

    +

    + If we want to draw Bézier curves, we can run through all values of t from 0 to 1 and then compute the weighted basis function + at each value, getting the x/y values we need to plot. Unfortunately, the more complex the curve gets, the more expensive + this computation becomes. Instead, we can use de Casteljau's algorithm to draw curves. This is a geometric approach to curve + drawing, and it's really easy to implement. So easy, in fact, you can do it by hand with a pencil and ruler. +

    +

    Rather than using our calculus function to find x/y values for t, let's do this instead:

    +
      +
    • treat t as a ratio (which it is). t=0 is 0% along a line, t=1 is 100% along a line.
    • +
    • Take all lines between the curve's defining points. For an order n curve, that's n lines.
    • +
    • + Place markers along each of these line, at distance t. So if t is 0.2, place the mark at 20% from the start, + 80% from the end. +
    • +
    • Now form lines between those points. This gives n-1 lines.
    • +
    • Place markers along each of these line at distance t.
    • +
    • Form lines between those points. This'll be n-2 lines.
    • +
    • Place markers, form lines, place markers, etc.
    • +
    • + Repeat this until you have only one line left. The point t on that line coincides with the original curve point at + t. +
    • +
    +

    + To see this in action, move the slider for the following sketch to changes which curve point is explicitly evaluated using de Casteljau's + algorithm. +

    + + + Скрипти вимкнено. показує резервний. + + + + + +
    +

    How to implement de Casteljau's algorithm

    +

    + Let's just use the algorithm we just specified, and implement that as a function that can take a list of curve-defining points, and a + t value, and draws the associated point on the curve for that t value: +

    -
    -
    -

    de Casteljau's algorithm

    -

    If we want to draw Bézier curves, we can run through all values of t from 0 to 1 and then compute the weighted basis function at each value, getting the x/y values we need to plot. Unfortunately, the more complex the curve gets, the more expensive this computation becomes. Instead, we can use de Casteljau's algorithm to draw curves. This is a geometric approach to curve drawing, and it's really easy to implement. So easy, in fact, you can do it by hand with a pencil and ruler.

    -

    Rather than using our calculus function to find x/y values for t, let's do this instead:

    -
      -
    • treat t as a ratio (which it is). t=0 is 0% along a line, t=1 is 100% along a line.
    • -
    • Take all lines between the curve's defining points. For an order n curve, that's n lines.
    • -
    • Place markers along each of these line, at distance t. So if t is 0.2, place the mark at 20% from the start, 80% from the end.
    • -
    • Now form lines between those points. This gives n-1 lines.
    • -
    • Place markers along each of these line at distance t.
    • -
    • Form lines between those points. This'll be n-2 lines.
    • -
    • Place markers, form lines, place markers, etc.
    • -
    • Repeat this until you have only one line left. The point t on that line coincides with the original curve point at t.
    • -
    -

    To see this in action, move the slider for the following sketch to changes which curve point is explicitly evaluated using de Casteljau's algorithm.

    - - - Скрипти вимкнено. показує резервний. - - - - - -
    - -

    How to implement de Casteljau's algorithm

    -

    Let's just use the algorithm we just specified, and implement that as a function that can take a list of curve-defining points, and a t value, and draws the associated point on the curve for that t value:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    And done, that's the algorithm implemented. Although: usually you don't get the luxury of overloading the "+" operator, so let's also give the code for when you need to work with x and y values separately:

    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So what does this do? This draws a point, if the passed list of points is only 1 point long. Otherwise it will create a new list of points that sit at the t ratios (i.e. the "markers" outlined in the above algorithm), and then call the draw function for this new list.

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + +

    + So what does this do? This draws a point, if the passed list of points is only 1 point long. Otherwise it will create a new list of + points that sit at the t ratios (i.e. the "markers" outlined in the above algorithm), and then call the draw function for this + new list. +

    + +
    +
    +

    + + Simplified drawing +

    +

    + We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, + a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines. +

    +

    + We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the + number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we + can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we + lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or + curvature alignment. +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + + + Скрипти вимкнено. показує резервний. + + + + + +
    -
    -
    -

    Simplified drawing

    -

    We can also simplify the drawing process by "sampling" the curve at certain points, and then joining those points up with straight lines, a process known as "flattening", as we are reducing a curve to a simple sequence of straight, "flat" lines.

    -

    We can do this is by saying "we want X segments", and then sampling the curve at intervals that are spaced such that we end up with the number of segments we wanted. The advantage of this method is that it's fast: instead of evaluating 100 or even 1000 curve coordinates, we can sample a much lower number and still end up with a curve that sort-of-kind-of looks good enough. The disadvantage of course is that we lose the precision of working with "the real curve", so we usually can't use the flattened for doing true intersection detection, or curvature alignment.

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - -
    +

    + Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. + You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the + cubic curve), a higher number is required to capture the curvature changes properly. +

    +
    +

    How to implement curve flattening

    +

    Let's just use the algorithm we just specified, and implement that:

    -

    Try clicking on the sketch and using your up and down arrow keys to lower the number of segments for both the quadratic and cubic curve. You'll notice that for certain curvatures, a low number of segments works quite well, but for more complex curvatures (try this for the cubic curve), a higher number is required to capture the curvature changes properly.

    -
    - -

    How to implement curve flattening

    -

    Let's just use the algorithm we just specified, and implement that:

    - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    And done, that's the algorithm implemented. That just leaves drawing the resulting "curve" as a sequence of lines:

    + return coordinates; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

    -
    + coord = _coord + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -
    -
    -

    Splitting curves

    -

    Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point.

    - - - Скрипти вимкнено. показує резервний. - - - - - -
    +

    We start with the first coordinate as reference point, and then just draw lines between each point and its next point.

    +
    +
    +
    +

    + + Splitting curves +

    +

    + Using de Casteljau's algorithm, we can also find all the points we need to split up a Bézier curve into two, smaller curves, which taken + together form the original curve. When we construct de Casteljau's skeleton for some value t, the procedure gives us all the + points we need to split a curve at that t value: one curve is defined by all the inside skeleton points found prior to our + on-curve point, with the other curve being defined by all the inside skeleton points after our on-curve point. +

    + + + Скрипти вимкнено. показує резервний. + + + + + +
    +

    implementing curve splitting

    +

    We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

    -

    implementing curve splitting

    -

    We can implement curve splitting by bolting some extra logging onto the de Casteljau function:

    - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    - -

    After running this function for some value t, the left and right arrays will contain all the coordinates for two new curves - one to the "left" of our t value, the other on the "right". These new curves will have the same order as the original curve, and can be overlaid exactly on the original curve.

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + -
    -
    -

    Splitting curves using matrices

    -

    Another way to split curves is to exploit the matrix representation of a Bézier curve. In the section on matrices, we saw that we can represent curves as matrix multiplications. Specifically, we saw these two forms for the quadratic and cubic curves respectively: (we'll reverse the Bézier coefficients vector for legibility)

    - - -

    and

    - - -

    Let's say we want to split the curve at some point t = z, forming two new (obviously smaller) Bézier curves. To find the coordinates for these two Bézier curves, we can use the matrix representation and some linear algebra. First, we separate out the actual "point on the curve" information into a new matrix multiplication:

    - - -

    and

    - - -

    If we could compact these matrices back to the form [t values] · [Bézier matrix] · [column matrix], with the first two staying the same, then that column matrix on the right would be the coordinates of a new Bézier curve that describes the first segment, from t = 0 to t = z. As it turns out, we can do this quite easily, by exploiting some simple rules of linear algebra (and if you don't care about the derivations, just skip to the end of the box for the results!).

    -
    - -

    Deriving new hull coordinates

    -

    Deriving the two segments upon splitting a curve takes a few steps, and the higher the curve order, the more work it is, so let's look at the quadratic curve first:

    - - - - - - - - -

    We can do this because [M · M-1] is the identity matrix. It's a bit like multiplying something by x/x in calculus: it doesn't do anything to the function, but it does allow you to rewrite it to something that may be easier to work with, or can be broken up differently. In the same way, multiplying our matrix by [M · M-1] has no effect on the total formula, but it does allow us to change the matrix sequence [something · M] to a sequence [M · something], and that makes a world of difference: if we know what [M-1 · Z · M] is, we can apply that to our coordinates, and be left with a proper matrix representation of a quadratic Bézier curve (which is [T · M · P]), with a new set of coordinates that represent the curve from t = 0 to t = z. So let's get computing:

    - - -

    Excellent! Now we can form our new quadratic curve:

    - - - - - - -

    Brilliant: if we want a subcurve from t = 0 to t = z, we can keep the first coordinate the same (which makes sense), our control point becomes a z-ratio mixture of the original control point and the start point, and the new end point is a mixture that looks oddly similar to a Bernstein polynomial of degree two. These new coordinates are actually really easy to compute directly!

    -

    Of course, that's only one of the two curves. Getting the section from t = z to t = 1 requires doing this again. We first observe that in the previous calculation, we actually evaluated the general interval [0,z]. We were able to write it down in a more simple form because of the zero, but what we actually evaluated, making the zero explicit, was:

    - - - - -

    If we want the interval [z,1], we will be evaluating this instead:

    - - - - -

    We're going to do the same trick of multiplying by the identity matrix, to turn [something · M] into [M · something]:

    - - -

    So, our final second curve looks like:

    - - - - - - -

    Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to compute directly!

    -
    + +

    + Nice. We see the same thing as before: we can keep the last coordinate the same (which makes sense); our control point becomes a z-ratio + mixture of the original control point and the end point, and the new start point is a mixture that looks oddly similar to a bernstein + polynomial of degree two, except this time it uses (z-1) rather than (1-z). These new coordinates are also really easy to + compute directly! +

    + -

    So, using linear algebra rather than de Casteljau's algorithm, we have determined that, for any quadratic curve split at some value t = z, we get two subcurves that are described as Bézier curves with simple-to-derive coordinates:

    - - -

    and

    - - -

    We can do the same for cubic curves. However, I'll spare you the actual derivation (don't let that stop you from writing that out yourself, though) and simply show you the resulting new coordinate sets:

    - - -

    and

    - - -

    So, looking at our matrices, did we really need to compute the second segment matrix? No, we didn't. Actually having one segment's matrix means we implicitly have the other: push the values of each row in the matrix Q to the right, with zeroes getting pushed off the right edge and appearing back on the left, and then flip the matrix vertically. Presto, you just "calculated" Q'.

    -

    Implementing curve splitting this way requires less recursion, and is just straight arithmetic with cached values, so can be cheaper on systems where recursion is expensive. If you're doing computation with devices that are good at matrix multiplication, chopping up a Bézier curve with this method will be a lot faster than applying de Casteljau.

    - -
    -
    -

    Lowering and elevating curve order

    -

    One interesting property of Bézier curves is that an nth order curve can always be perfectly represented by an (n+1)th order curve, by giving the higher-order curve specific control points.

    -

    If we have a curve with three points, then we can create a curve with four points that exactly reproduces the original curve. First, we give it the same start and end points, and for its two control points we pick "1/3rd start + 2/3rd control" and "2/3rd control + 1/3rd end". Now we have exactly the same curve as before, except represented as a cubic curve rather than a quadratic curve.

    -

    The general rule for raising an nth order curve to an (n+1)th order curve is as follows (observing that the start and end weights are the same as the start and end weights for the old curve):

    - - -

    However, this rule also has as direct consequence that you cannot generally safely lower a curve from nth order to (n-1)th order, because the control points cannot be "pulled apart" cleanly. We can try to, but the resulting curve will not be identical to the original, and may in fact look completely different.

    -

    However, there is a surprisingly good way to ensure that a lower order curve looks "as close as reasonably possible" to the original curve: we can optimise the "least-squares distance" between the original curve and the lower order curve, in a single operation (also explained over on Sirver's Castle). However, to use it, we'll need to do some calculus work and then switch over to linear algebra. As mentioned in the section on matrix representations, some things can be done much more easily with matrices than with calculus functions, and this is one of those things. So... let's go!

    -

    We start by taking the standard Bézier function, and condensing it a little:

    - - -

    Then, we apply one of those silly (actually, super useful) calculus tricks: since our t value is always between zero and one (inclusive), we know that (1-t) plus t always sums to 1. As such, we can express any value as a sum of t and 1-t:

    - - -

    So, with that seemingly trivial observation, we rewrite that Bézier function by splitting it up into a sum of a (1-t) and t component:

    - - -

    So far so good. Now, to see why we did this, let's write out the (1-t) and t parts, and see what that gives us. I promise, it's about to make sense. We start with (1-t):

    - - -

    So by using this seemingly silly trick, we can suddenly express part of our nth order Bézier function in terms of an (n+1)th order Bézier function. And that sounds a lot like raising the curve order! Of course we need to be able to repeat that trick for the t part, but that's not a problem:

    - - -

    So, with both of those changed from an order n expression to an order (n+1) expression, we can put them back together again. Now, where the order n function had a summation from 0 to n, the order n+1 function uses a summation from 0 to n+1, but this shouldn't be a problem as long as we can add some new terms that "contribute nothing". In the next section on derivatives, there is a discussion about why "higher terms than there is a binomial for" and "lower than zero terms" both "contribute nothing". So as long as we can add terms that have the same form as the terms we need, we can just include them in the summation, they'll sit there and do nothing, and the resulting function stays identical to the lower order curve.

    -

    Let's do this:

    - - -

    And this is where we switch over from calculus to linear algebra, and matrices: we can now express this relation between Bézier(n,t) and Bézier(n+1,t) as a very simple matrix multiplication:

    - - -

    where the matrix M is an n+1 by n matrix, and looks like:

    - - -

    That might look unwieldy, but it's really just a mostly-zeroes matrix, with a very simply fraction on the diagonal, and an even simpler fraction to the left of it. Multiplying a list of coordinates with this matrix means we can plug the resulting transformed coordinates into the one-order-higher function and get an identical looking curve.

    -

    Not too bad!

    -

    Equally interesting, though, is that with this matrix operation established, we can now use an incredibly powerful and ridiculously simple way to find out a "best fit" way to reverse the operation, called the normal equation. What it does is minimize the sum of the square differences between one set of values and another set of values. Specifically, if we can express that as some function A x = b, we can use it. And as it so happens, that's exactly what we're dealing with, so:

    - - -

    The steps taken here are:

    -
      -
    1. We have a function in a form that the normal equation can be used with, so
    2. -
    3. apply the normal equation!
    4. -
    5. Then, we want to end up with just Bn on the left, so we start by left-multiply both sides such that we'll end up with lots of stuff on the left that simplified to "a factor 1", which in matrix maths is the identity matrix.
    6. -
    7. In fact, by left-multiplying with the inverse of what was already there, we've effectively "nullified" (but really, one-inified) that big, unwieldy block into the identity matrix I. So we substitute the mess with I, and then
    8. -
    9. because multiplication with the identity matrix does nothing (like multiplying by 1 does nothing in regular algebra), we just drop it.
    10. -
    -

    And we're done: we now have an expression that lets us approximate an n+1th order curve with a lower nth order curve. It won't be an exact fit, but it's definitely a best approximation. So, let's implement these rules for raising and lowering curve order to a (semi) random curve, using the following graphic. Select the sketch, which has movable control points, and press your up and down arrow keys to raise or lower the curve order.

    - - - Скрипти вимкнено. показує резервний. - - - - - - - -
    -
    -

    Derivatives

    -

    There's a number of useful things that you can do with Bézier curves based on their derivative, and one of the more amusing observations about Bézier curves is that their derivatives are, in fact, also Bézier curves. In fact, the differentiation of a Bézier curve is relatively straightforward, although we do need a bit of math.

    -

    First, let's look at the derivative rule for Bézier curves, which is:

    - - -

    which we can also write (observing that b in this formula is the same as our w weights, and that n times a summation is the same as a summation where each term is multiplied by n) as:

    - - -

    Or, in plain text: the derivative of an nth degree Bézier curve is an (n-1)th degree Bézier curve, with one fewer term, and new weights w'0...w'n-1 derived from the original weights as n(wi+1 - wi). So for a 3rd degree curve, with four weights, the derivative has three new weights: w'0 = 3(w1-w0), w'1 = 3(w2-w1) and w'2 = 3(w3-w2).

    -
    - -

    "Slow down, why is that true?"

    -

    Sometimes just being told "this is the derivative" is nice, but you might want to see why this is indeed the case. As such, let's have a look at the proof for this derivative. First off, the weights are independent of the full Bézier function, so the derivative involves only the derivative of the polynomial basis function. So, let's find that:

    - - -

    Applying the product and chain rules gives us:

    - - -

    Which is hard to work with, so let's expand that properly:

    - - -

    Now, the trick is to turn this expression into something that has binomial coefficients again, so we want to end up with things that look like "x! over y!(x-y)!". If we can do that in a way that involves terms of n-1 and k-1, we'll be on the right track.

    - - -

    And that's the first part done: the two components inside the parentheses are actually regular, lower-order Bézier expressions:

    - - -

    Now to apply this to our weighted Bézier curves. We'll write out the plain curve formula that we saw earlier, and then work our way through to its derivative:

    - - -

    If we expand this (with some color to show how terms line up), and reorder the terms by increasing values for k we see the following:

    - - -

    Two of these terms fall way: the first term falls away because there is no -1st term in a summation. As such, it always contributes "nothing", so we can safely completely ignore it for the purpose of finding the derivative function. The other term is the very last term in this expansion: one involving Bn-1,n. This term would have a binomial coefficient of [i choose i+1], which is a non-existent binomial coefficient. Again, this term would contribute "nothing", so we can ignore it, too. This means we're left with:

    - - -

    And that's just a summation of lower order curves:

    - - -

    We can rewrite this as a normal summation, and we're done:

    - - -
    + + -

    Let's rewrite that in a form similar to our original formula, so we can see the difference. We will first list our original formula for Bézier curves, and then the derivative:

    - - - - -

    What are the differences? In terms of the actual Bézier curve, virtually nothing! We lowered the order (rather than n, it's now n-1), but it's still the same Bézier function. The only real difference is in how the weights change when we derive the curve's function. If we have four points A, B, C, and D, then the derivative will have three points, the second derivative two, and the third derivative one:

    - - -

    We can keep performing this trick for as long as we have more than one weight. Once we have one weight left, the next step will see k = 0, and the result of our "Bézier function" summation is zero, because we're not adding anything at all. As such, a quadratic curve has no second derivative, a cubic curve has no third derivative, and generalized: an nth order curve has n-1 (meaningful) derivatives, with any further derivative being zero.

    - -
    -
    -

    Tangents and normals

    -

    If you want to move objects along a curve, or "away from" a curve, the two vectors you're most interested in are the tangent vector and normal vector for curve points. These are actually really easy to find. For moving and orienting along a curve, we use the tangent, which indicates the direction of travel at specific points, and is literally just the first derivative of our curve:

    - - -

    This gives us the directional vector we want. We can normalize it to give us uniform directional vectors (having a length of 1.0) at each point, and then do whatever it is we want to do based on those directions:

    - - -

    The tangent is very useful for moving along a line, but what if we want to move away from the curve instead, perpendicular to the curve at some point t? In that case we want the normal vector. This vector runs at a right angle to the direction of the curve, and is typically of length 1.0, so all we have to do is rotate the normalized directional vector and we're done:

    - - -
    - -

    Rotating coordinates is actually very easy, if you know the rule for it. You might find it explained as "applying a rotation matrix, which is what we'll look at here, too. Essentially, the idea is to take the circles over which we can rotate, and simply "sliding the coordinates" over these circles by the desired -angle. If we want a quarter circle turn, we take the coordinate, slide it along the circle by a quarter turn, and done.

    -

    To turn any point (x,y) into a rotated point (x',y') (over 0,0) by some angle φ, we apply this nice and easy computation:

    - - -

    Which is the "long" version of the following matrix transformation:

    - - -

    And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1.

    -

    But why does this work? Why this matrix multiplication? Wikipedia (technically, Thomas Herter and Klaus Lott) tells us that a rotation matrix can be -treated as a sequence of three (elementary) shear operations. When we combine this into a single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. DataGenetics have an excellent article about this very thing: it's really quite cool, and I strongly recommend taking a quick break from this primer to read that article.

    -
    + +

    + And that's all we need to rotate any coordinate. Note that for quarter, half, and three-quarter turns these functions become even + easier, since sin and cos for these angles are, respectively: 0 and 1, -1 and 0, and 0 and -1. +

    +

    + But why does this work? Why this matrix multiplication? + Wikipedia (technically, Thomas Herter and Klaus + Lott) tells us that a rotation matrix can be treated as a sequence of three (elementary) shear operations. When we combine this into a + single matrix operation (because all matrix multiplications can be collapsed), we get the matrix that you see above. + DataGenetics have an excellent article about this very thing: it's + really quite cool, and I strongly recommend taking a quick break from this primer to read that article. +

    + -

    The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant).

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    +

    + The following two graphics show the tangent and normal along a quadratic and cubic curve, with the direction vector coloured blue, and the + normal vector coloured red (the markers are spaced out evenly as t-intervals, not spaced equidistant). +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    +
    +
    +

    + + Working with 3D normals +

    +

    + Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things + this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but + for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, + it's not "super hard", but there are more steps involved and we should have a look at those. +

    +

    + Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. + However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, + not a single vector, so we basically need to define what "the" normal is in the 3D case. +

    +

    + The "naïve" approach is to construct what is known as the + Frenet normal, where we follow a simple recipe that works in + many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are + perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each + point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the + local tangent, as well as the tangents "right next to it". +

    +

    + Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the + vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know + lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same + logic we used for the 2D case: "just rotate it 90 degrees". +

    +

    So let's do that! And in a twist surprise, we can do this in four lines:

    +
      +
    • a = normalize(B'(t))
    • +
    • b = normalize(a + B''(t))
    • +
    • r = normalize(b × a)
    • +
    • normal = normalize(r × a)
    • +
    +

    Let's unpack that a little:

    +
      +
    • + We start by taking the normalized vector for the derivative at some point on the + curve. We normalize it so the maths is less work. Less work is good. +
    • +
    • + Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and + just had the same derivative and second derivative from that point on. +
    • +
    • + This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which + means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the + cross product. (Note that while that operation uses the × operator, it's most + definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent + a quarter circle to get our normal, just like we did in the 2D case. +
    • +
    • + Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal + vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second + time, and immediately get our normal vector. +
    • +
    +

    + And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can + move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor + position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc: +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    + However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the + curve" between t=0.65 and t=0.75... Why is it doing that? +

    +

    + As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are + "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute + normals that just... look good. +

    +

    Thankfully, Frenet normals are not our only option.

    +

    + Another option is to take a slightly more algorithmic approach and compute a form of + Rotation Minimising Frame + (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational + axis, and the normal vector, centered on an on-curve point. +

    +

    + These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we + could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at + the same time that you're building lookup tables for your curve. +

    +

    + The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by + applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are: +

    +
      +
    • Take a point on the curve for which we know the RM frame already,
    • +
    • take a next point on the curve for which we don't know the RM frame yet, and
    • +
    • + reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous + points as a "mirror". +
    • +
    • + This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal + that's slightly off-kilter, so: +
    • +
    • + reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror". +
    • +
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • +
    +

    So, let's write some code for that!

    +
    +

    Implementing Rotation Minimising Frames

    +

    + We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it + yields a frame with properties: +

    -
    -
    -

    Working with 3D normals

    -

    Before we move on to the next section we need to spend a little bit of time on the difference between 2D and 3D. While for many things this difference is irrelevant and the procedures are identical (for instance, getting the 3D tangent is just doing what we do for 2D, but for x, y, and z, instead of just for x and y), when it comes to normals things are a little more complex, and thus more work. Mind you, it's not "super hard", but there are more steps involved and we should have a look at those.

    -

    Getting normals in 3D is in principle the same as in 2D: we take the normalised tangent vector, and then rotate it by a quarter turn. However, this is where things get that little more complex: we can turn in quite a few directions, since "the normal" in 3D is a plane, not a single vector, so we basically need to define what "the" normal is in the 3D case.

    -

    The "naïve" approach is to construct what is known as the Frenet normal, where we follow a simple recipe that works in many cases (but does super bizarre things in some others). The idea is that even though there are infinitely many vectors that are perpendicular to the tangent (i.e. make a 90 degree angle with it), the tangent itself sort of lies on its own plane already: since each point on the curve (no matter how closely spaced) has its own tangent vector, we can say that each point lies in the same plane as the local tangent, as well as the tangents "right next to it".

    -

    Even if that difference in tangent vectors is minute, "any difference" is all we need to find out what that plane is - or rather, what the vector perpendicular to that plane is. Which is what we need: if we can calculate that vector, and we have the tangent vector that we know lies on a plane, then we can rotate the tangent vector over the perpendicular, and presto. We have computed the normal using the same logic we used for the 2D case: "just rotate it 90 degrees".

    -

    So let's do that! And in a twist surprise, we can do this in four lines:

    -
      -
    • a = normalize(B'(t))
    • -
    • b = normalize(a + B''(t))
    • -
    • r = normalize(b × a)
    • -
    • normal = normalize(r × a)
    • -
    -

    Let's unpack that a little:

    -
      -
    • We start by taking the normalized vector for the derivative at some point on the curve. We normalize it so the maths is less work. Less work is good.
    • -
    • Then, we compute b which represents what a next point's tangent would be if the curve stopped changing at our point and just had the same derivative and second derivative from that point on.
    • -
    • This lets us find two vectors (the derivative, and the second derivative added to the derivative) that lie on the same plane, which means we can use them to compute a vector perpendicular to that plane, using an elementary vector operation called the cross product. (Note that while that operation uses the × operator, it's most definitely not a multiplication!) The result of that gives us a vector that we can use as the "axis of rotation" for turning the tangent a quarter circle to get our normal, just like we did in the 2D case.
    • -
    • Since the cross product lets us find a vector that is perpendicular to some plane defined by two other vectors, and since the normal vector should be perpendicular to the plane that the tangent and the axis of rotation lie in, we can use the cross product a second time, and immediately get our normal vector.
    • -
    -

    And then we're done, we found "the" normal vector for a 3D curve. Let's see what that looks like for a sample curve, shall we? You can move your cursor across the graphic from left to right, to show the normal at a point with a t value that is based on your cursor position: all the way on the left is 0, all the way on the right = 1, midway is t=0.5, etc:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    However, if you've played with that graphic a bit, you might have noticed something odd. The normal seems to "suddenly twist around the curve" between t=0.65 and t=0.75... Why is it doing that?

    -

    As it turns out, it's doing that because that's how the maths works, and that's the problem with Frenet normals: while they are "mathematically correct", they are "practically problematic", and so for any kind of graphics work what we really want is a way to compute normals that just... look good.

    -

    Thankfully, Frenet normals are not our only option.

    -

    Another option is to take a slightly more algorithmic approach and compute a form of Rotation Minimising Frame (also known as "parallel transport frame" or "Bishop frame") instead, where a "frame" is a set made up of the tangent, the rotational axis, and the normal vector, centered on an on-curve point.

    -

    These type of frames are computed based on "the previous frame", so we cannot simply compute these "on demand" for single points, as we could for Frenet frames; we have to compute them for the entire curve. Thankfully, the procedure is pretty simple, and can be performed at the same time that you're building lookup tables for your curve.

    -

    The idea is to take a starting "tangent/rotation axis/normal" frame at t=0, and then compute what the next frame "should" look like by applying some rules that yield a good looking next frame. In the case of the RMF paper linked above, those rules are:

    -
      -
    • Take a point on the curve for which we know the RM frame already,
    • -
    • take a next point on the curve for which we don't know the RM frame yet, and
    • -
    • reflect the known frame onto the next point, by treating the plane through the curve at the point exactly between the next and previous points as a "mirror".
    • -
    • This gives the next point a tangent vector that's essentially pointing in the opposite direction of what it should be, and a normal that's slightly off-kilter, so:
    • -
    • reflect the vectors of our "mirrored frame" a second time, but this time using the plane through the "next point" itself as "mirror".
    • -
    • Done: the tangent and normal have been fixed, and we have a good looking frame to work with.
    • -
    -

    So, let's write some code for that!

    -
    - -

    Implementing Rotation Minimising Frames

    -

    We first assume we have a function for calculating the Frenet frame at a point, which we already discussed above, inn a way that it yields a frame with properties:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    - -

    Then, we can write a function that generates a sequence of RM frames in the following manner:

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    - -

    Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more code to get much better looking normals.

    -
    + frames.add(x1) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + -

    Speaking of better looking, what does this actually look like? Let's revisit that earlier curve, but this time use rotation minimising frames rather than Frenet frames:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    That looks so much better!

    -

    For those reading along with the code: we don't even strictly speaking need a Frenet frame to start with: we could, for instance, treat the z-axis as our initial axis of rotation, so that our initial normal is (0,0,1) × tangent, and then take things from there, but having that initial "mathematically correct" frame so that the initial normal seems to line up based on the curve's orientation in 3D space is just nice.

    +

    + Ignoring comments, this is certainly more code than when we were just computing a single Frenet frame, but it's not a crazy amount more + code to get much better looking normals. +

    + -
    -
    -

    Component functions

    -

    One of the first things people run into when they start using Bézier curves in their own programs is "I know how to draw the curve, but how do I determine the bounding box?". It's actually reasonably straightforward to do so, but it requires having some knowledge on exploiting math to get the values we need. For bounding boxes, we aren't actually interested in the curve itself, but only in its "extremities": the minimum and maximum values the curve has for its x- and y-axis values. If you remember your calculus (provided you ever took calculus, otherwise it's going to be hard to remember) we can determine function extremities using the first derivative of that function, but this poses a problem, since our function is parametric: every axis has its own function.

    -

    The solution: compute the derivative for each axis separately, and then fit them back together in the same way we do for the original.

    -

    Let's look at how a parametric Bézier curve "splits up" into two normal functions, one for the x-axis and one for the y-axis. Note the leftmost figure is again an interactive curve, without labeled axes (you get coordinates in the graph instead). The center and rightmost figures are the component functions for computing the x-axis value, given a value for t (between 0 and 1 inclusive), and the y-axis value, respectively.

    -

    If you move points in a curve sideways, you should only see the middle graph change; likewise, moving points vertically should only show a change in the right graph.

    - - - Скрипти вимкнено. показує резервний. - - - -

     

    - - - Скрипти вимкнено. показує резервний. - - - - -
    -
    -

    Finding extremities: root finding

    -

    Now that we understand (well, superficially anyway) the component functions, we can find the extremities of our Bézier curve by finding maxima and minima on the component functions, by solving the equation B'(t) = 0. We've already seen that the derivative of a Bézier curve is a simpler Bézier curve, but how do we solve the equality? Fairly easily, actually, until our derivatives are 4th order or higher... then things get really hard. But let's start simple:

    -

    Quadratic curves: linear derivatives.

    -

    The derivative of a quadratic Bézier curve is a linear Bézier curve, interpolating between just two terms, which means finding the solution for "where is this line 0" is effectively trivial by rewriting it to a function of t and solving. First we turn our quadratic Bézier function into a linear one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then we turn this into our solution for t using basic arithmetics:

    - - -

    Done.

    -

    Although with the caveat that if b-a is zero, there is no solution and we probably shouldn't try to perform that division.

    -

    Cubic curves: the quadratic formula.

    -

    The derivative of a cubic Bézier curve is a quadratic Bézier curve, and finding the roots for a quadratic polynomial means we can apply the Quadratic formula. If you've seen it before, you'll remember it, and if you haven't, it looks like this:

    - - -

    So, if we can rewrite the Bézier component function as a plain polynomial, we're done: we just plug in the values into the quadratic formula, check if that square root is negative or not (if it is, there are no roots) and then just compute the two values that come out (because of that plus/minus sign we get two). Any value between 0 and 1 is a root that matters for Bézier curves, anything below or above that is irrelevant (because Bézier curves are only defined over the interval [0,1]). So, how do we convert?

    -

    First we turn our cubic Bézier function into a quadratic one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then, using these v values, we can find out what our a, b, and c should be:

    - - -

    This gives us three coefficients {a, b, c} that are expressed in terms of v values, where the v values are expressions of our original coordinate values, so we can do some substitution to get:

    - - -

    Easy-peasy. We can now almost trivially find the roots by plugging those values into the quadratic formula.

    -

    And as a cubic curve, there is also a meaningful second derivative, which we can compute by simple taking the derivative of the derivative.

    -

    Quartic curves: Cardano's algorithm.

    -

    We haven't really looked at them before now, but the next step up would be a Quartic curve, a fourth degree Bézier curve. As expected, these have a derivative that is a cubic function, and now things get much harder. Cubic functions don't have a "simple" rule to find their roots, like the quadratic formula, and instead require quite a bit of rewriting to a form that we can even start to try to solve.

    -

    Back in the 16th century, before Bézier curves were a thing, and even before calculus itself was a thing, Gerolamo Cardano figured out that even if the general cubic function is really hard to solve, it can be rewritten to a form for which finding the roots is "easier" (even if not "easy"):

    - - -

    We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather than three: this makes things considerably easier to solve because it lets us use regular calculus to find the values that satisfy the equation.

    -

    Now, there is one small hitch: as a cubic function, the solutions may be complex numbers rather than plain numbers... And Cardano realised this, centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number.

    -

    So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at Ken J. Ward's page for solving the cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a read-through.

    -
    + +

    + We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather + than three: this makes things considerably easier to solve because it lets us use + regular calculus to find the values that satisfy the equation. +

    +

    + Now, there is one small hitch: as a cubic function, the solutions may be + complex numbers rather than plain numbers... And Cardano realised this, + centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these + numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we + have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're + going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number. +

    +

    + So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at + Ken J. Ward's page for solving the + cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the + complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a + read-through. +

    +
    +

    Implementing Cardano's algorithm for finding all real roots

    +

    + The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" + number space (denoted with ℝ), this is perfectly fine in the + "complex" number space (denoted with ℂ). And, as it so happens, Cardano is + also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm! +

    -

    Implementing Cardano's algorithm for finding all real roots

    -

    The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" number space (denoted with ℝ), this is perfectly fine in the "complex" number space (denoted with ℂ). And, as it so happens, Cardano is also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm!

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    37
    38
    39
    40
    41
    42
    43
    44
    45
    46
    47
    48
    49
    50
    51
    52
    53
    54
    55
    56
    57
    58
    59
    60
    61
    62
    63
    64
    65
    66
    67
    68
    69
    70
    71
    72
    73
    74
    75
    76
    77
    78
    79
    80
    81
    - -
    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + + 37 + + + 38 + + + 39 + + + 40 + + + 41 + + + 42 + + + 43 + + + 44 + + + 45 + + + 46 + + + 47 + + + 48 + + + 49 + + + 50 + + + 51 + + + 52 + + + 53 + + + 54 + + + 55 + + + 56 + + + 57 + + + 58 + + + 59 + + + 60 + + + 61 + + + 62 + + + 63 + + + 64 + + + 65 + + + 66 + + + 67 + + + 68 + + + 69 + + + 70 + + + 71 + + + 72 + + + 73 + + + 74 + + + 75 + + + 76 + + + 77 + + + 78 + + + 79 + + + 80 + + + 81 + + +
    -

    And that's it. The maths is complicated, but the code is pretty much just "follow the maths, while caching as many values as we can to prevent recomputing things as much as possible" and now we have a way to find all roots for a cubic function and can just move on with using that to find extremities of our curves.

    -

    And of course, as a quartic curve also has meaningful second and third derivatives, we can quite easily compute those by using the derivative of the derivative (of the derivative), just as for cubic curves.

    -

    Quintic and higher order curves: finding numerical solutions

    -

    And this is where thing stop, because we cannot find the roots for polynomials of degree 5 or higher using algebra (a fact known as the Abel–Ruffini theorem). Instead, for occasions like these, where algebra simply cannot yield an answer, we turn to numerical analysis.

    -

    That's a fancy term for saying "rather than trying to find exact answers by manipulating symbols, find approximate answers by describing the underlying process as a combination of steps, each of which can be assigned a number via symbolic manipulation". For example, trying to mathematically compute how much water fits in a completely crazy three dimensional shape is very hard, even if it got you the perfect, precise answer. A much easier approach, which would be less perfect but still entirely useful, would be to just grab a buck and start filling the shape until it was full: just count the number of buckets of water you used. And if we want a more precise answer, we can use smaller buckets.

    -

    So that's what we're going to do here, too: we're going to treat the problem as a sequence of steps, and the smaller we can make each step, the closer we'll get to that "perfect, precise" answer. And as it turns out, there is a really nice numerical root-finding algorithm, called the Newton-Raphson root finding method (yes, after that Newton), which we can make use of. The Newton-Raphson approach consists of taking our impossible-to-solve function f(x), picking some initial value x (literally any value will do), and calculating f(x). We can think of that value as the "height" of the function at x. If that height is zero, we're done, we have found a root. If it isn't, we calculate the tangent line at f(x) and calculate at which x value its height is zero (which we've already seen is very easy). That will give us a new x and we repeat the process until we find a root.

    -

    Mathematically, this means that for some x, at step n=1, we perform the following calculation until fy(x) is zero, so that the next t is the same as the one we already have:

    - - -

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    -

    Now, this works well only if we can pick good starting points, and our curve is continuously differentiable and doesn't have oscillations. Glossing over the exact meaning of those terms, the curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: which starting points do we pick?

    -

    As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as high as you'll ever need to go.

    -

    In conclusion:

    -

    So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red:

    - - - Скрипти вимкнено. показує резервний. - - - -

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    - - - Скрипти вимкнено. показує резервний. - - - + +

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    +

    + Now, this works well only if we can pick good starting points, and our curve is + continuously differentiable and doesn't have + oscillations. Glossing over the exact meaning of those terms, the + curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: + which starting points do we pick? +

    +

    + As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from + t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may + pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: + there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order + curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as + high as you'll ever need to go. +

    +

    In conclusion:

    +

    + So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those + roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red: +

    + + + Скрипти вимкнено. показує резервний. + + +

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    + + + Скрипти вимкнено. показує резервний. + + +
    +
    +

    + + Bounding boxes +

    +

    + If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four + values we need to box in our curve: +

    +

    Computing the bounding box for a Bézier curve:

    +
      +
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. +
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. +
    5. + Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original + functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct. +
    6. +
    +

    + Applying this approach to our previous root finding, we get the following + axis-aligned bounding boxes (with all curve extremity points + shown on the curve): +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    -
    -
    -

    Bounding boxes

    -

    If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four values we need to box in our curve:

    -

    Computing the bounding box for a Bézier curve:

    -
      -
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. -
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. -
    5. Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct.
    6. -
    -

    Applying this approach to our previous root finding, we get the following axis-aligned bounding boxes (with all curve extremity points shown on the curve):

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    - -

    We can construct even nicer boxes by aligning them along our curve, rather than along the x- and y-axis, but in order to do so we first need to look at how aligning works.

    - -
    -
    -

    Aligning curves

    -

    While there are an incredible number of curves we can define by varying the x- and y-coordinates for the control points, not all curves are actually distinct. For instance, if we define a curve, and then rotate it 90 degrees, it's still the same curve, and we'll find its extremities in the same spots, just at different draw coordinates. As such, one way to make sure we're working with a "unique" curve is to "axis-align" it.

    -

    Aligning also simplifies a curve's functions. We can translate (move) the curve so that the first point lies on (0,0), which turns our n term polynomial functions into n-1 term functions. The order stays the same, but we have less terms. Then, we can rotate the curves so that the last point always lies on the x-axis, too, making its coordinate (...,0). This further simplifies the function for the y-component to an n-2 term function. For instance, if we have a cubic curve such as this:

    - - -

    Then translating it so that the first coordinate lies on (0,0), moving all x coordinates by -120, and all y coordinates by -160, gives us:

    - - -

    If we then rotate the curve so that its end point lies on the x-axis, the coordinates (integer-rounded for illustrative purposes here) become:

    - - -

    If we drop all the zero-terms, this gives us:

    - - -

    We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae:

    - - - Скрипти вимкнено. показує резервний. - - - -

     

    - - - Скрипти вимкнено. показує резервний. - - - + +

    + We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning + our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae: +

    + + + Скрипти вимкнено. показує резервний. + + +

     

    + + + Скрипти вимкнено. показує резервний. + + +
    +
    +

    + + Tight bounding boxes +

    +

    + With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, + recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding + box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T. +

    +

    We now have nice tight bounding boxes for our curves:

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    -
    -
    -

    Tight bounding boxes

    -

    With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T.

    -

    We now have nice tight bounding boxes for our curves:

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    - -

    These are, strictly speaking, not necessarily the tightest possible bounding boxes. It is possible to compute the optimal bounding box by determining which spanning lines we need to effect a minimal box area, but because of the parametric nature of Bézier curves this is actually a rather costly operation, and the gain in bounding precision is often not worth it.

    - -
    -
    -

    Curve inflections

    -

    Now that we know how to align a curve, there's one more thing we can calculate: inflection points. Imagine we have a variable size circle that we can slide up against our curve. We place it against the curve and adjust its radius so that where it touches the curve, the curvatures of the curve and the circle are the same, and then we start to slide the circle along the curve - for quadratic curves, we can always do this without the circle behaving oddly: we might have to change the radius of the circle as we slide it along, but it'll always sit against the same side of the curve.

    -

    But what happens with cubic curves? Imagine we have an S curve and we place our circle at the start of the curve, and start sliding it along. For a while we can simply adjust the radius and things will be fine, but once we get to the midpoint of that S, something odd happens: the circle "flips" from one side of the curve to the other side, in order for the curvatures to keep matching. This is called an inflection, and we can find out where those happen relatively easily.

    -

    What we need to do is solve a simple equation:

    - - -

    What we're saying here is that given the curvature function C(t), we want to know for which values of t this function is zero, meaning there is no "curvature", which will be exactly at the point between our circle being on one side of the curve, and our circle being on the other side of the curve. So what does C(t) look like? Actually something that seems not too hard:

    - - -

    The function C(t) is the cross product between the first and second derivative functions for the parametric dimensions of our curve. And, as already shown, derivatives of Bézier curves are just simpler Bézier curves, with very easy to compute new coefficients, so this should be pretty easy.

    -

    However as we've seen in the section on aligning, aligning lets us simplify things a lot, by completely removing the contributions of the first coordinate from most mathematical evaluations, and removing the last y coordinate as well by virtue of the last point lying on the x-axis. So, while we can evaluate C(t) = 0 for our curve, it'll be much easier to first axis-align the curve and then evaluating the curvature function.

    -
    - -

    Let's derive the full formula anyway

    -

    Of course, before we do our aligned check, let's see what happens if we compute the curvature function without axis-aligning. We start with the first and second derivatives, given our basis functions:

    - - -

    And of course the same functions for y:

    - - -

    Asking a computer to now compose the C(t) function for us (and to expand it to a readable form of simple terms) gives us this rather overly complicated set of arithmetic expressions:

    - - -

    That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all disappear if we axis-align our curve, which is why aligning is a great idea.

    -
    + +

    + That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all + disappear if we axis-align our curve, which is why aligning is a great idea. +

    + -

    Aligning our curve so that three of the eight coefficients become zero, and observing that scale does not affect finding t values, we end up with the following simple term function for C(t):

    - - -

    That's a lot easier to work with: we see a fair number of terms that we can compute and then cache, giving us the following simplification:

    - - -

    This is a plain quadratic curve, and we know how to solve C(t) = 0; we use the quadratic formula:

    - - -

    We can easily compute this value if the discriminator isn't a negative number (because we only want real roots, not complex roots), and if x is not zero, because divisions by zero are rather useless.

    -

    Taking that into account, we compute t, we disregard any t value that isn't in the Bézier interval [0,1], and we now know at which t value(s) our curve will inflect.

    - - - Скрипти вимкнено. показує резервний. - - - - -
    -
    -

    The canonical form (for cubic curves)

    -

    While quadratic curves are relatively simple curves to analyze, the same cannot be said of the cubic curve. As a curvature is controlled by more than one control point, it exhibits all kinds of features like loops, cusps, odd colinear features, and as many as two inflection points because the curvature can change direction up to three times. Now, knowing what kind of curve we're dealing with means that some algorithms can be run more efficiently than if we have to implement them as generic solvers, so is there a way to determine the curve type without lots of work?

    -

    As it so happens, the answer is yes, and the solution we're going to look at was presented by Maureen C. Stone from Xerox PARC and Tony D. deRose from the University of Washington in their joint paper "A Geometric Characterization of Parametric Cubic curves". It was published in 1989, and defines curves as having a "canonical" form (i.e. a form that all curves can be reduced to) from which we can immediately tell what features a curve will have. So how does it work?

    -

    The first observation that makes things work is that if we have a cubic curve with four points, we can apply a linear transformation to these points such that three of the points end up on (0,0), (0,1) and (1,1), with the last point then being "somewhere". After applying that transformation, the location of that last point can then tell us what kind of curve we're dealing with. Specifically, we see the following breakdown:

    - - - Скрипти вимкнено. показує резервний. - - - -

    This is a fairly funky image, so let's see what the various parts of it mean...

    -

    We see the three fixed points at (0,0), (0,1) and (1,1). The various regions and boundaries indicate what property the original curve will have, if the fourth point is in/on that region or boundary. Specifically, if the fourth point is...

    -
      -
    1. ...anywhere inside the red zone, but not on its boundaries, the curve will be self-intersecting (yielding a loop). We won't know where it self-intersects (in terms of t values), but we are guaranteed that it does.

      -
    2. -
    3. ...on the left (red) edge of the red zone, the curve will have a cusp. We again don't know where, but we know there is one. This edge is described by the function:

      - - -
    4. -
    5. ...on the almost circular, lower right (pink) edge, the curve's end point touches the curve, forming a loop. This edge is described by the function:

      - - -
    6. -
    7. ...on the top (blue) edge, the curve's start point touches the curve, forming a loop. This edge is described by the function:

      - - -
    8. -
    9. ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

      -
    10. -
    11. ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two inflections (switching from concave to convex and then back again, or from convex to concave and then back again).

      -
    12. -
    13. ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

      -
    14. -
    -

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    -
    + + +
  • +

    ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    +
  • +
  • +

    + ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two + inflections (switching from concave to convex and then back again, or from convex to concave and then back again). +

    +
  • +
  • ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

  • + +

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    +
    +

    Wait, where do those lines come from?

    +

    + Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, + and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield + formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic + expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the + properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval. +

    +

    + The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general + cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to + it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general + curve does over the interval t=0 to t=1. +

    +

    + For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you + can probably follow along with this paper, although you might have to read it a few times before all the bits "click". +

    +
    -

    Wait, where do those lines come from?

    -

    Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval.

    -

    The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general curve does over the interval t=0 to t=1.

    -

    For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you can probably follow along with this paper, although you might have to read it a few times before all the bits "click".

    -
    - -

    So now the question becomes: how do we manipulate our curve so that it fits this canonical form, with three fixed points, and one "free" point? Enter linear algebra. Don't worry, I'll be doing all the math for you, as well as show you what the effect is on our curves, but basically we're going to be using linear algebra, rather than calculus, because "it's way easier". Sometimes a calculus approach is very hard to work with, when the equivalent geometrical solution is super obvious.

    -

    The approach is going to start with a curve that doesn't have all-colinear points (so we need to make sure the points don't all fall on a straight line), and then applying three graphics operations that you will probably have heard of: translation (moving all points by some fixed x- and y-distance), scaling (multiplying all points by some x and y scale factor), and shearing (an operation that turns rectangles into parallelograms).

    -

    Step 1: we translate any curve by -p1.x and -p1.y, so that the curve starts at (0,0). We're going to make use of an interesting trick here, by pretending our 2D coordinates are 3D, with the z coordinate simply always being 1. This is an old trick in graphics to overcome the limitations of 2D transformations: without it, we can only turn (x,y) coordinates into new coordinates of the form (ax + by, cx + dy), which means we can't do translation, since that requires we end up with some kind of (x + a, y + b). If we add a bogus z coordinate that is always 1, then we can suddenly add arbitrary values. For example:

    - - -

    Sweet! z stays 1, so we can effectively ignore it entirely, but we added some plain values to our x and y coordinates. So, if we want to subtract p1.x and p1.y, we use:

    - - -

    Running all our coordinates through this transformation gives a new set of coordinates, let's call those U, where the first coordinate lies on (0,0), and the rest is still somewhat free. Our next job is to make sure point 2 ends up lying on the x=0 line, so what we want is a transformation matrix that, when we run it, subtracts x from whatever x we currently have. This is called shearing, and the typical x-shear matrix and its transformation looks like this:

    - - -

    So we want some shearing value that, when multiplied by y, yields -x, so our x coordinate becomes zero. That value is simply -x/y, because *-x/y * y = -x*. Done:

    - - -

    Now, running this on all our points generates a new set of coordinates, let's call those V, which now have point 1 on (0,0) and point 2 on (0, some-value), and we wanted it at (0,1), so we need to do some scaling to make sure it ends up at (0,1). Additionally, we want point 3 to end up on (1,1), so we can also scale x to make sure its x-coordinate will be 1 after we run the transform. That means we'll be x-scaling by 1/point3x, and y-scaling by point2y. This is really easy:

    - - -

    Then, finally, this generates a new set of coordinates, let's call those W, of which point 1 lies on (0,0), point 2 lies on (0,1), and point three lies on (1, ...) so all that's left is to make sure point 3 ends up at (1,1) - but we can't scale! Point 2 is already in the right place, and y-scaling would move it out of (0,1) again, so our only option is to y-shear point three, just like how we x-sheared point 2 earlier. In this case, we do the same trick, but with y/x rather than x/y because we're not x-shearing but y-shearing. Additionally, we don't actually want to end up at zero (which is what we did before) so we need to shear towards an offset, in this case 1:

    - - -

    And this generates our final set of four coordinates. Of these, we already know that points 1 through 3 are (0,0), (0,1) and (1,1), and only the last coordinate is "free". In fact, given any four starting coordinates, the resulting "transformation mapped" coordinate will be:

    - - -

    Okay, well, that looks plain ridiculous, but: notice that every coordinate value is being offset by the initial translation, and also notice that a lot of terms in that expression are repeated. Even though the maths looks crazy as a single expression, we can just pull this apart a little and end up with an easy-to-calculate bit of code!

    -

    First, let's just do that translation step as a "preprocessing" operation so we don't have to subtract the values all the time. What does that leave?

    - - -

    Suddenly things look a lot simpler: the mapped x is fairly straight forward to compute, and we see that the mapped y actually contains the mapped x in its entirety, so we'll have that part already available when we need to evaluate it. In fact, let's pull out all those common factors to see just how simple this is:

    - - -

    That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it look!

    -
    + +

    + That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it + look! +

    +
    +

    How do you track all that?

    +

    + Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply + use Mathematica. Tracking all this math by hand is insane, and we invented computers, + literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the + program do the math for me. And real math, too, with symbols, not with numbers. In fact, + here's the Mathematica notebook if you want to see how + this works for yourself. +

    +

    + Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's + $344 for home use, up from $295 when I original wrote this, but it's + also free when you buy a $35 raspberry pi. Obviously, I bought a + raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do + it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for. +

    +
    -

    How do you track all that?

    -

    Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply use Mathematica. Tracking all this math by hand is insane, and we invented computers, literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the program do the math for me. And real math, too, with symbols, not with numbers. In fact, here's the Mathematica notebook if you want to see how this works for yourself.

    -

    Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's $344 for home use, up from $295 when I original wrote this, but it's also free when you buy a $35 raspberry pi. Obviously, I bought a raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for.

    -
    - -

    So, let's write up a sketch that'll show us the canonical form for any curve drawn in blue, overlaid on our canonical map, so that we can immediately tell which features our curve must have, based on where the fourth coordinate is located on the map:

    - - - Скрипти вимкнено. показує резервний. - - - - -
    -
    -

    Finding Y, given X

    -

    One common task that pops up in things like CSS work, or parametric equalizers, or image leveling, or any other number of applications where Bézier curves are used as control curves in a way that there is really only ever one "y" value associated with one "x" value, you might want to cut out the middle man, as it were, and compute "y" directly based on "x". After all, the function looks simple enough, finding the "y" value should be simple too, right? Unfortunately, not really. However, it is possible and as long as you have some code in place to help, it's not a lot of a work either.

    -

    We'll be tackling this problem in two stages: the first, which is the hard part, is figuring out which "t" value belongs to any given "x" value. For instance, have a look at the following graphic. On the left we have a Bézier curve that looks for all intents and purposes like it fits our criteria: every "x" has one and only one associated "y" value. On the right we see the function for just the "x" values: that's a cubic curve, but not a really crazy cubic curve. If you move the graphic's slider, you will see a red line drawn that corresponds to the x coordinate: this is a vertical line in the left graphic, and a horizontal line on the right.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Now, if you look more closely at that right graphic, you'll notice something interesting: if we treat the red line as "the x axis", then the point where the function crosses our line is really just a root for the cubic function x(t) through a shifted "x-axis"... and we've already seen how to calculate roots, so let's just run cubic root finding - and not even the complicated cubic case either: because of the kind of curve we're starting with, we know there is only root, simplifying the code we need!

    -

    First, let's look at the function for x(t):

    - - -

    We can rewrite this to a plain polynomial form, by just fully writing out the expansion and then collecting the polynomial factors, as:

    - - -

    Nothing special here: that's a standard cubic polynomial in "power" form (i.e. all the terms are ordered by their power of t). So, given that a, b, c, d, and x(t) are all known constants, we can trivially rewrite this (by moving the x(t) across the equal sign) as:

    - - -

    You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using Cardano's algorithm, and we're left with some rather short code:

    + +

    + You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they + all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using + Cardano's algorithm, and we're left with some rather short code: +

    - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So the procedure is fairly straight forward: pick an x, find the associated t value, evaluate our curve for that t value, which gives us the curve's {x,y} coordinate, which means we know y for this x. Move the slider for the following graphic to see this in action:

    - - - Скрипти вимкнено. показує резервний. - - - - - +y = curve.get(t).y + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Arc length

    -

    How long is a Bézier curve? As it turns out, that's not actually an easy question, because the answer requires maths that —much like root finding— cannot generally be solved the traditional way. If we have a parametric curve with fx(t) and fy(t), then the length of the curve, measured from start point to some point t = z, is computed using the following seemingly straight forward (if a bit overwhelming) formula:

    - - -

    or, more commonly written using Leibnitz notation as:

    - - -

    This formula says that the length of a parametric curve is in fact equal to the area underneath a function that looks a remarkable amount like Pythagoras' rule for computing the diagonal of a straight angled triangle. This sounds pretty simple, right? Sadly, it's far from simple... cutting straight to after the chase is over: for quadratic curves, this formula generates an unwieldy computation, and we're simply not going to implement things that way. For cubic Bézier curves, things get even more fun, because there is no "closed form" solution, meaning that due to the way calculus works, there is no generic formula that allows you to calculate the arc length. Let me just repeat this, because it's fairly crucial: for cubic and higher Bézier curves, there is no way to solve this function if you want to use it "for all possible coordinates".

    -

    Seriously: It cannot be done.

    -

    So we turn to numerical approaches again. The method we'll look at here is the Gauss quadrature. This approximation is a really neat trick, because for any nth degree polynomial it finds approximated values for an integral really efficiently. Explaining this procedure in length is way beyond the scope of this page, so if you're interested in finding out why it works, I can recommend the University of South Florida video lecture on the procedure, linked in this very paragraph. The general solution we're looking for is the following:

    - - -

    In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the better the approximation:

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    + +

    + In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips + sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The + more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the + better the approximation: +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    -

    Now, infinitely many terms to sum and infinitely thin rectangles are not something that computers can work with, so instead we're going to approximate the infinite summation by using a sum of a finite number of "just thin" rectangular strips. As long as we use a high enough number of thin enough rectangular strips, this will give us an approximation that is pretty close to what the real value is.

    -

    So, the trick is to come up with useful rectangular strips. A naive way is to simply create n strips, all with the same width, but there is a far better way using special values for C and f(t) depending on the value of n, which indicates how many strips we'll use, and it's called the Legendre-Gauss quadrature.

    -

    This approach uses strips that are not spaced evenly, but instead spaces them in a special way based on describing the function as a polynomial (the more strips, the more accurate the polynomial), and then computing the exact integral for that polynomial. We're essentially performing arc length computation on a flattened curve, but flattening it based on the intervals dictated by the Legendre-Gauss solution.

    -
    - -

    Note that one requirement for the approach we'll use is that the integral must run from -1 to 1. That's no good, because we're dealing with Bézier curves, and the length of a section of curve applies to values which run from 0 to "some value smaller than or equal to 1" (let's call that value z). Thankfully, we can quite easily transform any integral interval to any other integral interval, by shifting and scaling the inputs. Doing so, we get the following:

    - - -

    That may look a bit more complicated, but the fraction involving z is a fixed number, so the summation, and the evaluation of the f(t) values are still pretty simple.

    -

    So, what do we need to perform this calculation? For one, we'll need an explicit formula for f(t), because that derivative notation is handy on paper, but not when we have to implement it. We'll also need to know what these Ci and ti values should be. Luckily, that's less work because there are actually many tables available that give these values, for any n, so if we want to approximate our integral with only two terms (which is a bit low, really) then these tables would tell us that for n=2 we must use the following values:

    - - -

    Which means that in order for us to approximate the integral, we must plug these values into the approximate function, which gives us:

    - - -

    We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the Bézier curve functions Bx(t) and By(t).

    -
    + +

    + We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the + Bézier curve functions Bx(t) and By(t). +

    + -

    If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the start/end points on the inside?

    - - - Скрипти вимкнено. показує резервний. - - - +

    + If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), + we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, + with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make + a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the + start/end points on the inside? +

    + + + Скрипти вимкнено. показує резервний. + + + +
    +
    +

    + + Approximated arc length +

    +

    + Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length + instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This + will come with an error, but this can be made arbitrarily small by increasing the segment count. +

    +

    + If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal + effort: +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + + + Скрипти вимкнено. показує резервний. + + + + + +
    -
    -
    -

    Approximated arc length

    -

    Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This will come with an error, but this can be made arbitrarily small by increasing the segment count.

    -

    If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal effort:

    -
    - - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - -
    - -

    You may notice that even though the error in length is actually pretty significant in absolute terms, even at a low number of segments we get a length that agrees with the true length when it comes to just the integer part of the arc length. Quite often, approximations can drastically speed things up!

    - -
    -
    -

    Curvature of a curve

    -

    If we have two curves, and we want to line them in up in a way that "looks right", what would we use as metric to let a computer decide what "looks right" means?

    -

    For instance, we can start by ensuring that the two curves share an end coordinate, so that there is no "gap" between the end of one and the start of the next curve, but that won't guarantee that things look right: both curves can be going in wildly different directions, and the resulting joined geometry will have a corner in it, rather than a smooth transition from one curve to the next.

    -

    What we want is to ensure that the curvature at the transition from one curve to the next "looks good". So, we start with a shared coordinate, and then also require that derivatives for both curves match at that coordinate. That way, we're assured that their tangents line up, which must mean the curve transition is perfectly smooth. We can even make the second, third, etc. derivatives match up for better and better transitions.

    -

    Problem solved!

    -

    However, there's a problem with this approach: if we think about this a little more, we realise that "what a curve looks like" and its derivative values are pretty much entirely unrelated. After all, the section on reordering curves showed us that the same looking curve can have an infinite number of curve expressions of arbitrarily high Bézier degree, and each of those will have wildly different derivative values.

    -

    So what we really want is some kind of expression that's not based on any particular expression of t, but is based on something that is invariant to the kind of function(s) we use to draw our curve. And the prime candidate for this is our curve expression, reparameterised for distance: no matter what order of Bézier curve we use, if we were able to rewrite it as a function of distance-along-the-curve, all those different degree Bézier functions would end up being the same function for "coordinate at some distance D along the curve".

    -

    We've seen this before... that's the arc length function.

    -

    So you might think that in order to find the curvature of a curve, we now need to solve the arc length function itself, and that this would be quite a problem because we just saw that there is no way to actually do that. Thankfully, we don't. We only need to know the form of the arc length function, which we saw above and is fairly simple, rather than needing to solve the arc length function. If we start with the arc length expression and the run through the steps necessary to determine its derivative (with an alternative, shorter demonstration of how to do this found over on Stackexchange), then the integral that was giving us so much problems in solving the arc length function disappears entirely (because of the fundamental theorem of calculus), and what we're left with us some surprisingly simple maths that relates curvature (denoted as κ, "kappa") to—and this is the truly surprising bit—a specific combination of derivatives of our original function.

    -

    Let me highlight what just happened, because it's pretty special:

    -
      -
    1. we wanted to make curves line up, and initially thought to match the curves' derivatives, but
    2. -
    3. that turned out to be a really bad choice, so instead
    4. -
    5. we picked a function that is basically impossible to work with, and then worked with that, which
    6. -
    7. gives us a simple formula that is and expression using the curves' derivatives.
    8. -
    -

    That's crazy!

    -

    But that's also one of the things that makes maths so powerful: even if your initial ideas are off the mark, you might be much closer than you thought you were, and the journey from "thinking we're completely wrong" to "actually being remarkably close to being right" is where we can find a lot of insight.

    -

    So, what does the function look like? This:

    - - -

    Which is really just a "short form" that glosses over the fact that we're dealing with functions of t, so let's expand that a tiny bit:

    - - -

    And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so evaluating this function for any t value is just a matter of basic arithematics.

    -

    In fact, let's just implement it right now:

    + +

    + And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any + (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that + looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier + curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so + evaluating this function for any t value is just a matter of basic arithematics. +

    +

    In fact, let's just implement it right now:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    That was easy! (Well okay, that "not a number" value will need to be taken into account by downstream code, but that's a reality of programming anyway)

    -

    With all of that covered, let's line up some curves! The following graphic gives you two curves that look identical, but use quadratic and cubic functions, respectively. As you can see, despite their derivatives being necessarily different, their curvature (thanks to being derived based on maths that "ignores" specific function derivative, and instead gives a formula that smooths out any differences) is exactly the same. And because of that, we can put them together such that the point where they overlap has the same curvature for both curves, giving us the smoothest transition.

    - - - Скрипти вимкнено. показує резервний. - - - -

    One thing you may have noticed in this sketch is that sometimes the curvature looks fine, but seems to be pointing in the wrong direction, making it hard to line up the curves properly. A way around that, of course, is to show the curvature on both sides of the curve, so let's just do that. But let's take it one step further: we can also compute the associated "radius of curvature", which gives us the implicit circle that "fits" the curve's curvature at any point, using what is possibly the simplest bit of maths found in this entire primer:

    - - -

    So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" our curve at some point that we can control by using a slider:

    - - - Скрипти вимкнено. показує резервний. - - - - - + +

    + So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" + our curve at some point that we can control by using a slider: +

    + + + Скрипти вимкнено. показує резервний. + + + + + +
    +
    +

    + + Tracing a curve at fixed distance intervals +

    +

    + Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance + intervals over time, like a train along a track, and you want to use Bézier curves. +

    +

    Now you have a problem.

    +

    + The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a + track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick + t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In + fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another. +

    +

    + The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be + straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To + make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for + this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length + function, which can have more inflection points. +

    + + + Скрипти вимкнено. показує резервний. + + +

    + So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through + the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and + then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of + points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two + points, but if we have a high enough number of samples, we don't even need to bother. +

    +

    + So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t + curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks + like, by coloring each section of curve between two distance markers differently: +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    +

    + However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to + constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT). +

    +
    +
    +

    + + Intersections +

    +

    + Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to + work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to + perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box + overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the + actual curve. +

    +

    + We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by + implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both + lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution. +

    +

    Line-line intersections

    +

    + If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are + an intervals on by linear algebra, using the procedure outlined in this + top coder + article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line + segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments. +

    +

    + The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on + (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection + point). +

    + + + Скрипти вимкнено. показує резервний. + + + +
    +

    Implementing line-line intersections

    +

    + Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, + but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe + you're using point structs for the line. Let's get coding: +

    -
    -
    -

    Tracing a curve at fixed distance intervals

    -

    Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance intervals over time, like a train along a track, and you want to use Bézier curves.

    -

    Now you have a problem.

    -

    The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another.

    -

    The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length function, which can have more inflection points.

    - - - Скрипти вимкнено. показує резервний. - - - -

    So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two points, but if we have a high enough number of samples, we don't even need to bother.

    -

    So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks like, by coloring each section of curve between two distance markers differently:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    -

    However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT).

    - -
    -
    -

    Intersections

    -

    Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the actual curve.

    -

    We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution.

    -

    Line-line intersections

    -

    If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are an intervals on by linear algebra, using the procedure outlined in this top coder article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments.

    -

    The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection point).

    - - - Скрипти вимкнено. показує резервний. - - - -
    - -

    Implementing line-line intersections

    -

    Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe you're using point structs for the line. Let's get coding:

    - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    - -
    + return lli4(line1.p1, line1.p2, line2.p1, line2.p2) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + -

    What about curve-line intersections?

    -

    Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the section on finding extremities.

    -
    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -
    +

    What about curve-line intersections?

    +

    + Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we + translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in + a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a + curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the + section on finding extremities. +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + Скрипти вимкнено. показує резервний. + + + +
    -

    Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and curve splitting.

    +

    + Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the + curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and + curve splitting. +

    +
    +
    +

    + + Curve/curve intersection +

    +

    + Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" + technique: +

    +
      +
    1. + Take two curves C1 and C2, and treat them as a pair. +
    2. +
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. +
    5. + With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2). +
    6. +
    7. + For each pair, check whether their bounding boxes overlap. +
        +
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. +
      3. If there is overlap, rerun all steps for this pair.
      4. +
      +
    8. +
    9. + Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we + might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value + (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative). +
    10. +
    +

    + This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then + taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections. +

    +

    + The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click + the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value + that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the + algorithm, so you can try this with lots of different curves. +

    +

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    + + + Скрипти вимкнено. показує резервний. + + + + + + +

    + Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into + two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at + t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each + iteration, and each successive step homing in on the curve's self-intersection points. +

    +
    +
    +

    + + The projection identity +

    +

    + De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to + draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. + Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point + around to change the curve's shape. +

    +

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    +

    + In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want + to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know + has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for + reasons that will become obvious. +

    +

    + So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following + graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which + taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what + happens to that value? +

    +
    + + + Скрипти вимкнено. показує резервний. + + + + + + + + Скрипти вимкнено. показує резервний. + + + + + +
    -
    -
    -

    Curve/curve intersection

    -

    Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" technique:

    -
      -
    1. Take two curves C1 and C2, and treat them as a pair.
    2. -
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. -
    5. With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2).
    6. -
    7. For each pair, check whether their bounding boxes overlap.
        -
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. -
      3. If there is overlap, rerun all steps for this pair.
      4. -
      -
    8. -
    9. Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative).
    10. -
    -

    This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections.

    -

    The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the algorithm, so you can try this with lots of different curves.

    -

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    - - - Скрипти вимкнено. показує резервний. - - - - - - -

    Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each iteration, and each successive step homing in on the curve's self-intersection points.

    - -
    -
    -

    The projection identity

    -

    De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point around to change the curve's shape.

    -

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    -

    In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for reasons that will become obvious.

    -

    So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what happens to that value?

    -
    - - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - -
    - -

    So these graphics show us several things:

    -
      -
    1. a point at the tip of the curve construction's "hat": let's call that A, as well as
    2. -
    3. our on-curve point give our chosen t value: let's call that B, and finally,
    4. -
    5. a point that we get by projecting A, through B, onto the line between the curve's start and end points: let's call that C.
    6. -
    7. for both quadratic and cubic curves, two points e1 and e2, which represent the single-to-last step in de Casteljau's algorithm: in the last step, we find B at (1-t) * e1 + t * e2.
    8. -
    9. for cubic curves, also the points v1 and v2, which together with A represent the first step in de Casteljau's algorithm: in the next step, we find e1 and e2.
    10. -
    -

    These three values A, B, and C allow us to derive an important identity formula for quadratic and cubic Bézier curves: for any point on the curve with some t value, the ratio of distances from A to B and B to C is fixed: if some t value sets up a C that is 20% away from the start and 80% away from the end, then it doesn't matter where the start, end, or control points are; for that t value, C will always lie at 20% from the start and 80% from the end point. Go ahead, pick an on-curve point in either graphic and then move all the other points around: if you only move the control points, start and end won't move, and so neither will C, and if you move either start or end point, C will move but its relative position will not change.

    -

    So, how can we compute C? We start with our observation that C always lies somewhere between the start and end points, so logically C will have a function that interpolates between those two coordinates:

    - - -

    If we can figure out what the function u(t) looks like, we'll be done. Although we do need to remember that this u(t) will have a different form depending on whether we're working with quadratic or cubic curves. Running through the maths (with thanks to Boris Zbarsky) shows us the following two formulae:

    - - -

    And

    - - -

    So, if we know the start and end coordinates and the t value, we know C without having to calculate the A or even B coordinates. In fact, we can do the same for the ratio function. As another function of t, we technically don't need to know what A or B or C are. It, too, can be expressed as a pure function of t.

    -

    We start by observing that, given A, B, and C, the following always holds:

    - - -

    Working out the maths for this, we see the following two formulae for quadratic and cubic curves:

    - - -

    And

    - - -

    Which now leaves us with some powerful tools: given three points (start, end, and "some point on the curve"), as well as a t value, we can construct curves. We can compute C using the start and end points and our u(t) function, and once we have C, we can use our on-curve point (B) and the ratio(t) function to find A:

    - - -

    With A found, finding e1 and e2 for quadratic curves is a matter of running the linear interpolation with t between start and A to yield e1, and between A and end to yield e2. For cubic curves, there is no single pair of points that can act as e1 and e2 (there are infinitely many, because the tangent at B is a free parameter for cubic curves) so as long as the distance ratio between e1 to B and B to e2 is the Bézier ratio (1-t):t, we are free to pick any pair, after which we can reverse engineer v1 and v2:

    - - -

    And then reverse engineer the curve's control points:

    - - -

    So: if we have a curve's start and end points, as well as some third point B that we want the curve to pass through, then for any t value we implicitly know all the ABC values, which (combined with an educated guess on appropriate e1 and e2 coordinates for cubic curves) gives us the necessary information to reconstruct a curve's "de Casteljau skeleton". Which means that we can now do several things: we can "fit" curves using only three points, which means we can also "mold" curves by moving an on-curve point but leaving its start and end points, and then reconstruct the curve based on where we moved the on-curve point to. These are very useful things, and we'll look at both in the next few sections.

    - -
    -
    -

    Creating a curve from three points

    -

    Given the preceding section, you might be wondering if we can use that knowledge to just "create" curves by placing some points and having the computer do the rest, to which the answer is: that's exactly what we can now do!

    -

    For quadratic curves, things are pretty easy. Technically, we'll need a t value in order to compute the ratio function used in computing the ABC coordinates, but we can just as easily approximate one by treating the distance between the start and B point, and B and end point as a ratio, using

    - - -

    With this code in place, creating a quadratic curve from three points is literally just computing the ABC values, and using A as our curve's control point:

    - - - Скрипти вимкнено. показує резервний. - - - -

    For cubic curves we need to do a little more work, but really only just a little. We're first going to assume that a decent curve through the three points should approximate a circular arc, which first requires knowing how to fit a circle to three points. You may remember (if you ever learned it!) that a line between two points on a circle is called a chord, and that one property of chords is that the line from the center of any chord, perpendicular to that chord, passes through the center of the circle.

    -

    That means that if we have three points on a circle, we have three (different) chords, and consequently, three (different) lines that go from those chords through the center of the circle: if we find two of those lines, then their intersection will be our circle's center, and the circle's radius will—by definition!—be the distance from the center to any of our three points:

    - - - Скрипти вимкнено. показує резервний. - - - -

    With that covered, we now also know the tangent line to our point B, because the tangent to any point on the circle is a line through that point, perpendicular to the line from that point to the center. That just leaves marking appropriate points e1 and e2 on that tangent, so that we can construct a new cubic curve hull. We use the approach as we did for quadratic curves to automatically determine a reasonable t value, and then our e1 and e2 coordinates must obey the standard de Casteljau rule for linear interpolation:

    - - -

    Where d is the total length of the line segment from e1 to e2. So how long do we make that? There are again all kinds of approaches we can take, and a simple-but-effective one is to set the length of that segment to "one third the length of the baseline". This forces e1 and e2 to always be the "linear curve" distance apart, which means if we place our three points on a line, it will actually look like a line. Nice! The last thing we'll need to do is make sure to flip the sign of d depending on which side of the baseline our B is located, so we don't end up creating a funky curve with a loop in it. To do this, we can use the atan2 function:

    - - -

    This angle φ will be between 0 and π if B is "above" the baseline (rotating all three points so that the start is on the left and the end is the right), so we can use a relatively straight forward check to make sure we're using the correct sign for our value d:

    - - -

    The result of this approach looks as follows:

    - - - Скрипти вимкнено. показує резервний. - - - -

    It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; more on that later), we're only trying to come up with some reasonable e1 and e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives us:

    - - - Скрипти вимкнено. показує резервний. - - - -

    That looks perfectly serviceable!

    -

    Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at implementing curve molding in the section after that, so read on!

    + +

    The result of this approach looks as follows:

    + + + Скрипти вимкнено. показує резервний. + + + +

    + It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, + we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; + more on that later), we're only trying to come up with some reasonable e1 and + e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives + us: +

    + + + Скрипти вимкнено. показує резервний. + + + +

    That looks perfectly serviceable!

    +

    + Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" + curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding + the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at + implementing curve molding in the section after that, so read on! +

    +
    +
    +

    + + Projecting a point onto a Bézier curve +

    +

    + Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're + trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the + curve our cursor will be closest to. So, how do we project points onto a curve? +

    +

    + If the Bézier curve is of low enough order, we might be able to + work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a + numerical approach again. We'll be finding our ideal t value using a + binary search. First, we do a coarse distance-check based on + t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast: +

    -
    -
    -

    Projecting a point onto a Bézier curve

    -

    Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the curve our cursor will be closest to. So, how do we project points onto a curve?

    -

    If the Bézier curve is of low enough order, we might be able to work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a numerical approach again. We'll be finding our ideal t value using a binary search. First, we do a coarse distance-check based on t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    After this runs, we know that LUT[i] is the coordinate on the curve in our LUT that is closest to the point we want to project, so that's a pretty good initial guess as to what the best projection onto our curve is. To refine it, we note that LUT[i] is a better guess than both LUT[i-1] and LUT[i+1], but there might be an even better projection somewhere else between those two values, so that's what we're going to be testing for, using a variation of the binary search.

    -
      -
    1. we start with our point p, and the t values t1=LUT[i-1].t and t2=LUT[i+1].t, which span an interval v = t2-t1.
    2. -
    3. we test this interval in five spots: the start, middle, and end (which we already have), and the two points in between the middle and start/end points
    4. -
    5. we then check which of these five points is the closest to our original point p, and then repeat step 1 with the points before and after the closest point we just found.
    6. -
    -

    This makes the interval we check smaller and smaller at each iteration, and we can keep running the three steps until the interval becomes so small as to lead to distances that are, for all intents and purposes, the same for all points.

    -

    So, let's see that in action: in this case, I'm going to arbitrarily say that if we're going to run the loop until the interval is smaller than 0.001, and show you what that means for projecting your mouse cursor or finger tip onto a rather complex Bézier curve (which, of course, you can reshape as you like). Also shown are the original three points that our coarse check finds.

    - - - Скрипти вимкнено. показує резервний. - - - + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + -
    -
    -

    Intersections with a circle

    -

    It might seem odd to cover this subject so much later than the line/line, line/curve, and curve/curve intersection topics from several sections earlier, but the reason we can't cover circle/curve intersections is that we can't really discuss circle/curve intersection until we've covered the kind of lookup table (LUT) walking that the section on projecting a point onto a curve uses. To see why, let's look at what we would have to do if we wanted to find the intersections between a curve and a circle using calculus.

    -

    First, we observe that "finding intersections" in this case means that, given a circle defined by a center point c = (x,y) and a radius r, we want to find all points on the Bezier curve for which the distance to the circle's center point is equal to the circle radius, which by definition means those points lie on the circle, and so count as intersections. In maths, that means we're trying to solve:

    - - -

    Which seems simple enough. Unfortunately, when we expand that dist function, things get a lot more problematic:

    - - -

    And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the Abel-Ruffini theorem that we saw before, we can't solve this by just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left with a monstrous function to solve.

    -

    So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a bit more code to make sure we find all of them, but let's look at the steps involved:

    + +

    + And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the + Abel-Ruffini theorem that we saw before, we can't solve this by + just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to + actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and + all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left + with a monstrous function to solve. +

    +

    + So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the + on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance + r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a + bit more code to make sure we find all of them, but let's look at the steps involved: +

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    - -

    This is very similar to the code in the previous section, with an extra input r for the circle radius, and a minor change in the "distance for this coordinate": rather than just distance(coordinate, p) we want to know the difference between that distance and the circle radius. After all, if that difference is zero, then the distance from the coordinate to the circle center is exactly the radius, so the coordinate lies on both the curve and the circle.

    -

    So far so good.

    -

    However, we also want to make sure we find all the points, not just a single one, so we need a little more code for that:

    + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    After running this code, values will be the list of all LUT coordinates that are closest to the distance r: we can use those values to run the same kind of refinement lookup we used for point projection (with the caveat that we're now not checking for smallest distance, but for "distance closest to r"), and we'll have all our intersection points. Of course, that does require explaining what findClosest does: rather than looking for a global minimum, we're now interested in finding a local minimum, so instead of checking a single point and looking at its distance value, we check three points ("current", "previous" and "before previous") and then check whether they form a local minimum:

    + start = i + 2; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    In words: given a start index, the circle center and radius, and our LUT, we check where (closest to our start index) we can find a local minimum for the difference between "the distance from the curve to the circle center", and the circle's radius. We track this by looking at three values (associated with the indices index-2, index-1, and index), and we know we've found a local minimum if the three values show that the middle value (pd1) is less than either value beside it. When we do, we can set our "best guess, relative to start" as index-1. Of course, since we're now checking values relative to some start value, we might not find another candidate value at all, in which case we return start - 1, so that a simple "is the result less than start?" lets us determine that there are no more intersections to find.

    -

    Finally, while not necessary for point projection, there is one more step we need to perform when we run the binary refinement function on our candidate LUT indices, because we've so far only been testing using distances "closest to the radius of the circle", and that's actually not good enough... we need distances that are the radius of the circle. So, after running the refinement for each of these indices, we need to discard any final value that isn't the circle radius. And because we're working with floating point numbers, what this really means is that we need to discard any value that's a pixel or more "off". Or, if we want to get really fancy, "some small epsilon value".

    -

    Based on all of that, the following graphic shows this off for the standard cubic curve (which you can move the coordinates around for, of course) and a circle with a controllable radius centered on the graphic's center, using the code approach described above.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    And of course, for the full details, click that "view source" link.

    + return start + i + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + -
    -
    -

    Molding a curve

    -

    Armed with knowledge of the "ABC" relation, point-on-curve projection, and guestimating reasonable looking helper values for cubic curve construction, we can finally cover curve molding: updating a curve's shape interactively, by dragging points on the curve around.

    -

    For quadratic curve, this is a really simple trick: we project our cursor onto the curve, which gives us a t value and initial B coordinate. We don't even need the latter: with our t value and "wherever the cursor is" as target B, we can compute the associated C:

    - - -

    And then the associated A:

    - - -

    And we're done, because that's our new quadratic control point!

    - - - Скрипти вимкнено. показує резервний. - - - -

    As before, cubic curves are a bit more work, because while it's easy to find our initial t value and ABC values, getting those all-important e1 and e2 coordinates is going to pose a bit of a problem... in the section on curve creation, we were free to pick an appropriate t value ourselves, which allowed us to find appropriate e1 and e2 coordinates. That's great, but when we're curve molding we don't have that luxury: whatever point we decide to start moving around already has its own t value, and its own e1 and e2 values, and those may not make sense for the rest of the curve.

    -

    For example, let's see what happens if we just "go with what we get" when we pick a point and start moving it around, preserving its t value and e1/e2 coordinates:

    - - - Скрипти вимкнено. показує резервний. - - - -

    That looks reasonable, close to the original point, but the further we drag our point, the less "useful" things become. Especially if we drag our point across the baseline, rather than turning into a nice curve.

    -

    One way to combat this might be to combine the above approach with the approach from the creating curves section: generate both the "unchanged t/e1/e2" curve, as well as the "idealized" curve through the start/cursor/end points, with idealized t value, and then interpolating between those two curves:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    The slide controls the "falloff distance" relative to where the original point on the curve is, so that as we drag our point around, it interpolates with a bias towards "preserving t/e1/e2" closer to the original point, and bias towards "idealized" form the further away we move our point, with anything that's further than our falloff distance simply being the idealized curve. We don't even try to interpolate at that point.

    -

    A more advanced way to try to smooth things out is to implement continuous molding, where we constantly update the curve as we move around, and constantly change what our B point is, based on constantly projecting the cursor on the curve as we're updating it - this is, you won't be surprised to learn, tricky, and beyond the scope of this section: interpolation (with a reasonable distance) will do for now!

    - -
    -
    -

    Curve fitting

    -

    Given the previous section, one question you might have is "what if I don't want to guess t values?". After all, plenty of graphics packages do automated curve fitting, so how can we implement that in a way that just finds us reasonable t values all on its own?

    -

    And really this is just a variation on the question "how do I get the curve through these X points?", so let's look at that. Specifically, let's look at the answer: "curve fitting". This is in fact a rather rich field in geometry, applying to anything from data modelling to path abstraction to "drawing", so there's a fair number of ways to do curve fitting, but we'll look at one of the most common approaches: something called a least squares polynomial regression. In this approach, we look at the number of points we have in our data set, roughly determine what would be an appropriate order for a curve that would fit these points, and then tackle the question "given that we want an nth order curve, what are the coordinates we can find such that our curve is "off" by the least amount?".

    -

    Now, there are many ways to determine how "off" points are from the curve, which is where that "least squares" term comes in. The most common tool in the toolbox is to minimise the squared distance between each point we have, and the corresponding point on the curve we end up "inventing". A curve with a snug fit will have zero distance between those two, and a bad fit will have non-zero distances between every such pair. It's a workable metric. You might wonder why we'd need to square, rather than just ensure that distance is a positive value (so that the total error is easy to compute by just summing distances) and the answer really is "because it tends to be a little better". There's lots of literature on the web if you want to deep-dive the specific merits of least squared error metrics versus least absolute error metrics, but those are well beyond the scope of this material.

    -

    So let's look at what we end up with in terms of curve fitting if we start with the idea of performing least squares Bézier fitting. We're going to follow a procedure similar to the one described by Jim Herold over on his "Least Squares Bézier Fit" article, and end with some nice interactive graphics for doing some curve fitting.

    -

    Before we begin, we're going to use the curve in matrix form. In the section on matrices, I mentioned that some things are easier if we use the matrix representation of a Bézier curve rather than its calculus form, and this is one of those things.

    -

    As such, the first step in the process is expressing our Bézier curve as powers/coefficients/coordinate matrix T x M x C, by expanding the Bézier functions.

    -
    - -

    Revisiting the matrix representation

    -

    Rewriting Bézier functions to matrix form is fairly easy, if you first expand the function, and then arrange them into a multiple line form, where each line corresponds to a power of t, and each column is for a specific coefficient. First, we expand the function:

    - - -

    And then we (trivially) rearrange the terms across multiple lines:

    - - -

    This rearrangement has "factors of t" at each row (the first row is t⁰, i.e. "1", the second row is t¹, i.e. "t", the third row is t²) and "coefficient" at each column (the first column is all terms involving "a", the second all terms involving "b", the third all terms involving "c").

    -

    With that arrangement, we can easily decompose this as a matrix multiplication:

    - - -

    We can do the same for the cubic curve, of course. We know the base function for cubics:

    - - -

    So we write out the expansion and rearrange:

    - - -

    Which we can then decompose:

    - - -

    And, of course, we can do this for quartic curves too (skipping the expansion step):

    - - -

    And so and on so on. Now, let's see how to use these T, M, and C, to do some curve fitting.

    -
    + +

    + And so and on so on. Now, let's see how to use these T, M, and C, to do some curve + fitting. +

    + -

    Let's get started: we're going to assume we picked the right order curve: for n points we're fitting an n-1th order curve, so we "start" with a vector P that represents the coordinates we already know, and for which we want to do curve fitting:

    - - -

    Next, we need to figure out appropriate t values for each point in the curve, because we need something that lets us tie "the actual coordinate" to "some point on the curve". There's a fair number of different ways to do this (and a large part of optimizing "the perfect fit" is about picking appropriate t values), but in this case let's look at two "obvious" choices:

    -
      -
    1. equally spaced t values, and
    2. -
    3. t values that align with distance along the polygon.
    4. -
    -

    The first one is really simple: if we have n points, then we'll just assign each point i a t value of (i-1)/(n-1). So if we have four points, the first point will have t=(1-1)/(4-1)=0/3, the second point will have t=(2-1)/(4-1)=1/3, the third point will have t=2/3, and the last point will be t=1. We're just straight up spacing the t values to match the number of points we have.

    -

    The second one is a little more interesting: since we're doing polynomial regression, we might as well exploit the fact that our base coordinates just constitute a collection of line segments. At the first point, we're fixing t=0, and the last point, we want t=1, and anywhere in between we're simply going to say that t is equal to the distance along the polygon, scaled to the [0,1] domain.

    -

    To get these values, we first compute the general "distance along the polygon" matrix:

    - - -

    Where length() is literally just that: the length of the line segment between the point we're looking at, and the previous point. This isn't quite enough, of course: we still need to make sure that all the values between i=1 and i=n fall in the [0,1] interval, so we need to scale all values down by whatever the total length of the polygon is:

    - - -

    And now we can move on to the actual "curve fitting" part: what we want is a function that lets us compute "ideal" control point values such that if we build a Bézier curve with them, that curve passes through all our original points. Or, failing that, have an overall error distance that is as close to zero as we can get it. So, let's write out what the error distance looks like.

    -

    As mentioned before, this function is really just "the distance between the actual coordinate, and the coordinate that the curve evaluates to for the associated t value", which we'll square to get rid of any pesky negative signs:

    - - -

    Since this function only deals with individual coordinates, we'll need to sum over all coordinates in order to get the full error function. So, we literally just do that; the total error function is simply the sum of all these individual errors:

    - - -

    And here's the trick that justifies using matrices: while we can work with individual values using calculus, with matrices we can compute as many values as we make our matrices big, all at the "same time", We can replace the individual terms pi with the full P coordinate matrix, and we can replace Bézier(si) with the matrix representation T x M x C we talked about before, which gives us:

    - - -

    In which we can replace the rather cumbersome "squaring" operation with a more conventional matrix equivalent:

    - - -

    Here, the letter T is used instead of the number 2, to represent the matrix transpose; each row in the original matrix becomes a column in the transposed matrix instead (row one becomes column one, row two becomes column two, and so on).

    -

    This leaves one problem: T isn't actually the matrix we want: we don't want symbolic t values, we want the actual numerical values that we computed for S, so we need to form a new matrix, which we'll call 𝕋, that makes use of those, and then use that 𝕋 instead of T in our error function:

    - - -

    Which, because of the first and last values in S, means:

    - - -

    Now we can properly write out the error function as matrix operations:

    - - -

    So, we have our error function: we now need to figure out the expression for where that function has minimal value, e.g. where the error between the true coordinates and the coordinates generated by the curve fitting is smallest. Like in standard calculus, this requires taking the derivative, and determining where that derivative is zero:

    - - -
    + +
    +

    Where did this derivative come from?

    +

    + That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I + straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know. +

    +

    + So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific + case, I + posted a question + to Math.stackexchange, and received a answer that goes into way more detail than I had + hoped to receive. +

    +

    + Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean + maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that + true?". There are answers. They might just require some time to come to understand. +

    +
    -

    Where did this derivative come from?

    -

    That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know.

    -

    So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific case, I posted a question to Math.stackexchange, and received a answer that goes into way more detail than I had hoped to receive.

    -

    Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that true?". There are answers. They might just require some time to come to understand.

    -
    - -

    Now, given the above derivative, we can rearrange the terms (following the rules of matrix algebra) so that we end up with an expression for C:

    - - -

    Here, the "to the power negative one" is the notation for the matrix inverse. But that's all we have to do: we're done. Starting with P and inventing some t values based on the polygon the coordinates in P define, we can compute the corresponding Bézier coordinates C that specify a curve that goes through our points. Or, if it can't go through them exactly, as near as possible.

    -

    So before we try that out, how much code is involved in implementing this? Honestly, that answer depends on how much you're going to be writing yourself. If you already have a matrix maths library available, then really not that much code at all. On the other hand, if you are writing this from scratch, you're going to have to write some utility functions for doing your matrix work for you, so it's really anywhere from 50 lines of code to maybe 200 lines of code. Not a bad price to pay for being able to fit curves to pre-specified coordinates.

    -

    So let's try it out! The following graphic lets you place points, and will start computing exact-fit curves once you've placed at least three. You can click for more points, and the code will simply try to compute an exact fit using a Bézier curve of the appropriate order. Four points? Cubic Bézier. Five points? Quartic. And so on. Of course, this does break down at some point: depending on where you place your points, it might become mighty hard for the fitter to find an exact fit, and things might actually start looking horribly off once there's enough points for compound floating point rounding errors to start making a difference (which is around 10~11 points).

    - - - Скрипти вимкнено. показує резервний. - - - - - - -

    You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be able to freely manipulate them and see what the effect on your curve is.

    - -
    -
    -

    Bézier curves and Catmull-Rom curves

    -

    Taking an excursion to different splines, the other common design curve is the Catmull-Rom spline, which unlike Bézier curves pass through each control point, so they offer a kind of "built-in" curve fitting.

    -

    In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what looks like a single Catmull-Rom curve is actually a spline: a single curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is defined by the point's coordinates, and the tangent for those points, the latter of which can trivially be derived from knowing the previous and next point:

    - + +

    + You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio + along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get + direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be + able to freely manipulate them and see what the effect on your curve is. +

    +
    +
    +

    + + Bézier curves and Catmull-Rom curves +

    +

    + Taking an excursion to different splines, the other common design curve is the + Catmull-Rom spline, which unlike Bézier curves + pass through each control point, so they offer a kind of "built-in" curve fitting. +

    +

    + In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets + you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the + tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next. +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    + Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what + looks like a single Catmull-Rom curve is actually a spline: a single + curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, + and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is + defined by the point's coordinates, and the tangent for those points, the latter of which + can trivially be derived from knowing the + previous and next point: +

    + - -

    One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on that in the next section.

    -

    In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of variations that make things considerable more complex):

    + +

    + One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up + with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent + should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on + that in the next section. +

    +

    + In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of + variations that make things + considerable more + complex): +

    - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Now, since a Catmull-Rom curve is a form of cubic Hermite spline, and as cubic Bézier curves are also a form of cubic Hermite spline, we run into an interesting bit of maths programming: we can convert one to the other and back, and the maths for doing so is surprisingly simple!

    -

    The main difference between Catmull-Rom curves and Bézier curves is "what the points mean":

    -
      -
    • A cubic Bézier curve is defined by a start point, a control point that implies the tangent at the start, a control point that implies the tangent at the end, and an end point, plus a characterizing matrix that we can multiply by that point vector to get on-curve coordinates.
    • -
    • A Catmull-Rom curve is defined by a start point, a tangent that for that starting point, an end point, and a tangent for that end point, plus a characteristic matrix that we can multiple by the point vector to get on-curve coordinates.
    • -
    -

    Those are very similar, so let's see exactly how similar they are. We've already see the matrix form for Bézier curves, so how different is the matrix form for Catmull-Rom curves?:

    - - -

    That's pretty dang similar. So the question is: how can we convert that expression with Catmull-Rom matrix and vector into an expression of the Bézier matrix and vector? The short answer is of course "by using linear algebra", but the longer answer is the rest of this section, and involves some maths that you may not even care for: if you just want to know the (incredibly simple) conversions between the two curve forms, feel free to skip to the end of the following explanation, but if you want to how we can get one from the other... let's get mathing!

    -
    - -

    Deriving the conversion formulae

    -

    In order to convert between Catmull-Rom curves and Bézier curves, we need to know two things. Firstly, how to express the Catmull-Rom curve using a "set of four coordinates", rather than a mix of coordinates and tangents, and secondly, how to convert those Catmull-Rom coordinates to and from Bézier form.

    -

    We start with the first part, to figure out how we can go from Catmull-Rom V coordinates to Bézier P coordinates, by applying "some matrix T". We don't know what that T is yet, but we'll get to that:

    - - -

    So, this mapping says that in order to map a Catmull-Rom "point + tangent" vector to something based on an "all coordinates" vector, we need to determine the mapping matrix such that applying T yields P2 as start point, P3 as end point, and two tangents based on the lines between P1 and P3, and P2 nd P4, respectively.

    -

    Computing T is really more "arranging the numbers":

    - - -

    Thus:

    - - -

    However, we're not quite done, because Catmull-Rom curves have that "tension" parameter, written as τ (a lowercase"tau"), which is a scaling factor for the tangent vectors: the bigger the tension, the smaller the tangents, and the smaller the tension, the bigger the tangents. As such, the tension factor goes in the denominator for the tangents, and before we continue, let's add that tension factor into both our coordinate vector representation, and mapping matrix T:

    - - -

    With the mapping matrix properly done, let's rewrite the "point + tangent" Catmull-Rom matrix form to a matrix form in terms of four coordinates, and see what we end up with:

    - - -

    Replace point/tangent vector with the expression for all-coordinates:

    - - -

    and merge the matrices:

    - - -

    This looks a lot like the Bézier matrix form, which as we saw in the chapter on Bézier curves, should look like this:

    - - -

    So, if we want to express a Catmull-Rom curve using a Bézier curve, we'll need to turn this Catmull-Rom bit:

    - - -

    Into something that looks like this:

    - - -

    And the way we do that is with a fairly straight forward bit of matrix rewriting. We start with the equality we need to ensure:

    - - -

    Then we remove the coordinate vector from both sides without affecting the equality:

    - - -

    Then we can "get rid of" the Bézier matrix on the right by left-multiply both with the inverse of the Bézier matrix:

    - - -

    A matrix times its inverse is the matrix equivalent of 1, and because "something times 1" is the same as "something", so we can just outright remove any matrix/inverse pair:

    - - -

    And now we're basically done. We just multiply those two matrices and we know what V is:

    - - -

    We now have the final piece of our function puzzle. Let's run through each step.

    -
      -
    1. Start with the Catmull-Rom function:
    2. -
    - - -
      -
    1. rewrite to pure coordinate form:
    2. -
    - - -
      -
    1. rewrite for "normal" coordinate vector:
    2. -
    - - -
      -
    1. merge the inner matrices:
    2. -
    - - -
      -
    1. rewrite for Bézier matrix form:
    2. -
    - - -
      -
    1. and transform the coordinates so we have a "pure" Bézier expression:
    2. -
    - - -

    And we're done: we finally know how to convert these two curves!

    -
    + +

    And we're done: we finally know how to convert these two curves!

    + -

    If we have a Catmull-Rom curve defined by four coordinates P1 through P4, then we can draw that curve using a Bézier curve that has the vector:

    - - -

    Similarly, if we have a Bézier curve defined by four coordinates P1 through P4, we can draw that using a standard tension Catmull-Rom curve with the following coordinate values:

    - - -

    Or, if your API allows you to specify Catmull-Rom curves using plain coordinates:

    - - - -
    -
    -

    Creating a Catmull-Rom curve from three points

    -

    Much shorter than the previous section: we saw that Catmull-Rom curves need at least 4 points to draw anything sensible, so how do we create a Catmull-Rom curve from three points?

    -

    Short and sweet: we don't.

    -

    We run through the maths that lets us create a cubic Bézier curve, and then convert its coordinates to Catmull-Rom form using the conversion formulae we saw above.

    - -
    -
    -

    Forming poly-Bézier curves

    -

    Much like lines can be chained together to form polygons, Bézier curves can be chained together to form poly-Béziers, and the only trick required is to make sure that:

    -
      -
    1. the end point of each section is the starting point of the following section, and
    2. -
    3. the derivatives across that dual point line up.
    4. -
    -

    Unless you want sharp corners, of course. Then you don't even need 2.

    -

    We'll cover three forms of poly-Bézier curves in this section. First, we'll look at the kind that just follows point 1. where the end point of a segment is the same point as the start point of the next segment. This leads to poly-Béziers that are pretty hard to work with, but they're the easiest to implement:

    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -

    Dragging the control points around only affects the curve segments that the control point belongs to, and moving an on-curve point leaves the control points where they are, which is not the most useful for practical modelling purposes. So, let's add in the logic we need to make things a little better. We'll start by linking up control points by ensuring that the "incoming" derivative at an on-curve point is the same as it's "outgoing" derivative:

    - - -

    We can effect this quite easily, because we know that the vector from a curve's last control point to its last on-curve point is equal to the derivative vector. If we want to ensure that the first control point of the next curve matches that, all we have to do is mirror that last control point through the last on-curve point. And mirroring any point A through any point B is really simple:

    - - -

    So let's implement that and see what it gets us. The following two graphics show a quadratic and a cubic poly-Bézier curve again, but this time moving the control points around moves others, too. However, you might see something unexpected going on for quadratic curves...

    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -

    As you can see, quadratic curves are particularly ill-suited for poly-Bézier curves, as all the control points are effectively linked. Move one of them, and you move all of them. Not only that, but if we move the on-curve points, it's possible to get a situation where a control point cannot satisfy the constraint that it's the reflection of its two neighbouring control points... This means that we cannot use quadratic poly-Béziers for anything other than really, really simple shapes. And even then, they're probably the wrong choice. Cubic curves are pretty decent, but the fact that the derivatives are linked means we can't manipulate curves as well as we might if we relaxed the constraints a little.

    -

    So: let's relax the requirement a little.

    -

    We can change the constraint so that we still preserve the angle of the derivatives across sections (so transitions from one section to the next will still look natural), but give up the requirement that they should also have the same vector length. Doing so will give us a much more useful kind of poly-Bézier curve:

    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -

    Cubic curves are now better behaved when it comes to dragging control points around, but the quadratic poly-Bézier still has the problem that moving one control points will move the control points and may ending up defining "the next" control point in a way that doesn't work. Quadratic curves really aren't very useful to work with...

    -

    Finally, we also want to make sure that moving the on-curve coordinates preserves the relative positions of the associated control points. With that, we get to the kind of curve control that you might be familiar with from applications like Photoshop, Inkscape, Blender, etc.

    - - - Скрипти вимкнено. показує резервний. - - - - - - Скрипти вимкнено. показує резервний. - - - -

    Again, we see that cubic curves are now rather nice to work with, but quadratic curves have a new, very serious problem: we can move an on-curve point in such a way that we can't compute what needs to "happen next". Move the top point down, below the left and right points, for instance. There is no way to preserve correct control points without a kink at the bottom point. Quadratic curves: just not that good...

    -

    A final improvement is to offer fine-level control over which points behave which, so that you can have "kinks" or individually controlled segments when you need them, with nicely well-behaved curves for the rest of the path. Implementing that, is left as an exercise for the reader.

    - -
    -
    -

    Curve offsetting

    -

    Perhaps you're like me, and you've been writing various small programs that use Bézier curves in some way or another, and at some point you make the step to implementing path extrusion. But you don't want to do it pixel based; you want to stay in the vector world. You find that extruding lines is relatively easy, and tracing outlines is coming along nicely (although junction caps and fillets are a bit of a hassle), and then you decide to do things properly and add Bézier curves to the mix. Now you have a problem.

    -

    Unlike lines, you can't simply extrude a Bézier curve by taking a copy and moving it around, because of the curvatures; rather than a uniform thickness, you get an extrusion that looks too thin in places, if you're lucky, but more likely will self-intersect. The trick, then, is to scale the curve, rather than simply copying it. But how do you scale a Bézier curve?

    -

    Bottom line: you can't. So you cheat. We're not going to do true curve scaling, or rather curve offsetting, because that's impossible. Instead we're going to try to generate 'looks good enough' offset curves.

    -
    - -

    "What do you mean, you can't? Prove it."

    -

    First off, when I say "you can't," what I really mean is "you can't offset a Bézier curve with another Bézier curve", not even by using a really high order curve. You can find the function that describes the offset curve, but it won't be a polynomial, and as such it cannot be represented as a Bézier curve, which has to be a polynomial. Let's look at why this is:

    -

    From a mathematical point of view, an offset curve O(t) is a curve such that, given our original curve B(t), any point on O(t) is a fixed distance d away from coordinate B(t). So let's math that:

    - - -

    However, we're working in 2D, and d is a single value, so we want to turn it into a vector. If we want a point distance d "away" from the curve B(t) then what we really mean is that we want a point at d times the "normal vector" from point B(t), where the "normal" is a vector that runs perpendicular ("at a right angle") to the tangent at B(t). Easy enough:

    - - -

    Now this still isn't very useful unless we know what the formula for N(t) is, so let's find out. N(t) runs perpendicular to the original curve tangent, and we know that the tangent is simply B'(t), so we could just rotate that 90 degrees and be done with it. However, we need to ensure that N(t) has the same magnitude for every t, or the offset curve won't be at a uniform distance, thus not being an offset curve at all. The easiest way to guarantee this is to make sure N(t) always has length 1, which we can achieve by dividing B'(t) by its magnitude:

    - - -

    Determining the length requires computing an arc length, and this is where things get Tricky with a capital T. First off, to compute arc length from some start a to end b, we must use the formula we saw earlier. Noting that "length" is usually denoted with double vertical bars:

    - - -

    So if we want the length of the tangent, we plug in B'(t), with t = 0 as start and -t = 1 as end:

    - - -

    And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is screwing everything up, because it turns our nice polynomials into things that are no longer polynomials.

    -

    There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine because they do, but they don't; remember that only the base function has binomial coefficients. That's before we factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call them circles, and they have much simpler functions than Bézier curves.

    -

    So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve.

    -

    And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the real offset curve would look like, if we could compute it.

    -
    + +

    + And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is + screwing everything up, because it turns our nice polynomials into things that are no longer polynomials. +

    +

    + There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with + unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine + because they do, but they don't; remember that only the base function has binomial coefficients. That's before we + factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make + all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call + them circles, and they have much simpler functions than Bézier curves. +

    +

    + So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means + N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means + that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for + O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve. +

    +

    + And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier + curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, + much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the + real offset curve would look like, if we could compute it. +

    + -

    So, you cannot offset a Bézier curve perfectly with another Bézier curve, no matter how high-order you make that other Bézier curve. However, we can chop up a curve into "safe" sub-curves (where "safe" means that all the control points are always on a single side of the baseline, and the midpoint of the curve at t=0.5 is roughly in the center of the polygon defined by the curve coordinates) and then point-scale each sub-curve with respect to its scaling origin (which is the intersection of the point normals at the start and end points).

    -

    A good way to do this reduction is to first find the curve's extreme points, as explained in the earlier section on curve extremities, and use these as initial splitting points. After this initial split, we can check each individual segment to see if it's "safe enough" based on where the center of the curve is. If the on-curve point for t=0.5 is too far off from the center, we simply split the segment down the middle. Generally this is more than enough to end up with safe segments.

    -

    The following graphics show off curve offsetting, and you can use the slider to control the distance at which the curve gets offset. The curve first gets reduced to safe segments, each of which is then offset at the desired distance. Especially for simple curves, particularly easily set up for quadratic curves, no reduction is necessary, but the more twisty the curve gets, the more the curve needs to be reduced in order to get segments that can safely be scaled.

    - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - -

    You may notice that this may still lead to small 'jumps' in the sub-curves when moving the curve around. This is caused by the fact that we're still performing a naive form of offsetting, moving the control points the same distance as the start and end points. If the curve is large enough, this may still lead to incorrect offsets.

    - -
    -
    -

    Graduated curve offsetting

    -

    What if we want to do graduated offsetting, starting at some distance s but ending at some other distance e? Well, if we can compute the length of a curve (which we can if we use the Legendre-Gauss quadrature approach) then we can also determine how far "along the line" any point on the curve is. With that knowledge, we can offset a curve so that its offset curve is not uniformly wide, but graduated between with two different offset widths at the start and end.

    -

    Like normal offsetting we cut up our curve in sub-curves, and then check at which distance along the original curve each sub-curve starts and ends, as well as to which point on the curve each of the control points map. This gives us the distance-along-the-curve for each interesting point in the sub-curve. If we call the total length of all sub-curves seen prior to seeing "the current" sub-curve S (and if the current sub-curve is the first one, S is zero), and we call the full length of our original curve L, then we get the following graduation values:

    -
      -
    • start: map S from interval (0,L) to interval (s,e)
    • -
    • c1: map(<strong>S+d1</strong>, 0,L, s,e), d1 = distance along curve to projection of c1
    • -
    • c2: map(<strong>S+d2</strong>, 0,L, s,e), d2 = distance along curve to projection of c2
    • -
    • ...
    • -
    • end: map(<strong>S+length(subcurve)</strong>, 0,L, s,e)
    • -
    -

    At each of the relevant points (start, end, and the projections of the control points onto the curve) we know the curve's normal, so offsetting is simply a matter of taking our original point, and moving it along the normal vector by the offset distance for each point. Doing so will give us the following result (these have with a starting width of 0, and an end width of 40 pixels, but can be controlled with your up and down arrow keys):

    - - - Скрипти вимкнено. показує резервний. - - - - - - - - Скрипти вимкнено. показує резервний. - - - - - - -
    -
    -

    Circles and quadratic Bézier curves

    -

    Circles and Bézier curves are very different beasts, and circles are infinitely easier to work with than Bézier curves. Their formula is much simpler, and they can be drawn more efficiently. But, sometimes you don't have the luxury of using circles, or ellipses, or arcs. Sometimes, all you have are Bézier curves. For instance, if you're doing font design, fonts have no concept of geometric shapes, they only know straight lines, and Bézier curves. OpenType fonts with TrueType outlines only know quadratic Bézier curves, and OpenType fonts with Type 2 outlines only know cubic Bézier curves. So how do you draw a circle, or an ellipse, or an arc?

    -

    You approximate.

    -

    We already know that Bézier curves cannot model all curves that we can think of, and this includes perfect circles, as well as ellipses, and their arc counterparts. However, we can certainly approximate them to a degree that is visually acceptable. Quadratic and cubic curves offer us different curvature control, so in order to approximate a circle we will first need to figure out what the error is if we try to approximate arcs of increasing degree with quadratic and cubic curves, and where the coordinates even lie.

    -

    Since arcs are mid-point-symmetrical, we need the control points to set up a symmetrical curve. For quadratic curves this means that the control point will be somewhere on a line that intersects the baseline at a right angle. And we don't get any choice on where that will be, since the derivatives at the start and end point have to line up, so our control point will lie at the intersection of the tangents at the start and end point.

    -

    First, let's try to fit the quadratic curve onto a circular arc. In the following sketch you can move the mouse around over a unit circle, to see how well, or poorly, a quadratic curve can approximate the arc from (1,0) to where your mouse cursor is:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    As you can see, things go horribly wrong quite quickly; even trying to approximate a quarter circle using a quadratic curve is a bad idea. An eighth of a turns might look okay, but how okay is okay? Let's apply some maths and find out. What we're interested in is how far off our on-curve coordinates are with respect to a circular arc, given a specific start and end angle. We'll be looking at how much space there is between the circular arc, and the quadratic curve's midpoint.

    -

    We start out with our start and end point, and for convenience we will place them on a unit circle (a circle around 0,0 with radius 1), at some angle φ:

    - - -

    What we want to find is the intersection of the tangents, so we want a point C such that:

    - - -

    i.e. we want a point that lies on the vertical line through S (at some distance a from S) and also lies on the tangent line through E (at some distance b from E). Solving this gives us:

    - - -

    First we solve for b:

    - - -

    which yields:

    - - -

    which we can then substitute in the expression for a:

    - - -

    A quick check shows that plugging these values for a and b into the expressions for Cx and Cy give the same x/y coordinates for both "a away from A" and "b away from B", so let's continue: now that we know the coordinate values for C, we know where our on-curve point T for t=0.5 (or angle φ/2) is, because we can just evaluate the Bézier polynomial, and we know where the circle arc's actual point P is for angle φ/2:

    - - -

    We compute T, observing that if t=0.5, the polynomial values (1-t)², 2(1-t)t, and t² are 0.25, 0.5, and 0.25 respectively:

    - - -

    Which, worked out for the x and y components, gives:

    - - -

    And the distance between these two is the standard Euclidean distance:

    - - -

    So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter circle and eighth circle?

    -
    - - plotted for 0 ≤ φ ≤ π: - - - plotted for 0 ≤ φ ≤ ½π: - - - - - plotted for 0 ≤ φ ≤ ¼π: -
    + +

    + So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter + circle and eighth circle? +

    + + + + + + + + +
    + + plotted for 0 ≤ φ ≤ π: + + + plotted for 0 ≤ φ ≤ ½π: + + + + + plotted for 0 ≤ φ ≤ ¼π: +
    -

    We now see why the eighth circle arc looks decent, but the quarter circle arc doesn't: an error of roughly 0.06 at t=0.5 means we're 6% off the mark... we will already be off by one pixel on a circle with pixel radius 17. Any decent sized quarter circle arc, say with radius 100px, will be way off if approximated by a quadratic curve! For the eighth circle arc, however, the error is only roughly 0.003, or 0.3%, which explains why it looks so close to the actual eighth circle arc. In fact, if we want a truly tiny error, like 0.001, we'll have to contend with an angle of (rounded) 0.593667, which equates to roughly 34 degrees. We'd need 11 quadratic curves to form a full circle with that precision! (technically, 10 and ten seventeenth, but we can't do partial curves, so we have to round up). That's a whole lot of curves just to get a shape that can be drawn using a simple function!

    -

    In fact, let's flip the function around, so that if we plug in the precision error, labelled ε, we get back the maximum angle for that precision:

    - - -

    And frankly, things are starting to look a bit ridiculous at this point, we're doing way more maths than we've ever done, but thankfully this is as far as we need the maths to take us: If we plug in the precisions 0.1, 0.01, 0.001 and 0.0001 we get the radians values 1.748, 1.038, 0.594 and 0.3356; in degrees, that means we can cover roughly 100 degrees (requiring four curves), 59.5 degrees (requiring six curves), 34 degrees (requiring 11 curves), and 19.2 degrees (requiring a whopping nineteen curves).

    -

    The bottom line? Quadratic curves are kind of lousy if you want circular (or elliptical, which are circles that have been squashed in one dimension) curves. We can do better, even if it's just by raising the order of our curve once. So let's try the same thing for cubic curves.

    - -
    -
    -

    Circular arcs and cubic Béziers

    -

    Let's look at approximating circles and circular arcs using cubic Béziers. How much better is that?

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    At cursory glance, a fair bit better, but let's find out how much better by looking at how to construct the Bézier curve.

    -

    A construction diagram for a cubic approximation of a circular arc

    -

    The start and end points are trivial, but the mid point requires a bit of work, but it's mostly basic trigonometry once we know the angle θ for our circular arc: if we scale our circular arc to a unit circle, we can always start our arc, with radius 1, at (1,0) and then given our arc angle θ, we also know that the circular arc has length θ (because unit circles are nice that way). We also know our end point, because that's just (cos(θ), sin(θ)), and so the challenge is to figure out what control points we need in order for the curve at t=0.5 to exactly touch the circular arc at the angle θ/2:

    -

    So let's again formally describe this:

    - - -

    Only P3 isn't quite straight-forward here, and its description is based on the fact that the triangle (origin, P4, P3) is a right angled triangle, with the distance between the origin and P4 being 1 (because we're working with a unit circle), and the distance between P4 and P3 being k, so that we can represent P3 as "The point P4 plus the vector from the origin to P4 but then rotated a quarter circle, counter-clockwise, and scaled by k".

    -

    With that, we can determine the y-coordinates for A, B, e1, and e2, after which we have all the information we need to determine what the value of k is. We can find these values by using (no surprise here) linear interpolation between known points, as A is midway between P2 and P3, e1 is between A and "midway between P1 and P2" (which is "half height" P2), and so forth:

    - - -

    Which now gives us two identities for B, because in addition to determining B through linear interpolation, we also know that B's y coordinate is just sin(θ/2): we started this exercise by saying we were going to approximate the circular arc using a Bézier curve that had its midpoint, which is point B, touching the unit circle at the arc's half-angle, by definition making B the point at (cos(θ/2), sin(θ/2)).

    -

    This means we can equate the two identities we now have for By and solve for k.

    -
    - -

    Deriving k

    -

    Solving for k is fairly straight forward, but it's a fair few steps, and if you just the immediate result: using a tool like Wolfram Alpha is definitely the way to go. That said, let's get going:

    - - -

    And finally, we can take further advantage of several trigonometric identities to drastically simplify our formula for k:

    - - -

    And we're done.

    -
    + +

    And we're done.

    + -

    So, the distance of our control points to the start/end points can be expressed as a number that we get from an almost trivial expression involving the circular arc's angle:

    - - -

    Which means that for any circular arc with angle θ and radius r, our Bézier approximation based on three points of incidence is:

    - - -

    Which also gives us the commonly found value of 0.55228 for quarter circles, based on them having an angle of half π:

    - - -

    And thus giving us the following Bézier coordinates for a quarter circle of radius r:

    - - -
    + +
    +

    So, how accurate is this?

    +

    + Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points + that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum + deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but + rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we + get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff. +

    +

    + So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, + finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around + that t value, repeating as many times as necessary to get us an arbitrarily precise value of t: +

    -

    So, how accurate is this?

    -

    Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff.

    -

    So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around that t value, repeating as many times as necessary to get us an arbitrarily precise value of t:

    - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    Plus, how often do you get to write a function with that name?

    -

    Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity first, and then coming back down as we approach 2π)

    - - -
    - - - - - -
    - error plotted for 0 ≤ φ ≤ 2π - - error plotted for 0 ≤ φ ≤ π - - error plotted for 0 ≤ φ ≤ ½π -
    + return getMostWrongT(radius, bezier, worst_t - stepsize, worst_t + stepsize) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again:

    -

    +

    Plus, how often do you get to write a function with that name?

    +

    + Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see + what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity + first, and then coming back down as we approach 2π) +

    + + + + + + + + + + + + + +
    + + + + + +
    + error plotted for 0 ≤ φ ≤ 2π + + error plotted for 0 ≤ φ ≤ π + + error plotted for 0 ≤ φ ≤ ½π +
    -

    Yeah... the error of a cubic approximation for a quarter circle turns out to be two orders of magnitude better. At approximately 0.00027 (or: just shy of being 2.7 pixels off for a circle with a radius of 10,000 pixels) the increase in precision over quadratic curves is quite spectacular - certainly good enough that no one in their right mind should ever use quadratic curves.

    -
    +

    + That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we + can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again: +

    +

    -

    So that's it, kappa is 4/3 · tan(θ/4) , we're done! ...or are we?

    -

    Can we do better?

    -

    Technically: yes, we can. But I'm going to prefix this section with "we can, and we should investigate that possibility, but let me warn you up front that the result is only better if we're going to hard-code the values". We're about to get into the weeds and the standard three-points-of-incidence value is so good already that for most applications, trying to do better won't make any sense at all.

    -

    So with that said: what we calculated above is an upper bound for a best fit Bézier curve for a circular arc: anywhere we don't touch the circular arc in our approximation, we've "overshot" the arc. What if we dropped our value for k just a little, so that the curve starts out as an over-estimation, but then crosses the circular arc, yielding an region of underestimation, and then crosses the circular arc again, with another region of overestimation. This might give us a lower overall error, so let's see what we can do.

    -

    First, let's express the total error (given circular arc angle θ, and some k) using standard calculus notation:

    - - -

    This says that the error function for a given angle and value of k is equal to the "infinite" sum of differences between our curve and the circular arc, as we run t from 0 to 1, using an infinitely small step size. between subsequent t values.

    -

    Now, since we want to find the minimal error, that means we want to know where along this function things go from "error is getting progressively less" to "error is increasing again", which means we want to know where its derivative is zero, which as mathematical expression looks like:

    - - -

    And here we have the most direct application of the Fundamental Theorem of Calculus: the derivative and integral are each other's inverse operations, so they cancel out, leaving us with our original function:

    - - -

    And now we just solve for that... oh wait. We've seen this before. In order to solve this, we'd end up needing to solve this:

    - - -

    And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section on arc lengths—there is no symbolic solution for this equasion. Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer!

    -
    + +

    + And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section + on arc lengths—there is no symbolic solution for this equasion. + Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer! +

    +
    +

    Iterating on a solution

    +

    + By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the + value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's + center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just + binary search our way to the most accurate value for c that gets us that middle case. +

    +

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    -

    Iterating on a solution

    -

    By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just binary search our way to the most accurate value for c that gets us that middle case.

    -

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    - - - - -
    1 - -
    2
    3
    4
    - -

    And then the binary search algorithm, which can be found in pretty much any CS textbook, as well as more online articles, tutorials, and blog posts than you can ever read in a life time:

    + return binarySearch(radius, angle, points, lowerBound, upperBound) + + + + 2 + + + 3 + + + 4 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Using the following radialError function, which samples the curve's approximation of the circular arc over several points (although the first and last point will never contribute anything, so we skip them):

    + return binarySearch(radius, angle, points, value, upperBound) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a vector, we can get its length to the origin using a magnitude call.

    -

    Examining the result

    -

    Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, for each angle, plot what the difference between the circular arc and the Bézier approximation looks like:

    -

    image-20210419085430711

    -

    Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from 0 to θ:

    -

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    - - -
    - - - - - -
    - max deflection using unit scale - - max deflection at 10x scale - - max deflection at 100x scale -
    + return err + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    -

    As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference.

    -
    +

    + In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a + vector, we can get its length to the origin using a magnitude call. +

    +

    Examining the result

    +

    + Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, + for each angle, plot what the difference between the circular arc and the Bézier approximation looks like: +

    +

    image-20210419085430711

    +

    + Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the + plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from + 0 to θ: +

    +

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    + + + + + + + + + + + + + +
    + + + + + +
    + max deflection using unit scale + + max deflection at 10x scale + + max deflection at 100x scale +
    -

    At this point it should be clear that while, yes, there are improvement to be had, they're essentially insignificant while also being much more computationally expensive.

    -

    TL;DR: just tell me which value I should be using

    -

    It depends on what we need to do. If we just want the best value for quarter circles, and we're going to hard code the value for k, then there is no reason to hard-code the constant k=4/3*tan(pi/8) when you can just as easily hard-code the constant as k=0.551784777779014 instead.

    -

    If you need "the" value for quarter circles, use 0.551785 instead of 0.55228

    -

    However, for dynamic arc approximation, in code that tries to fit circular paths using Bézier paths instead, it should be fairly obvious that the simple function involving a tangent computation, two divisions, and one multiplication, is vastly more performant than running all the code we ended writing just to get a 25% lower error value, and most certainly worth preferring over getting the "more accurate" value.

    -

    If you need to fit Béziers to circular arcs on the fly, use 4/3 * tan(θ/4)

    -

    However, always remember that if you're writing for humans, you can typically use the best of both worlds: as the user interacts with their curves, you should draw their curves instead of drawing approximations of them. If they need to draw circles or circular arcs, draw those, and only approximate them with a Bézier curve when the data needs to be exported to a format that doesn't support those. Ideally with a preview mechanism that highlights where the errors will be, and how large they will be.

    -

    If you're writing code for graphics design by humans, use circular arcs for circular arcs

    -

    And that's it. We have pretty well exhausted this subject. There are different metrics we could use to find "different best k values", like trying to match arc length (e.g. when we're optimizing for material cost), or minimizing the area between the circular arc and the Bézier curve (e.g. when we're optimizing for inking), or minimizing the rate of change of the Bézier's curvature (e.g. when we're optimizing for curve traversal) and they all yield values that are so similar that it's almost certainly not worth it. (For instance, for quarter circle approximations those values are 0.551777, 0.5533344, and 0.552184 respectively. Much like the 0.551785 we get from minimizing the maximum deflection, none of these values are significantly better enough to prefer them over the upper bound value).

    +

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    +

    + As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by + 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an + almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference. +

    +
    -
    -
    -

    Approximating Bézier curves with circular arcs

    -

    Let's look at doing the exact opposite of the previous section: rather than approximating circular arc using Bézier curves, let's approximate Bézier curves using circular arcs.

    -

    We already saw in the section on circle approximation that this will never yield a perfect equivalent, but sometimes you need circular arcs, such as when you're working with fabrication machinery, or simple vector languages that understand lines and circles, but not much else.

    -

    The approach is fairly simple: pick a starting point on the curve, and pick two points that are further along the curve. Determine the circle that goes through those three points, and see if it fits the part of the curve we're trying to approximate. Decent fit? Try spacing the points further apart. Bad fit? Try spacing the points closer together. Keep doing this until you've found the "good approximation/bad approximation" boundary, record the "good" arc, and then move the starting point up to overlap the end point we previously found. Rinse and repeat until we've covered the entire curve.

    -

    We already saw how to fit a circle through three points in the section on creating a curve from three points, and finding the arc through those points is straight-forward: pick one of the three points as start point, pick another as an end point, and the arc has to necessarily go from the start point, to the end point, over the remaining point.

    -

    So, how can we convert a Bézier curve into a (sequence of) circular arc(s)?

    -
      -
    • Start at t=0
    • -
    • Pick two points further down the curve at some value m = t + n and e = t + 2n
    • -
    • Find the arc that these points define
    • -
    • Determine how close the found arc is to the curve:
        -
      • Pick two additional points e1 = t + n/2 and e2 = t + n + n/2.
      • -
      • These points, if the arc is a good approximation of the curve interval chosen, should - lie on the circle, so their distance to the center of the circle should be the - same as the distance from any of the three other points to the center.
      • -
      • For point points, determine the (absolute) error between the radius of the circle, and the -actual distance from the center of the circle to the point on the curve.
      • -
      • If this error is too high, we consider the arc bad, and try a smaller interval.
      • -
      -
    • -
    -

    The result of this is shown in the next graphic: we start at a guaranteed failure: s=0, e=1. That's the entire curve. The midpoint is simply at t=0.5, and then we start performing a binary search.

    -
      -
    1. We start with low=0, mid=0.5 and high=1
    2. -
    3. That'll fail, so we retry with the interval halved: {0, 0.25, 0.5}
        -
      • If that arc's good, we move back up by half distance: {0, 0.375, 0.75}.
      • -
      • However, if the arc was still bad, we move down by half the distance: {0, 0.125, 0.25}.
      • -
      -
    4. -
    5. We keep doing this over and over until we have two arcs, in sequence, of which the first arc is good, and the second arc is bad. When we find that pair, we've found the boundary between a good approximation and a bad approximation, and we pick the good arc.
    6. -
    -

    The following graphic shows the result of this approach, with a default error threshold of 0.5, meaning that if an arc is off by a combined half pixel over both verification points, then we treat the arc as bad. This is an extremely simple error policy, but already works really well. Note that the graphic is still interactive, and you can use your up and down arrow keys keys to increase or decrease the error threshold, to see what the effect of a smaller or larger error threshold is.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    With that in place, all that's left now is to "restart" the procedure by treating the found arc's end point as the new to-be-determined arc's starting point, and using points further down the curve. We keep trying this until the found end point is for t=1, at which point we are done. Again, the following graphic allows for up and down arrow key input to increase or decrease the error threshold, so you can see how picking a different threshold changes the number of arcs that are necessary to reasonably approximate a curve:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    So... what is this good for? Obviously, if you're working with technologies that can't do curves, but can do lines and circles, then the answer is pretty straightforward, but what else? There are some reasons why you might need this technique: using circular arcs means you can determine whether a coordinate lies "on" your curve really easily (simply compute the distance to each circular arc center, and if any of those are close to the arc radii, at an angle between the arc start and end, bingo, this point can be treated as lying "on the curve"). Another benefit is that this approximation is "linear": you can almost trivially travel along the arcs at fixed speed. You can also trivially compute the arc length of the approximated curve (it's a bit like curve flattening). The only thing to bear in mind is that this is a lossy equivalence: things that you compute based on the approximation are guaranteed "off" by some small value, and depending on how much precision you need, arc approximation is either going to be super useful, or completely useless. It's up to you to decide which, based on your application!

    - -
    -
    -

    B-Splines

    -

    No discussion on Bézier curves is complete without also giving mention of that other beast in the curve design space: B-Splines. Easily confused to mean Bézier splines, that's not actually what they are; they are "basis function" splines, which makes a lot of difference, and we'll be looking at those differences in this section. We're not going to dive as deep into B-Splines as we have for Bézier curves (that would be an entire primer on its own) but we'll be looking at how B-Splines work, what kind of maths is involved in computing them, and how to draw them based on a number of parameters that you can pick for individual B-Splines.

    -

    First off: B-Splines are piecewise, polynomial interpolation curves, where the "single curve" is built by performing polynomial interpolation over a set of points, using a sliding window of a fixed number of points. For instance, a "cubic" B-Spline defined by twelve points will have its curve built by evaluating the polynomial interpolation of four points, and the curve can be treated as a lot of different sections, each controlled by four points at a time, such that the full curve consists of smoothly connected sections defined by points {1,2,3,4}, {2,3,4,5}, ..., {8,9,10,11}, and finally {9,10,11,12}, for eight sections.

    -

    What do they look like? They look like this! Tap on the graphic to add more points, and move points around to see how they map to the spline curve drawn.

    - - - Скрипти вимкнено. показує резервний. - - - -

    The important part to notice here is that we are not doing the same thing with B-Splines that we do for poly-Béziers or Catmull-Rom curves: both of the latter simply define new sections as literally "new sections based on new points", so a 12 point cubic poly-Bézier curve is actually impossible, because we start with a four point curve, and then add three more points for each section that follows, so we can only have 4, 7, 10, 13, 16, etc. point Poly-Béziers. Similarly, while Catmull-Rom curves can grow by adding single points, this addition of a single point introduces three implicit Bézier points. Cubic B-Splines, on the other hand, are smooth interpolations of each possible curve involving four consecutive points, such that at any point along the curve except for our start and end points, our on-curve coordinate is defined by four control points.

    -

    Consider the difference to be this:

    -
      -
    • for Bézier curves, the curve is defined as an interpolation of points, but:
    • -
    • for B-Splines, the curve is defined as an interpolation of curves.
    • -
    -

    In fact, let's look at that again, but this time with the base curves shown, too. Each consecutive four points define one curve:

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have a look at how things work.

    -

    How to compute a B-Spline curve: some maths

    -

    Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the end, just like for Bézier curves), by evaluating the following function:

    - + +

    + In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have + a look at how things work. +

    +

    How to compute a B-Spline curve: some maths

    +

    + Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic + B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the + end, just like for Bézier curves), by evaluating the following function: +

    + - -

    Which, honestly, doesn't tell us all that much. All we can see is that a point on a B-Spline curve is defined as "a mix of all the control points, weighted somehow", where the weighting is achieved through the N(...) function, subscripted with an obvious parameter i, which comes from our summation, and some magical parameter k. So we need to know two things: 1. what does N(t) do, and 2. what is that k? Let's cover both, in reverse order.

    -

    The parameter k represents the "knot interval" over which a section of curve is defined. As we learned earlier, a B-Spline curve is itself an interpolation of curves, and we can treat each transition where a control point starts or stops influencing the total curvature as a "knot on the curve". -Doing so for a degree d B-Spline with n control point gives us d + n + 1 knots, defining d + n intervals along the curve, and it is these intervals that the above k subscript to the N() function applies to.

    -

    Then the N() function itself. What does it look like?

    - - -

    So this is where we see the interpolation: N(t) for an (i,k) pair (that is, for a step in the above summation, on a specific knot interval) is a mix between N(t) for (i,k-1) and N(t) for (i+1,k-1), so we see that this is a recursive iteration where i goes up, and k goes down, so it seem reasonable to expect that this recursion has to stop at some point; obviously, it does, and specifically it does so for the following i/k values:

    - - -

    And this function finally has a straight up evaluation: if a t value lies within a knot-specific interval once we reach a k=1 value, it "counts", otherwise it doesn't. We did cheat a little, though, because for all these values we need to scale our t value first, so that it lies in the interval bounded by knots[d] and knots[n], which are the start point and end point where curvature is controlled by exactly order control points. For instance, for degree 3 (=order 4) and 7 control points, with knot vector [1,2,3,4,5,6,7,8,9,10,11], we map t from [the interval 0,1] to the interval [4,8], and then use that value in the functions above, instead.

    -

    Can we simplify that?

    -

    We can, yes.

    -

    People far smarter than us have looked at this work, and two in particular — Maurice Cox and Carl de Boor — came to a mathematically pleasing solution: to compute a point P(t), we can compute this point by evaluating d(t) on a curve section between knots i and i+1:

    - - -

    This is another recursive function, with k values decreasing from the curve order to 1, and the value α (alpha) defined by:

    - - -

    That looks complicated, but it's not. Computing alpha is just a fraction involving known, plain numbers. And, once we have our alpha value, we also have (1-alpha) because it's a trivial subtraction. Computing the d() function is thus mostly a matter of computing pretty simple arithmetical statements, with some caching of results so we can refer to them as we recurve. While the recursion might see computationally expensive, the total algorithm is cheap, as each step only involves very simple maths.

    -

    Of course, the recursion does need a stop condition:

    - - -

    So, we actually see two stopping conditions: either i becomes 0, in which case d() is zero, or k becomes zero, in which case we get the same "either 1 or 0" that we saw in the N() function above.

    -

    Thanks to Cox and de Boor, we can compute points on a B-Spline pretty easily using the same kind of linear interpolation we saw in de Casteljau's algorithm. For instance, if we write out d() for i=3 and k=3, we get the following recursion diagram:

    - - -

    That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. It's really quite elegant.

    -

    One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then summing back up "to the left". We can just start on the right and work our way left immediately.

    -

    Running the computation

    -

    Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on this Michigan Tech page, but an easier to read version is implemented by b-spline.js, so we'll look at its code.

    -

    Given an input value t, we first map the input to a value from the domain [0,1] to the domain [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped t value lies on:

    + +

    + That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a + mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are + themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. + It's really quite elegant. +

    +

    + One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the + d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so + in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point + vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then + summing back up "to the left". We can just start on the right and work our way left immediately. +

    +

    Running the computation

    +

    + Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, + and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on + this Michigan Tech page, but an easier to read version + is implemented by b-spline.js, so we'll look at its code. +

    +

    + Given an input value t, we first map the input to a value from the domain [0,1] to the domain + [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped + t value lies on: +

    - - -
    1 - -
    2
    3
    - -

    after running this code, s is the index for the section the point will lie on. We then run the algorithm mentioned on the MU page (updated to use this description's variable names):

    +} + + + + 2 + + + 3 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those v[i-1] don't try to use an array index that doesn't exist)

    -

    Open vs. closed paths

    -

    Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need more than just a single point.

    -

    Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than separate coordinate objects.

    -

    Manipulating the curve through the knot vector

    -

    The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at:

    -
      -
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. -
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. -
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. -
    7. we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a uniform vector in between.
    8. -
    -

    Uniform B-Splines

    -

    The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to the curvature.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve.

    -

    The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get rid of these, by being clever about how we apply the following uniformity-breaking approach instead...

    -

    Reducing local curve complexity by collapsing intervals

    -

    Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost and the curve "kinks".

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Open-Uniform B-Splines

    -

    By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the curve not starting and ending where we'd kind of like it to:

    -

    For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences that determine the curve shape.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Non-uniform B-Splines

    -

    This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than that any value knots[k+1] should be greater than or equal to knots[k].

    -

    One last thing: Rational B-Splines

    -

    While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like for rational Bézier curves.

    - - - Скрипти вимкнено. показує резервний. - - - - - -

    Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in arbitrary-precision 2D design due to the level of control a NURBS curve offers designers.

    -

    While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the last.

    -

    Extending our implementation to cover rational splines

    -

    The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the extended dimension.

    -

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    -

    We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how many dimensions it needs to interpolate.

    -

    In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight information.

    -

    Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing (x'/w', y'/w'). And that's it, we're done!

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    - +

    + (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the + interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those + v[i-1] don't try to use an array index that doesn't exist) +

    +

    Open vs. closed paths

    +

    + Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last + point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first + and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As + such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly + more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need + more than just a single point. +

    +

    + Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for + points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same + coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing + references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than + separate coordinate objects. +

    +

    Manipulating the curve through the knot vector

    +

    + The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As + mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the + values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on + intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting + things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at: +

    +
      +
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. +
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. +
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. +
    7. + we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a + uniform vector in between. +
    8. +
    +

    Uniform B-Splines

    +

    + The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire + curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or + [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines + the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to + the curvature. +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    + This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every + interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve. +

    +

    + The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can + perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get + rid of these, by being clever about how we apply the following uniformity-breaking approach instead... +

    +

    Reducing local curve complexity by collapsing intervals

    +

    + Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the + sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, + and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost + and the curve "kinks". +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    Open-Uniform B-Splines

    +

    + By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the + curve not starting and ending where we'd kind of like it to: +

    +

    + For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in + which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values + N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector + [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences + that determine the curve shape. +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    Non-uniform B-Splines

    +

    + This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to + pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than + that any value knots[k+1] should be greater than or equal to knots[k]. +

    +

    One last thing: Rational B-Splines

    +

    + While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's + "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a + ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a + control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like + for rational Bézier curves. +

    + + + Скрипти вимкнено. показує резервний. + + + + + +

    + Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the + NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, + the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an + important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in + arbitrary-precision 2D design due to the level of control a NURBS curve offers designers. +

    +

    + While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform + Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, + they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the + last. +

    +

    Extending our implementation to cover rational splines

    +

    + The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the + control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to + one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the + extended dimension. +

    +

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    +

    + We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate + interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how + many dimensions it needs to interpolate. +

    +

    + In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the + final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight + information. +

    +

    + Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing + (x'/w', y'/w'). And that's it, we're done! +

    +
    +
    + -

    Comments and questions

    -

    First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing.

    -

    With that said, on to the comments!

    -
    +

    + + Comments and questions +

    +

    + First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how + to let me know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing. +

    +

    With that said, on to the comments!

    +
    +
    +
    +
    +
    -
    - -
    + -
    - - - - - - - - - \ No newline at end of file + + + diff --git a/docs/zh-CN/index.html b/docs/zh-CN/index.html index 9b876770..4351cda0 100644 --- a/docs/zh-CN/index.html +++ b/docs/zh-CN/index.html @@ -1,156 +1,167 @@ + + + + 贝塞尔曲线入门 - - - - 贝塞尔曲线入门 + - + - - - - + - - - + - :root[lang="zh-CN"] { - font-family: "华文细黑", "STXihei", "PingFang TC", "微软雅黑体", "Microsoft YaHei New", "微软雅黑", "Microsoft Yahei", "宋体", SimSun, "Helvetica Neue", Helvetica, Arial, sans-serif; - font-size: 16.7px; - } + + + + + + + + + + + + + + + - + + + + + + + - + + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - + + - - - - + + + - - + + -
    - This page on GitHub - - This page on GitHub - -
    +
    + This page on GitHub + + This page on GitHub + +
    -
    - - - submit to reddit - submit to hacker news - tweet your read - -
    +
    + + + submit to reddit + submit to hacker news + tweet your read + +
    - + +
    +

    + 贝塞尔曲线入门 +

    +

    一本免费的在线书籍,供你在非常需要了解如何处理贝塞尔相关的事情。

    +
    + 用母语阅读该资源: + +

    + (没有看到你的语言被列出,或者想看到它达到100%翻译? + 帮助翻译本资源!) +

    +
    -
    - -

    贝塞尔曲线入门

    -

    一本免费的在线书籍,供你在非常需要了解如何处理贝塞尔相关的事情。

    -
    - 用母语阅读该资源: - -

    (没有看到你的语言被列出,或者想看到它达到100%翻译? 帮助翻译本资源!)

    -
    - -

    - 欢迎来到贝塞尔曲线入门。这是一个免费的网站/电子书,涉及贝塞尔曲线的数学和编程的的部分,涵盖的 - 主题很多,包括绘制和处理那条似乎无处不在的曲线,从 Photoshop 的路径到 CSS 缓动函数,再到字体 - 轮廓描述。 -

    -

    - 如果这是你第一次来到这里,那么欢迎你!告诉我任何你想要在这里找到但却没有涵盖的内容,请通过 - issue tracker 反馈! -

    - -

    捐款和赞助

    +

    + 欢迎来到贝塞尔曲线入门。这是一个免费的网站/电子书,涉及贝塞尔曲线的数学和编程的的部分,涵盖的 + 主题很多,包括绘制和处理那条似乎无处不在的曲线,从 Photoshop 的路径到 CSS 缓动函数,再到字体 轮廓描述。 +

    +

    + 如果这是你第一次来到这里,那么欢迎你!告诉我任何你想要在这里找到但却没有涵盖的内容,请通过 + issue tracker 反馈! +

    -

    - 如果将此资源用于你的研究,作为工作参考,甚至是你编写的软件,可以考虑 捐款 - (任何数额都可)或注册成为 Patreon 上的资助人。 - 我在这方面的工作是没有报酬的,所以如果你觉得这个网站有价值,并且希望它能长期存在下去,那么在过去的几年 - 里,我喝了大量的咖啡来编写这个网站。如果你能捐助出一杯咖啡,你就在帮助保持这个资源的生存下去并良好发展。 -

    -

    - 此外,如果你是一家公司,你的员工使用这本书作为资源,或者你把它作为一个入职培训资源,那么请:考虑赞助此网站! - 我很高兴与您合作。我非常乐意和财务部门合作,提供赞助发票和表彰。 -

    - +

    捐款和赞助

    - -
    +
    -

    - — Pomax -

    - + +
    - -
    - -
    - -

    序言

    -

    我们通常用线条来绘制2D图形,大致分为两种线条:直线和曲线。不论我们动手还是用电脑,都能很容易地画出第一种线条。只要给电脑起点和终点,砰!直线就画出来了。没什么好疑问的。

    -

    然而,绘制曲线却是个大问题。虽然我们可以很容易地徒手画出曲线,但除非给出描述曲线的数学函数,不然计算机无法画出曲线。实际上,画直线时也需要数学函数,但画直线所需的方程式很简单,我们在这里不去考虑。在计算机看来,所有线条都是“函数”,不管它们是直线还是曲线。然而,这就表示我们需要找到能在计算机上表现良好的曲线方程。这样的曲线有很多种,在本文我们主要关注一类特殊的、备受关注的函数,基本上任何画曲线的地方都会用到它:贝塞尔曲线。

    -

    它们是以Pierre Bézier命名的,尽管他并不是第一个,或者说唯一“发明”了这种曲线的人,但他让世界知道了这种曲线十分适合设计工作(在1962年为Renault工作并发表了他的研究)。有人也许会说数学家Paul de Casteljau是第一个发现这类曲线特性的人,在Citroën工作时,他提出了一种很优雅的方法来画这些曲线。然而,de Casteljau没有发表他的工作,这使得“谁先发现”这一问题很难有一个确切的答案。 -贝塞尔曲线本质上是伯恩斯坦多项式,这是Sergei Natanovich Bernstein研究的一种数学函数,关于它们的出版物至少可以追溯到1912年。无论如何,这些都只是一些冷知识,你可能更在意的是这些曲线很方便:你可以连接多条贝塞尔曲线,并且连接起来的曲线看起来就像是一条曲线。甚至,在你在Photoshop中画“路径”或使用一些像Flash、Illustrator和Inkscape这样的矢量绘图程序时,所画的曲线都是贝塞尔曲线。

    -

    那么,要是你自己想编程实现它们呢?有哪些陷阱?你怎么画它们?包围盒是怎么样的,怎么确定交点,怎么拉伸曲线,简单来说:你怎么对曲线做一切你想做的事?这就是这篇文章想说的。准备好学习一些数学吧!

    -
    - -

    注意:几乎所有的贝塞尔图形都是可交互的。

    -

    这个页面使用了基于Bezier.js 的可交互例子。

    - -

    这本书是开源的。

    -

    这本书是开源的软件项目,现有两个github仓库。第一个https://github.com/pomax/bezierinfo,它是你现在在看的这个,纯粹用来展示的版本。另外一个https://github.com/pomax/BezierInfo-2,是带有所有html, javascript和css的开发版本。你可以fork任意一个,随便做些什么,当然除了把它当作自己的作品来商用。 =)

    -

    用到的数学将有多复杂?

    -

    这份入门读物用到的大部分数学知识都是高中所学的。如果你理解基本的计算并能看懂英文的话,就能上手这份材料。有时候会用到复杂一点的数学,但如果你不想深究它们,可以选择跳过段落里的“详解”部分,或者直接跳到章节末尾,避开那些看起来很深入的数学。章节的末尾往往会列出一些结论,因此你可以直接利用这些结论。

    -

    问题,评论:

    -

    如果你有对于新章节的一些建议,点击 Github issue tracker (也可以点右上角的repo链接)。如果你有关于材料的一些问题,由于我现在在做改写工作,目前没有评论功能,但你可以用issue跟踪来发表评论。一旦完成重写工作,我会把评论功能加上,或者会有“选择文字段落,点击‘问题’按钮来提问”的系统。到时候我们看看。

    -

    给我买杯咖啡?

    -

    如果你很喜欢这本书,或发现它对你要做的事很有帮助,或者你想知道怎么表达自己对这本书的感激,你可以 给我买杯咖啡 ,所少钱取决于你。这份工作持续了很多年,从一份小小的简要到70多页关于贝塞尔曲线的读物,在完成它的过程中倾注了很多咖啡。我从未后悔花在这上面的每一分钟,但如果有更多咖啡的话,我可以坚持写下去!

    -
    -
    -
    -

    变更日志

    -

    本入门是一份活动文档,因此它可能会有新的内容,这取决于你上次查看的时间。单击以下链接以展开,查看添加的内容、时间,或点击浏览 News posts 获取更多更新信息。 (RSS feed 可用)

    - - - -

    November 2020

    • Added a section on finding curve/circle intersections

      -
    -

    October 2020

    • Added the Ukranian locale! Help out in getting its localization to 100%!

      -
    -

    August-September 2020

    • Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS turned on.

      -
    -

    June 2020

    • Added automatic CI/CD using Github Actions

      -
    -

    January 2020

    • Added reset buttons to all graphics

      -
    • -
    • Updated to preface to correctly describe the on-page maths

      -
    • -
    • Fixed the Catmull-Rom section because it had glaring maths errors

      -
    -

    August 2019

    • Added a section on (plain) rational Bezier curves

      -
    • -
    • Improved the Graphic component to allow for sliders

      -
    -

    December 2018

    • Added a section on curvature and calculating kappa.

      -
    • -
    • Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this site!

      -
    -

    August 2018

    • Added a section on finding a curve's y, if all you have is the x coordinate.

      -
    -

    July 2018

    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

      -
    • -
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

      -
    • -
    • (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      -
    -

    June 2018

    • Added a section on direct curve fitting.

      -
    • -
    • Added source links for all graphics.

      -
    • -
    • Added this "What's new?" section.

      -
    -

    April 2017

    • Added a section on 3d normals.

      -
    • -
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

      -
    -

    February 2017

    • Finished rewriting the entire codebase for localization.

      -
    -

    January 2016

    • Added a section to explain the Bezier interval.

      -
    • -
    • Rewrote the Primer as a React application.

      -
    -

    December 2015

    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

      -
    • -
    • Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics.

      -
    -

    May 2015

    • Switched over to pure JS rather than Processing-through-Processing.js

      -
    • -
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

      -
    -

    April 2015

    • Added a section on arc length approximations.

      -
    -

    February 2015

    • Added a section on the canonical cubic Bezier form.

      -
    -

    November 2014

    • Switched to HTTPS.

      -
    -

    July 2014

    • Added the section on arc approximation.

      -
    -

    April 2014

    • Added the section on Catmull-Rom fitting.

      -
    -

    November 2013

    • Added the section on Catmull-Rom / Bezier conversion.

      -
    • -
    • Added the section on Bezier cuves as matrices.

      -
    -

    April 2013

    • Added a section on poly-Beziers.

      -
    • -
    • Added a section on boolean shape operations.

      -
    -

    March 2013

    • First drastic rewrite.

      -
    • -
    • Added sections on circle approximations.

      -
    • -
    • Added a section on projecting a point onto a curve.

      -
    • -
    • Added a section on tangents and normals.

      -
    • -
    • Added Legendre-Gauss numerical data tables.

      -
    -

    October 2011

    • First commit for the bezierinfo site, based on the pre-Primer webpage that covered the basics of Bezier curves in HTML with Processing.js examples.

      -
    -
    -
    -

    简单介绍

    -

    让我们有个好的开始:当我们在谈论贝塞尔曲线的时候,所指的就是你在如下图像看到的东西。它们从某些起点开始,到终点结束,并且受到一个或多个的“中间”控制点的影响。本页面上的图形都是可交互的,你可以拖动这些点,看看这些形状在你的操作下会怎么变化。

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    +

    这本书是开源的。

    +

    + 这本书是开源的软件项目,现有两个github仓库。第一个https://github.com/pomax/bezierinfo,它是你现在在看的这个,纯粹用来展示的版本。另外一个https://github.com/pomax/BezierInfo-2,是带有所有html, javascript和css的开发版本。你可以fork任意一个,随便做些什么,当然除了把它当作自己的作品来商用。 =) +

    +

    用到的数学将有多复杂?

    +

    + 这份入门读物用到的大部分数学知识都是高中所学的。如果你理解基本的计算并能看懂英文的话,就能上手这份材料。有时候会用到复杂一点的数学,但如果你不想深究它们,可以选择跳过段落里的“详解”部分,或者直接跳到章节末尾,避开那些看起来很深入的数学。章节的末尾往往会列出一些结论,因此你可以直接利用这些结论。 +

    +

    问题,评论:

    +

    + 如果你有对于新章节的一些建议,点击 + Github issue tracker + (也可以点右上角的repo链接)。如果你有关于材料的一些问题,由于我现在在做改写工作,目前没有评论功能,但你可以用issue跟踪来发表评论。一旦完成重写工作,我会把评论功能加上,或者会有“选择文字段落,点击‘问题’按钮来提问”的系统。到时候我们看看。 +

    +

    给我买杯咖啡?

    +

    + 如果你很喜欢这本书,或发现它对你要做的事很有帮助,或者你想知道怎么表达自己对这本书的感激,你可以 + 给我买杯咖啡 + ,所少钱取决于你。这份工作持续了很多年,从一份小小的简要到70多页关于贝塞尔曲线的读物,在完成它的过程中倾注了很多咖啡。我从未后悔花在这上面的每一分钟,但如果有更多咖啡的话,我可以坚持写下去! +

    + +
    +
    +

    变更日志

    +

    + 本入门是一份活动文档,因此它可能会有新的内容,这取决于你上次查看的时间。单击以下链接以展开,查看添加的内容、时间,或点击浏览 + News posts 获取更多更新信息。 (RSS feed 可用) +

    + + + +
    +

    November 2020

    +
      +
    • Added a section on finding curve/circle intersections

    • +
    +

    October 2020

    +
      +
    • Added the Ukranian locale! Help out in getting its localization to 100%!

    • +
    +

    August-September 2020

    +
      +
    • +

      + Completely overhauled the site: the Primer is now a normal web page that works fine with JS disabled, but obviously better with JS + turned on. +

      +
    • +
    +

    June 2020

    +
      +
    • Added automatic CI/CD using Github Actions

    • +
    +

    January 2020

    +
      +
    • Added reset buttons to all graphics

    • +
    • Updated to preface to correctly describe the on-page maths

    • +
    • Fixed the Catmull-Rom section because it had glaring maths errors

    • +
    +

    August 2019

    +
      +
    • Added a section on (plain) rational Bezier curves

    • +
    • Improved the Graphic component to allow for sliders

    • +
    +

    December 2018

    +
      +
    • Added a section on curvature and calculating kappa.

    • +
    • +

      + Added a Patreon page! Head on over to patreon.com/bezierinfo to help support this + site! +

      +
    • +
    +

    August 2018

    +
      +
    • Added a section on finding a curve's y, if all you have is the x coordinate.

    • +
    +

    July 2018

    +
      +
    • Rewrote the 3D normals section, implementing and explaining Rotation Minimising Frames.

    • +
    • Updated the section on curve order raising/lowering, showing how to get a least-squares optimized lower order curve.

    • +
    • +

      (Finally) updated 'npm test' so that it automatically rebuilds when files are changed while the dev server is running.

      +
    • +
    +

    June 2018

    +
      +
    • Added a section on direct curve fitting.

    • +
    • Added source links for all graphics.

    • +
    • Added this "What's new?" section.

    • +
    +

    April 2017

    +
      +
    • Added a section on 3d normals.

    • +
    • Added live-updating for the social link buttons, so they always link to the specific section you're reading.

    • +
    +

    February 2017

    +
      +
    • Finished rewriting the entire codebase for localization.

    • +
    +

    January 2016

    +
      +
    • Added a section to explain the Bezier interval.

    • +
    • Rewrote the Primer as a React application.

    • +
    +

    December 2015

    +
      +
    • Set up the split repository between BezierInfo-2 as development repository, and bezierinfo as live page.

    • +
    • +

      + Removed the need for client-side LaTeX parsing entirely, so the site doesn't take a full minute or more to load all the graphics. +

      +
    • +
    +

    May 2015

    +
      +
    • Switched over to pure JS rather than Processing-through-Processing.js

    • +
    • Added Cardano's algorithm for finding the roots of a cubic polynomial.

    • +
    +

    April 2015

    +
      +
    • Added a section on arc length approximations.

    • +
    +

    February 2015

    +
      +
    • Added a section on the canonical cubic Bezier form.

    • +
    +

    November 2014

    +
      +
    • Switched to HTTPS.

    • +
    +

    July 2014

    +
      +
    • Added the section on arc approximation.

    • +
    +

    April 2014

    +
      +
    • Added the section on Catmull-Rom fitting.

    • +
    +

    November 2013

    +
      +
    • Added the section on Catmull-Rom / Bezier conversion.

    • +
    • Added the section on Bezier cuves as matrices.

    • +
    +

    April 2013

    +
      +
    • Added a section on poly-Beziers.

    • +
    • Added a section on boolean shape operations.

    • +
    +

    March 2013

    +
      +
    • First drastic rewrite.

    • +
    • Added sections on circle approximations.

    • +
    • Added a section on projecting a point onto a curve.

    • +
    • Added a section on tangents and normals.

    • +
    • Added Legendre-Gauss numerical data tables.

    • +
    +

    October 2011

    +
      +
    • +

      + First commit for the bezierinfo site, based on the pre-Primer webpage that covered + the basics of Bezier curves in HTML with Processing.js examples. +

      +
    • +
    +
    +
    +
    +
    +

    + + 简单介绍 +

    +

    + 让我们有个好的开始:当我们在谈论贝塞尔曲线的时候,所指的就是你在如下图像看到的东西。它们从某些起点开始,到终点结束,并且受到一个或多个的“中间”控制点的影响。本页面上的图形都是可交互的,你可以拖动这些点,看看这些形状在你的操作下会怎么变化。 +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    -

    这些曲线在计算机辅助设计和计算机辅助制造应用(CAD/CAM)中用的很多。在图形设计软件中也常用到,像Adobe Illustrator, Photoshop, Inkscape, Gimp等等。还可以应用在一些图形技术中,像矢量图形(SVG)和OpenType字体(ttf/otf)。许多东西都用到贝塞尔曲线,如果你想更了解它们...准备好继续往下学吧!

    - -
    -
    -

    什么构成了贝塞尔曲线?

    -

    操作点的移动,看看曲线的变化,可能让你感受到了贝塞尔曲线是如何表现的。但贝塞尔曲线究竟什么呢?有两种方式来解释贝塞尔曲线,并且可以证明它们完全相等,但是其中一种用到了复杂的数学,另外一种比较简单。所以...我们先从简单的开始吧:

    -

    贝塞尔曲线是线性插值的结果。这听起来很复杂,但你在很小的时候就做过线性插值:当你指向两个物体中的另外一个物体时,你就用到了线性插值。它就是很简单的“选出两点之间的一个点”。

    -

    如果我们知道两点之间的距离,并想找出离第一个点20%间距的一个新的点(也就是离第二个点80%的间距),我们可以通过简单的计算来得到:

    - - -

    让我们来通过实际操作看一下:下面的图形都是可交互的,因此你可以通过上下键来增加或减少插值距离,来观察图形的变化。我们从三个点构成的两条线段开始。通过对各条线段进行线性插值得到两个点,对点之间的线段再进行线性插值,产生一个新的点。最终这些点——所有的点都可以通过选取不同的距离插值产生——构成了贝塞尔曲线:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    这为我们引出了复杂的数学:微积分。

    -

    虽然我们刚才好像没有用到这个,我们实际上只是逐步地画了一条二次曲线,而不是一次画好。贝塞尔曲线的一个很棒的特性就是它们可以通过多项式方程表示,也可以用很简单的插值形式表示。因此,反过来说,我们可以基于“真正的数学”(检查方程式,导数之类的东西),也可以通过观察曲线的“机械”构成(比如说,可以得知曲线永远不会延伸超过我们用来构造它的点),来看看这些曲线能够做什么。

    -

    让我们从更深的层次来观察贝塞尔曲线。看看它们的数学表达式,从这些表达式衍生得到的属性,以及我们可以对贝塞尔曲线做的事。

    - -
    -
    -

    贝塞尔曲线的数学原理

    -

    贝塞尔曲线是“参数”方程的一种形式。从数学上讲,参数方程作弊了:“方程”实际上是一个从输入到唯一输出的、良好定义的映射关系。几个输入进来,一个输出返回。改变输入变量,还是只有一个输出值。参数方程在这里作弊了。它们基本上干了这么件事,“好吧,我们想要更多的输出值,所以我们用了多个方程”。举个例子:假如我们有一个方程,通过一些计算,将假设为x的一些值映射到另外的值:

    - - -

    记号f(x)是表示函数的标准方式(为了方便起见,如果只有一个的话,我们称函数为f),函数的输出根据一个变量(本例中是x)变化。改变xf(x)的输出值也会变。

    -

    到目前没什么问题。现在,让我们来看一下参数方程,以及它们是怎么作弊的。我们取以下两个方程:

    - - -

    这俩方程没什么让人印象深刻的,只不过是正弦函数和余弦函数,但正如你所见,输入变量有两个不同的名字。如果我们改变了a的值,f(b)的输出不会有变化,因为这个方程没有用到a。参数方程通过改变这点来作弊。在参数方程中,所有不同的方程共用一个变量,如下所示:

    - - -

    多个方程,但只有一个变量。如果我们改变了t的值,fa(t)fb(t)的输出都会发生变化。你可能会好奇这有什么用,答案其实很简单:对于参数曲线,如果我们用常用的标记来替代fa(t)fb(t),看起来就有些明朗了:

    - - -

    好了,通过一些神秘的t值将x/y坐标系联系起来。

    -

    所以,参数曲线不像一般函数那样,通过x坐标来定义y坐标,而是用一个“控制”变量将它们连接起来。如果改变t的值,每次变化时我们都能得到两个值,这可以作为图形中的(x,y)坐标。比如上面的方程组,生成位于一个圆上的点:我们可以使t在正负极值间变化,得到的输出(x,y)都会位于一个以原点(0,0)为中心且半径为1的圆上。如果我们画出t从0到5时的值,将得到如下图像:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    贝塞尔曲线是(一种)参数方程,并在它的多个维度上使用相同的基本方程。在上述的例子中x值和y值使用了不同的方程,与此不同的是,贝塞尔曲线的xy都用了“二项多项式”。那什么是二项多项式呢?

    -

    你可能记得高中所学的多项式,看起来像这样:

    - - -

    如果它的最高次项是就称为“三次”多项式,如果最高次项是,称为“二次”多项式,如果只含有x的项,它就是一条线(不过不含任何x的项它就不是一个多项式!)

    -

    贝塞尔曲线不是x的多项式,它是t的多项式,t的值被限制在0和1之间,并且含有ab等参数。它采用了二次项的形式,听起来很神奇但实际上就是混合不同值的简单描述:

    - - -

    我明白你在想什么:这看起来并不简单,但如果我们拿掉t并让系数乘以1,事情就会立马简单很多,看看这些二次项:

    - - -

    需要注意的是,2与1+1相同,3相当于2+1或1+2,6相当于3+3...如你所见,每次我们增加一个维度,只要简单地将头尾置为1,中间的操作都是“将上面的两个数字相加”。现在就能很容易地记住了。

    -

    还有一个简单的办法可以弄清参数项怎么工作的:如果我们将(1-t)重命名为a,将t重命名为b,暂时把权重删掉,可以得到这个:

    - - -

    基本上它就是“每个ab结合项”的和,在每个加号后面逐步的将a换成b。因此这也很简单。现在你已经知道了二次多项式,为了叙述的完整性,我将给出一般方程:

    - - -

    这就是贝塞尔曲线完整的描述。在这个函数中的Σ表示了这是一系列的加法(用Σ下面的变量,从...=<值>开始,直到Σ上面的数字结束)。

    -
    + +

    这就是贝塞尔曲线完整的描述。在这个函数中的Σ表示了这是一系列的加法(用Σ下面的变量,从...=<值>开始,直到Σ上面的数字结束)。

    +
    +

    如何实现基本方程

    +

    我们可以用之前说过的方程,来简单地实现基本方程作为数学构造,如下:

    -

    如何实现基本方程

    -

    我们可以用之前说过的方程,来简单地实现基本方程作为数学构造,如下:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    我说我们“可以用”是因为我们不会这么去做:因为阶乘函数开销非常大。并且,正如我们在上面所看到的,我们不用阶乘也能够很容易地构造出帕斯卡三角形:一开始是[1],接着是[1,2,1],然后是[1,3,3,1]等等。下一行都比上一行多一个数,首尾都为1,中间的数字是上一行两边元素的和。

    -

    我们可以很快的生成这个列表,并在之后使用这个查找表而不用再计算二次多项式的系数:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    - -

    这里做了些什么?首先,我们声明了一个足够大的查找表。然后,我们声明了一个函数来获取我们想要的值,并且确保当一个请求的n/k对不在LUT查找表中时,先将表扩大。我们的基本函数如下所示:

    + return lut[n][k] + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + - - - - -
    1 - -
    2
    3
    4
    5
    - -

    完美。当然我们可以进一步优化。为了大部分的计算机图形学目的,我们不需要任意的曲线。我们需要二次曲线和三次曲线(实际上这篇文章没有涉及任意次的曲线,因此你会在其他地方看到与这些类似的代码),这说明我们可以彻底简化代码:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    现在我们知道如何代用码实现基本方程了。很好。

    -
    + return mt3 + 3*mt2*t + 3*mt*t2 + t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    既然我们已经知道基本函数的样子,是时候添加一些魔法来使贝塞尔曲线变得特殊了:控制点。

    +

    现在我们知道如何代用码实现基本方程了。很好。

    +
    -
    -
    -

    控制贝塞尔的曲率

    -

    贝塞尔曲线是插值方程(就像所有曲线一样),这表示它们取一系列的点,生成一些处于这些点之间的值。(一个推论就是你永远无法生成一个位于这些控制点轮廓线外面的点,更普遍是称为曲线的外壳。这信息很有用!)实际上,我们可以将每个点对方程产生的曲线做出的贡献进行可视化,因此可以看出曲线上哪些点是重要的,它们处于什么位置。

    -

    下面的图形显示了二次曲线和三次曲线的差值方程,“S”代表了点对贝塞尔方程总和的贡献。点击拖动点来看看在特定的t值时,每个曲线定义的点的插值百分比。

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - -
    +

    既然我们已经知道基本函数的样子,是时候添加一些魔法来使贝塞尔曲线变得特殊了:控制点。

    +
    +
    +

    + + 控制贝塞尔的曲率 +

    +

    + 贝塞尔曲线是插值方程(就像所有曲线一样),这表示它们取一系列的点,生成一些处于这些点之间的值。(一个推论就是你永远无法生成一个位于这些控制点轮廓线外面的点,更普遍是称为曲线的外壳。这信息很有用!)实际上,我们可以将每个点对方程产生的曲线做出的贡献进行可视化,因此可以看出曲线上哪些点是重要的,它们处于什么位置。 +

    +

    + 下面的图形显示了二次曲线和三次曲线的差值方程,“S”代表了点对贝塞尔方程总和的贡献。点击拖动点来看看在特定的t值时,每个曲线定义的点的插值百分比。 +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + + + 脚本被禁用。显示备用图片。 + + + + + + + + 脚本被禁用。显示备用图片。 + + + + + +
    -

    上面有一张是15th阶的插值方程。如你所见,在所有控制点中,起点和终点对曲线形状的贡献比其他点更大些。

    -

    如果我们要改变曲线,就需要改变每个点的权重,有效地改变插值。可以很直接地做到这个:只要用一个值乘以每个点,来改变它的强度。这个值照惯例称为“权重”,我们可以将它加入我们原始的贝塞尔函数:

    - - -

    看起来很复杂,但实际上“权重”只是我们想让曲线所拥有的坐标值:对于一条nth阶曲线,w0是起始坐标,wn是终点坐标,中间的所有点都是控制点坐标。假设说一条曲线的起点为(110,150),终点为(210,30),并受点(25,190)和点(210,250)的控制,贝塞尔曲线方程就为:

    - - -

    这就是我们在文章开头看到的曲线:

    - + +

    这就是我们在文章开头看到的曲线:

    + -

    我们还能对贝塞尔曲线做些什么?实际上还有很多。文章接下来涉及到我们可能运用到的一系列操作和算法,以及它们可以完成的任务。

    -
    +

    我们还能对贝塞尔曲线做些什么?实际上还有很多。文章接下来涉及到我们可能运用到的一系列操作和算法,以及它们可以完成的任务。

    +
    +

    如何实现权重基本函数

    +

    鉴于我们已经知道怎样实现基本函数,在其加入控制点是非常简单的:

    -

    如何实现权重基本函数

    -

    鉴于我们已经知道怎样实现基本函数,在其加入控制点是非常简单的:

    - - - - - -
    1 - -
    2
    3
    4
    5
    - -

    下面是优化过的版本:

    + return sum + + + + 2 + + + 3 + + + 4 + + + 5 + + - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    现在我们知道如何编程实现基本权重函数了。

    -
    + return w[0]*mt3 + 3*w[1]*mt2*t + 3*w[2]*mt*t2 + w[3]*t3 + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -
    -
    -

    Controlling Bézier curvatures, part 2: Rational Béziers

    -

    We can further control Bézier curves by "rationalising" them: that is, adding a "ratio" value in addition to the weight value discussed in the previous section, thereby gaining control over "how strongly" each coordinate influences the curve.

    -

    Adding these ratio values to the regular Bézier curve function is fairly easy. Where the regular function is the following:

    - - -

    The function for rational Bézier curves has two more terms:

    - - -

    In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just using different weight. So far, nothing too special.

    -

    However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights.

    -

    This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly describing circles (which we'll see in a later section is literally impossible using standard Bézier curves).

    -

    But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects what gets drawn:

    - - - 脚本被禁用。显示备用图片。 - - - - - - - - -

    You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence relative to all other points.

    -
    + +

    + In this, the first new term represents an additional weight for each coordinate. For example, if our ratio values are [1, 0.5, 0.5, 1] + then ratio0 = 1, ratio1 = 0.5, and so on, and is effectively identical as if we were just + using different weight. So far, nothing too special. +

    +

    + However, the second new term is what makes the difference: every point on the curve isn't just a "double weighted" point, it is a + fraction of the "doubly weighted" value we compute by introducing that ratio. When computing points on the curve, we compute the + "normal" Bézier value and then divide that by the Bézier value for the curve that only uses ratios, not weights. +

    +

    + This does something unexpected: it turns our polynomial into something that isn't a polynomial anymore. It is now a kind of curve + that is a super class of the polynomials, and can do some really cool things that Bézier curves can't do "on their own", such as perfectly + describing circles (which we'll see in a later section is literally impossible using standard Bézier curves). +

    +

    + But the best way to show what this does is to do literally that: let's look at the effect of "rationalising" our Bézier curves using an + interactive graphic for a rationalised curves. The following graphic shows the Bézier curve from the previous section, "enriched" with + ratio factors for each coordinate. The closer to zero we set one or more terms, the less relative influence the associated coordinate + exerts on the curve (and of course the higher we set them, the more influence they have). Try to change the values and see how it affects + what gets drawn: +

    + + + 脚本被禁用。显示备用图片。 + + + + + + + + +

    + You can think of the ratio values as each coordinate's "gravity": the higher the gravity, the closer to that coordinate the curve will + want to be. You'll also notice that if you simply increase or decrease all the ratios by the same amount, nothing changes... much like + with gravity, if the relative strengths stay the same, nothing really changes. The values define each coordinate's influence + relative to all other points. +

    +
    +

    How to implement rational curves

    +

    Extending the code of the previous section to include ratios is almost trivial:

    -

    How to implement rational curves

    -

    Extending the code of the previous section to include ratios is almost trivial:

    - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    - -

    And that's all we have to do.

    -
    + return (f[0] * w[0] + f[1] * w[1] + f[2] * w[2] + f[3] * w[3])/basis + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + -
    -
    -

    贝塞尔区间[0,1]

    -

    既然我们知道了贝塞尔曲线背后的数学原理,你可能会注意到一件奇怪的事:它们都是从t=0t=1。为什么是这个特殊区间?

    -

    这一切都与我们如何从曲线的“起点”变化到曲线“终点”有关。如果有一个值是另外两个值的混合,一般方程如下:

    - - -

    很显然,起始值需要a=1, b=0,混合值就为100%的value 1和0%的value 2。终点值需要a=0, b=1,则混合值是0%的value 1和100%的value 2。另外,我们不想让“a”和“b”是互相独立的:如果它们是互相独立的话,我们可以任意选出自己喜欢的值,并得到混合值,比如说100%的value1和100%的value2。原则上这是可以的,但是对于贝塞尔曲线来说,我们通常想要的是起始值和终点值之间的混合值,所以要确保我们不会设置一些“a”和"b"而导致混合值超过100%。这很简单:

    - - -

    用这个式子我们可以保证相加的值永远不会超过100%。通过将a限制在区间[0,1],我们将会一直处于这两个值之间(包括这两个端点),并且相加为100%。

    -

    但是...如果我们没有假定只使用0到1之间的数,而是用一些区间外的值呢,事情会变得很糟糕吗?好吧...不全是,我们接下来看看。

    -

    对于贝塞尔曲线的例子,扩展区间只会使我们的曲线“保持延伸”。贝塞尔曲线是多项式曲线上简单的片段,如果我们选一个更大的区间,会看到曲线更多部分。它们看起来是什么样的呢?

    -

    下面两个图形给你展示了以“普通方式”来渲染的贝塞尔曲线,以及如果我们扩大t值时它们所“位于”的曲线。如你所见,曲线的剩余部分隐藏了很多“形状”,我们可以通过移动曲线的点来建模这部分。

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    + +

    + 用这个式子我们可以保证相加的值永远不会超过100%。通过将a限制在区间[0,1],我们将会一直处于这两个值之间(包括这两个端点),并且相加为100%。 +

    +

    但是...如果我们没有假定只使用0到1之间的数,而是用一些区间外的值呢,事情会变得很糟糕吗?好吧...不全是,我们接下来看看。

    +

    + 对于贝塞尔曲线的例子,扩展区间只会使我们的曲线“保持延伸”。贝塞尔曲线是多项式曲线上简单的片段,如果我们选一个更大的区间,会看到曲线更多部分。它们看起来是什么样的呢? +

    +

    + 下面两个图形给你展示了以“普通方式”来渲染的贝塞尔曲线,以及如果我们扩大t值时它们所“位于”的曲线。如你所见,曲线的剩余部分隐藏了很多“形状”,我们可以通过移动曲线的点来建模这部分。 +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    -

    实际上,图形设计和计算机建模中还用了一些和贝塞尔曲线相反的曲线,这些曲线没有固定区间和自由的坐标,相反,它们固定座标但给你自由的区间。"Spiro"曲线就是一个很好的例子,它的构造是基于羊角螺线,也就是欧拉螺线的一部分。这是在美学上很令人满意的曲线,你可以在一些图形包中看到它,比如FontForgeInkscape,它也被用在一些字体设计中(比如Inconsolata字体)。

    - -
    -
    -

    用矩阵运算来表示贝塞尔曲率

    -

    通过将贝塞尔公式表示成一个多项式基本方程、系数矩阵以及实际的坐标,我们也可以用矩阵运算来表示贝塞尔。让我们看一下这对三次曲线来说有什么含义:

    - - -

    暂时不用管我们具体的坐标,现在有:

    - - -

    可以将它写成四个表达式之和:

    - - -

    我们可以扩展这些表达式:

    - - -

    更进一步,我们可以加上所有的1和0系数,以便看得更清楚:

    - - -

    现在,我们可以将它看作四个矩阵运算:

    - - -

    如果我们将它压缩到一个矩阵操作里,就能得到:

    - - -

    这种多项式表达式一般是以递增的顺序来写的,所以我们应该将t矩阵水平翻转,并将大的那个“混合”矩阵上下颠倒:

    - - -

    最终,我们可以加入原始的坐标,作为第三个单独矩阵:

    - - -

    我们可以对二次曲线运用相同的技巧,可以得到:

    - - -

    如果我们代入t值并乘以矩阵来计算,得到的值与解原始多项式方程或用逐步线性插值计算的结果一样。

    -

    因此:为什么我们要用矩阵来计算? 用矩阵形式来表达曲线可以让我们去探索函数的一些很难被发现的性质。可以证明的是曲线构成了三角矩阵,并且它与我们用在曲线中的实际坐标的求积相同。它还是可颠倒的,这说明可以满足大量特性。当然,主要问题是:“现在,为什么这些对我们很有用?”,答案就是这些并不是立刻就很有用,但是以后你会看到在一些例子中,曲线的一些属性可以用函数式来计算,也可以巧妙地用矩阵运算来得到,有时候矩阵方法要快得多。

    -

    所以,现在只要记着我们可以用这种形式来表示曲线,让我们接着往下看看。

    + +

    如果我们代入t值并乘以矩阵来计算,得到的值与解原始多项式方程或用逐步线性插值计算的结果一样。

    +

    + 因此:为什么我们要用矩阵来计算? 用矩阵形式来表达曲线可以让我们去探索函数的一些很难被发现的性质。可以证明的是曲线构成了三角矩阵,并且它与我们用在曲线中的实际坐标的求积相同。它还是可颠倒的,这说明可以满足大量特性。当然,主要问题是:“现在,为什么这些对我们很有用?”,答案就是这些并不是立刻就很有用,但是以后你会看到在一些例子中,曲线的一些属性可以用函数式来计算,也可以巧妙地用矩阵运算来得到,有时候矩阵方法要快得多。 +

    +

    所以,现在只要记着我们可以用这种形式来表示曲线,让我们接着往下看看。

    +
    +
    +

    + + de Casteljau's 算法 +

    +

    + 要绘制贝塞尔曲线,我们可以从01遍历t的所有值,计算权重函数,得到需要画的x/y值。但曲线越复杂,计算量也变得越大。我们可以利用“de + Casteljau算法",这是一种几何画法,并且易于实现。实际上,你可以轻易地用笔和尺画出曲线。 +

    +

    我们用以下步骤来替代用t计算x/y的微积分算法:

    +
      +
    • t看做一个比例(实际上它就是),t=0代表线段的0%,t=1代表线段的100%。
    • +
    • 画出所有点的连线,对n阶曲线来说可以画出n条线。
    • +
    • 在每条线的t处做一个记号。比如t是0.2,就在离起点20%(离终点80%)的地方做个记号。
    • +
    • 连接这些点,得到n-1条线。
    • +
    • 在这些新得到的线上同样用t为比例标记。
    • +
    • 把相邻的那些点连线,得到n-2条线。
    • +
    • 取记号,连线,取记号,等等。
    • +
    • 重复这些步骤,直到剩下一条线。这条线段上的t点就是原始曲线在t处的点。
    • +
    +

    + 我们通过实际操作来观察这个过程。在以下的图表中,移动鼠标来改变用de + Casteljau算法计算得到的曲线点,左右移动鼠标,可以实时看到曲线是如何生成的。 +

    + + + 脚本被禁用。显示备用图片。 + + + + + +
    +

    如何实现de Casteljau算法

    +

    让我们使用刚才描述过的算法,并实现它:

    -
    -
    -

    de Casteljau's 算法

    -

    要绘制贝塞尔曲线,我们可以从01遍历t的所有值,计算权重函数,得到需要画的x/y值。但曲线越复杂,计算量也变得越大。我们可以利用“de Casteljau算法",这是一种几何画法,并且易于实现。实际上,你可以轻易地用笔和尺画出曲线。

    -

    我们用以下步骤来替代用t计算x/y的微积分算法:

    -
      -
    • t看做一个比例(实际上它就是),t=0代表线段的0%,t=1代表线段的100%。
    • -
    • 画出所有点的连线,对n阶曲线来说可以画出n条线。
    • -
    • 在每条线的t处做一个记号。比如t是0.2,就在离起点20%(离终点80%)的地方做个记号。
    • -
    • 连接这些点,得到n-1条线。
    • -
    • 在这些新得到的线上同样用t为比例标记。
    • -
    • 把相邻的那些点连线,得到n-2条线。
    • -
    • 取记号,连线,取记号,等等。
    • -
    • 重复这些步骤,直到剩下一条线。这条线段上的t点就是原始曲线在t处的点。
    • -
    -

    我们通过实际操作来观察这个过程。在以下的图表中,移动鼠标来改变用de Casteljau算法计算得到的曲线点,左右移动鼠标,可以实时看到曲线是如何生成的。

    - - - 脚本被禁用。显示备用图片。 - - - - - -
    - -

    如何实现de Casteljau算法

    -

    让我们使用刚才描述过的算法,并实现它:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    好了,这就是算法的实现。一般来说你不能随意重载“+”操作符,因此我们给出计算xy坐标的实现:

    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    以上算法做了什么?如果参数points列表只有一个点, 就画出一个点。如果有多个点,就生成以t为比例的一系列点(例如,以上算法中的"标记点"),然后为新的点列表调用绘制函数。

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    简化绘图

    -

    我们可以简化绘制的过程,先在具体的位置“采样”曲线,然后用线段把这些点连接起来。由于我们是将曲线转换成一系列“平整的”直线,故将这个过程称之为“拉平(flattening)”。

    -

    我们可以先确定“想要X个分段”,然后在间隔的地方采样曲线,得到一定数量的分段。这种方法的优点是速度很快:比起遍历100甚至1000个曲线坐标,我们可以采样比较少的点,仍然得到看起来足够好的曲线。这么做的缺点是,我们失去了“真正的曲线”的精度,因此不能用此方法来做真实的相交检测或曲率对齐。

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - -
    +

    + 以上算法做了什么?如果参数points列表只有一个点, + 就画出一个点。如果有多个点,就生成以t为比例的一系列点(例如,以上算法中的"标记点"),然后为新的点列表调用绘制函数。 +

    + +
    +
    +

    + + 简化绘图 +

    +

    + 我们可以简化绘制的过程,先在具体的位置“采样”曲线,然后用线段把这些点连接起来。由于我们是将曲线转换成一系列“平整的”直线,故将这个过程称之为“拉平(flattening)”。 +

    +

    + 我们可以先确定“想要X个分段”,然后在间隔的地方采样曲线,得到一定数量的分段。这种方法的优点是速度很快:比起遍历100甚至1000个曲线坐标,我们可以采样比较少的点,仍然得到看起来足够好的曲线。这么做的缺点是,我们失去了“真正的曲线”的精度,因此不能用此方法来做真实的相交检测或曲率对齐。 +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + + + 脚本被禁用。显示备用图片。 + + + + + +
    -

    试着点击图形,并用上下键来降低二次曲线和三次曲线的分段数量。你会发现对某些曲率来说,数量少的分段也能做的很好,但对于复杂的曲率(在三次曲线上试试),足够多的分段才能很好地满足曲率的变化。

    -
    +

    + 试着点击图形,并用上下键来降低二次曲线和三次曲线的分段数量。你会发现对某些曲率来说,数量少的分段也能做的很好,但对于复杂的曲率(在三次曲线上试试),足够多的分段才能很好地满足曲率的变化。 +

    +
    +

    如何实现曲线的拉平

    +

    让我们来实现刚才简述过的算法:

    -

    如何实现曲线的拉平

    -

    让我们来实现刚才简述过的算法:

    - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    好了,这就是算法的实现。它基本上是画出一系列的线段来模拟“曲线”。

    + return coordinates; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    我们将第一个坐标作为参考点,然后在相邻两个点之间画线。

    -
    + coord = _coord + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -
    -
    -

    分割曲线

    -

    使用 de Casteljau 算法我们也可以将一条贝塞尔曲线分割成两条更小的曲线,二者拼接起来即可形成原来的曲线。当采用某个 t 值构造 de Casteljau 算法时,该过程会给到我们在 t 点分割曲线的所有点: 一条曲线包含该曲线上点之前的所有点,另一条曲线包含该曲线上点之后的所有点。

    - - - 脚本被禁用。显示备用图片。 - - - - - -
    +

    我们将第一个坐标作为参考点,然后在相邻两个点之间画线。

    +
    +
    +
    +

    + + 分割曲线 +

    +

    + 使用 de Casteljau 算法我们也可以将一条贝塞尔曲线分割成两条更小的曲线,二者拼接起来即可形成原来的曲线。当采用某个 t 值构造 de + Casteljau 算法时,该过程会给到我们在 t 点分割曲线的所有点: + 一条曲线包含该曲线上点之前的所有点,另一条曲线包含该曲线上点之后的所有点。 +

    + + + 脚本被禁用。显示备用图片。 + + + + + +
    +

    分割曲线的代码实习

    +

    通过在 de Casteljau 函数里插入一些额外的输出代码,我们就可以实现曲线的分割:

    -

    分割曲线的代码实习

    -

    通过在 de Casteljau 函数里插入一些额外的输出代码,我们就可以实现曲线的分割:

    - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    - -

    对某个给定 t 值,该函数执行后,数组 leftright 将包含两条曲线的所有点的坐标 -- 一条是t值左侧的曲线,一条是t值右侧的曲线, 与原始曲线同序且完全重合。

    -
    + drawCurvePoint(newpoints, t) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + -
    -
    -

    用矩阵分割曲线

    -

    另一种分割曲线的方法是利用贝塞尔曲线的矩阵表示。矩阵一章已经说明可以用矩阵乘法表示曲线,尤其是以下两种分别表示二次曲线和三次曲线的形式(为提高可读性,贝塞尔曲线的系数向量已被倒转次序):

    - - -

    - - -

    假设要在点t = z处分割曲线得到两条新的(且显然更小的)贝塞尔曲线,则可以用矩阵表示和一点线性代数求出这两条贝塞尔曲线的坐标。首先将实际的“线上点”信息分离到新的矩阵乘法中:

    - - -

    以及

    - - -

    如果可以将这些矩阵组合成**[t值] · [贝塞尔矩阵] · [列矩阵]**的形式且前两项保持不变,那么右侧的列矩阵即为描述从t = 0t = z的第一段新的贝塞尔曲线的坐标。利用线性代数的一些简单的法则可以很轻松地做到这一点。(如果不在乎推导过程,那么可以直接跳到方框底部得到结果!)

    -
    - -

    推导新的凸包坐标

    -

    推导分割曲线后所得两段曲线的坐标要花上几步,而且曲线次数越高,花的工夫越多,因此先看二次曲线:

    - - - - - - - - -

    以上变形可行是因为[M · M-1]为单位矩阵。这有点像在计算中让某项乘以x/x——不改变函数本身,却可以将函数改写为更好处理的形式或者得到不同的分解。类似地将上式中的矩阵左乘以[M · M-1]不会影响整个式子,却可以将矩阵序列[某项 · M]变为[M · 某项],而这至关重要——如果知道了[M-1 · Z · M]是什么,那就可以将其施加到已有坐标上得到一条二次贝塞尔曲线的标准矩阵表示(即[T · M · P]))和表示从t = 0t = z的曲线的一系列新坐标。计算如下:

    - - -

    很好!现在得出新的二次曲线:

    - - - - - - -

    非常好——如果需要从t = 0t = z的子曲线,那么只需保持第一个坐标不变(很合理),控制点变为原有控制点和起点关于z的分比的平均点,而且新的终点为平均点,其比例与二次伯恩斯坦多项式莫名地相似。这些新坐标其实非常容易直接计算得到!

    -

    当然这只是两条曲线中的一条,得到从t = zt = 1的一段需要再算一次。首先注意到之前的计算其实是在一般的区间[0,z]上进行的。之所以可以写成更简单的形式是因为0为端点,但将0显式地写出可知真正所计算的是:

    - - - - -

    如果需要[z,1]区间,那么计算如下:

    - - - - -

    用同样的技巧左乘单位矩阵将[某项 · M]变为[M · 某项]

    - - -

    那么最终第二条曲线为:

    - - - - - - -

    很好。出现了与之前相同的情形:保持最后一个坐标不变(很合理),控制点变为原有控制点和终点关于z的分比的平均点,而且新的终点为平均点,其比例与二次伯恩斯坦多项式莫名地相似,只不过这次用的是z-1而不是1-z。这些新坐标非常容易直接计算得到!

    -
    + +

    + 很好。出现了与之前相同的情形:保持最后一个坐标不变(很合理),控制点变为原有控制点和终点关于z的分比的平均点,而且新的终点为平均点,其比例与二次伯恩斯坦多项式莫名地相似,只不过这次用的是z-1而不是1-z。这些新坐标非常容易直接计算得到! +

    + -

    因此,不用德•卡斯特如算法而用线性代数可知在点t = z处分割一条二次曲线可得两条子曲线,它们均为用容易求得的坐标所描述的贝塞尔曲线:

    - - -

    - - -

    虽然三次曲线可以同理推导,但此处省略实际的推导过程(读者可自行写出)并直接展示所得的新坐标集:

    - - -

    - - -

    对于以上矩阵而言,是否真的有必要计算第二段的矩阵?其实不然。有了第一段的矩阵就意味着有了第二段的:将矩阵Q每行的非零值推到右侧,左侧的空位补零,再将矩阵上下翻转,**Q'**就“计算”出来了。搞定!

    -

    如此实现曲线分割需要的迭代更少,且只用缓存值直接进行四则运算,因此对于迭代耗费较大的系统更为划算。如果使用擅长矩阵操作的设备进行计算,那么用这种方法切割贝塞尔曲线会比使用德•卡斯特如算法快得多。

    - -
    -
    -

    曲线的升次与降次

    -

    贝塞尔曲线有一个有意思的性质——n阶曲线总可通过给出n+1阶曲线对应的控制点而用高一阶的曲线精确表示。

    -

    如果有一条二次曲线,那么可以如下构造三次曲线精确重现原来的曲线:首先选择相同的起点和终点,然后两个控制点分别选为“1/3起点+2/3终点”和“2/3起点+1/3终点”。所得曲线与原来的相同,只不过表示为了三次曲线而不是二次曲线。

    -

    n次曲线升为n+1次曲线的一般规则如下(注意起点和终点的权重与旧曲线的相同):

    - - -

    然而这一规则也直接意味着通常无法n次曲线稳妥地降到n-1次,这是因为控制点无法被简洁地“拆开”。可以做些尝试,但所得曲线不会与原曲线重合,而且其实还可能看起来完全不同。

    -

    不过有一种好得出人意料的方法可以保证低次曲线看起来与原曲线“尽可能地接近”——用仅仅一次操作优化低次曲线与原曲线之间的“最小二乘法距离”(Sirver's Castle中亦有解释),但是为了用上这种方法,需要先做些变形再转用线性代数。正如矩阵表示一章所言,有些东西用矩阵去做比用函数方便得多,而这就是一例。那么……开始吧!

    -

    先将标准的贝塞尔函数写得紧凑一些:

    - - -

    然后用一个朴素(其实极其有用)的变形技巧:既然t值总在0到1之间(含端点),且1-tt恒等于1,那么任何数都可表示为t1-t的和:

    - - -

    于是用这一看似平凡的性质可将贝塞尔函数拆分为1-tt两部分之和:

    - - -

    目前一切顺利。现在为了理解为什么这么做,将1-tt两部分具体写出并观察结果。首先是1-t

    - - -

    用这一看似朴素的技巧瞬间就将n次贝塞尔函数的一个部分用n+1次贝塞尔函数表示出来了,这非常像曲线升次!当然t的部分也要表示出来,但这不是问题:

    - - -

    n次的表达式变为n+1次的之后再将其重新合并。虽然n次函数是从0到n求和,n+1次函数是从0到n+1求和,但补上“贡献为零”的项即可。下一章“导数”会论述为什么“没有对应的二项式系数的更高次项”和“低于零次的项”都“贡献为零”,因此需要什么形式的项就可以加上什么项。将这些项包含在和式中没有影响,而所得函数与低次曲线依然相等:

    - - -

    接下来从变形转到线性代数(矩阵)——现在Bézier(n,t)和Bézier(n+1,t)之间的关系可用非常简单的矩阵乘法表示:

    - - -

    其中矩阵M(n+1)×n阶的矩阵,其形如:

    - - -

    这虽然看似庞杂,但真的只是几乎全为零的矩阵,而且对角线上为很简单的分数,其左侧为更简单的分数。这意味着将一列坐标乘以这一矩阵,再将所得变形之后的坐标代入高一次的函数即可得到与原曲线一模一样的曲线。

    -

    还不错!

    -

    同样有意思的一点是,在建立这一矩阵操作之后即可利用非常强大又极其简单的方法求出“最优拟合”倒转操作——即法方程组,这种方法将一组数与另一组数的平方差之和最小化。具体而言,对于超定方程组A x = b,利用法方程组可以求出使方程两侧之差长度最小的x。既然现在面临的问题即为如此,那么:

    - - -

    其中的步骤为:

    -
      -
    1. 既然有一个具有法方程组可以处理的形式的方程组,那么
    2. -
    3. 使用法方程组!
    4. -
    5. 然后因为左侧只需保留Bn,所以在两侧左乘矩阵使左侧的很多东西化为“因数1”(在矩阵语言中即为单位矩阵)。
    6. -
    7. 具体而言,左乘左侧已有项的逆可以将这个庞大的矩阵约简为单位矩阵I。于是将这一大堆替换为I,然后
    8. -
    9. 因为矩阵与单位矩阵相乘不会发生变化(就像在四则运算中数与1相乘不会发生变化),所以略去单位矩阵。
    10. -
    -

    此即用n次曲线逼近n+1次曲线的表达式。这虽然不是精确拟合,但却是非常好的近似。下图对一条(半)随机的曲线实现了这些升次和降次的规则,图上的控制点可以移动,点击按钮可以升高或降低曲线的次数。

    - - - 脚本被禁用。显示备用图片。 - - - - - - - -
    -
    -

    导数

    -

    利用贝塞尔函数的导数可以对贝塞尔曲线做一些有用的事,而贝塞尔函数较为有趣的一个性质是其导数也为贝塞尔函数。其实贝塞尔函数的求导相对而言比较直接,只是需要一点数学运算。

    -

    首先观察贝塞尔函数的求导法则,即:

    - - -

    上式可改写为(注意式中的b即权重w,且n乘以一个和式等于每个求和项乘以n再求和):

    - - -

    直白地说,n次贝塞尔函数的导数为n-1次贝塞尔函数,少了一项,而且新的权重w'0、……、w'n-1可用旧的权重通过n(wi+1 - wi)求得。对于带四个权重的三次函数,其导数的三个新权重为:w'0 = 3(w1-w0),w'1 = 3(w2-w1)和w'2 = 3(w3-w2)。

    -
    + +

    + 直白地说,n次贝塞尔函数的导数为n-1次贝塞尔函数,少了一项,而且新的权重w'0、……、w'n-1可用旧的权重通过n(wi+1 - wi)求得。对于带四个权重的三次函数,其导数的三个新权重为:w'0 = 3(w1-w0),w'1 = 3(w2-w1)和w'2 = 3(w3-w2)。 +

    +
    +

    “慢着,为什么这是对的?”

    +

    + 虽然有时候有人告诉说“这是导数”就行,但还是可能想一探究竟。既然如此,就来看看这个导数的证明。首先,因为权重不影响完整的贝塞尔函数的求导,所以求导只涉及多项式基函数的导数。基函数的导数为: +

    + +

    + 使用乘积求导法则复合函数求导法则得到: +

    + +

    上式不易处理,因此打开括号:

    + +

    + 现在技巧性的一步是将上式再次化为含二项式系数的形式,需要得到形如“x!/y!(x-y)!”的项。如果得到关于n-1和k-1的项,那么说明方向是对的。 +

    + +

    这是第一步。上式括号里的两项其实为标准的、低一次的贝塞尔函数:

    + +

    现在将上式应用于已有的加权贝塞尔函数。先写出之前所见的平面曲线公式,再逐步求出导数:

    + +

    如果打开上式的括号(用颜色表示相匹配的项),再按递增的k值重排各项,那么有:

    + +

    + 上式中有两项会消失掉:因为任意和式都没有第-1项,所以上式第一项消失。既然这一项总是贡献为零,那么求导时就可以放心地将其完全无视。消失的另外一项为展开式的最后一项——包含Bn-1,n的一项。这一项含有二项式系数Cii+1,而这一系数通常约定等于0。因此这一项贡献为零,也可被略去。这意味着剩下的项为: +

    + +

    此即低次函数之和:

    + +

    将上式改写为正常的和式即可:

    + +
    -

    “慢着,为什么这是对的?”

    -

    虽然有时候有人告诉说“这是导数”就行,但还是可能想一探究竟。既然如此,就来看看这个导数的证明。首先,因为权重不影响完整的贝塞尔函数的求导,所以求导只涉及多项式基函数的导数。基函数的导数为:

    - - -

    使用乘积求导法则复合函数求导法则得到:

    - - -

    上式不易处理,因此打开括号:

    - - -

    现在技巧性的一步是将上式再次化为含二项式系数的形式,需要得到形如“x!/y!(x-y)!”的项。如果得到关于n-1和k-1的项,那么说明方向是对的。

    - - -

    这是第一步。上式括号里的两项其实为标准的、低一次的贝塞尔函数:

    - - -

    现在将上式应用于已有的加权贝塞尔函数。先写出之前所见的平面曲线公式,再逐步求出导数:

    - - -

    如果打开上式的括号(用颜色表示相匹配的项),再按递增的k值重排各项,那么有:

    - - -

    上式中有两项会消失掉:因为任意和式都没有第-1项,所以上式第一项消失。既然这一项总是贡献为零,那么求导时就可以放心地将其完全无视。消失的另外一项为展开式的最后一项——包含Bn-1,n的一项。这一项含有二项式系数Cii+1,而这一系数通常约定等于0。因此这一项贡献为零,也可被略去。这意味着剩下的项为:

    - - -

    此即低次函数之和:

    - - -

    将上式改写为正常的和式即可:

    - - -
    - -

    将上式改写为与原式相似的形式有助于看出它们的区别。先写出原式,再写出导数:

    - - - - -

    有什么区别?对于实际的贝塞尔曲线而言几乎没有区别!虽然次数降低了(从n次变为n-1次),但是贝塞尔函数没有改变。唯一的真正的区别在于推导表示曲线的函数时权重如何变化。如果有A、B、C、D四个点,那么导数有三个点,二阶导数有两个点,三阶导数有一个点:

    - - -

    只要有多于一个权重即可运用这一方法。只剩一个权重时,下一步会出现k=0,而贝塞尔函数的和式因为无项可加而化为零。因此二次函数没有二阶导数,三次函数没有三阶导数,更一般地有n次函数有n-1阶(有意义的)导数,其更高阶导数为零。

    - -
    -
    -

    切线与法线

    -

    如果要将物体沿曲线移动或者从曲线附近“移向远处”,那么与之最相关的两个向量为曲线的切向量和法向量,而这两者都非常容易求得。切向量用于沿曲线移动或者对准曲线方向,它标志着曲线在指定点的行进方向,而且就是曲线函数的一阶导数:

    - - -

    此即所需的方向向量。可以在每一点将方向向量规范化后得到单位方向向量(即长度为1.0),再根据这些方向进行所需的操作:

    - - -

    切向量对于沿曲线移动很有用,但如果要从曲线附近“移向远处”,而且移动方向与曲线在某点t处垂直,那该怎么办?这时需要的是向量。这一向量与曲线的方向保持垂直,且长度通常为1.0,因此只需旋转单位方向向量即可:

    - - -
    - -

    其实旋转坐标只要知道方法就非常简单——“施加旋转矩阵”,以下即采用这种方法。本质上这一做法是先选取用于旋转的圆,再将坐标沿着圆“滑动所需的角度”。如果需要转动90度,那么将坐标沿着圆滑动90度即可。

    -

    为了将点(x,y)(绕(0,0))旋转φ度得到点(x',y'),可以使用以下简洁的计算式:

    - - -

    对应“短”版本的矩阵变换为:

    - - -

    注意对于90度、180度和270度旋转,因为这些角度的正弦余弦分别为0和1、-1和0、0和-1,所以上式可以更简。

    -

    但是**为什么**可以这样做?为什么用这一矩阵乘法?这是因为旋转变换可以表示为三个(初等)剪切变换的复合,而将三个变换合成一个变换(因为所有矩阵变换都可以复合)即得上述矩阵表示。DataGenetics对此进行了很好的解释,非常推荐读者一读。

    -
    + +

    注意对于90度、180度和270度旋转,因为这些角度的正弦余弦分别为0和1、-1和0、0和-1,所以上式可以更简。

    +

    + 但是**为什么**可以这样做?为什么用这一矩阵乘法?这是因为旋转变换可以表示为三个(初等)剪切变换的复合,而将三个变换合成一个变换(因为所有矩阵变换都可以复合)即得上述矩阵表示。DataGenetics对此进行了很好的解释,非常推荐读者一读。 +

    + -

    以下两图展示了二次和三次贝塞尔曲线在各点的切线和法线,其中蓝色的为方向向量,红色的为法向量(标记按t值的等分区间放置,并非等距放置)。

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    +

    + 以下两图展示了二次和三次贝塞尔曲线在各点的切线和法线,其中蓝色的为方向向量,红色的为法向量(标记按t值的等分区间放置,并非等距放置)。 +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    +
    +
    +

    + + 三维法向量 +

    +

    + 在进入下一章之前需要花点时间探究二维和三维的区别。尽管这一区别影响的范围不大无关且两种情形的做法相同(比如求三维切向量与二维情形所做的一样,不过所求为x、y、z而不只是x、y),但是法向量的情况有点复杂,所做的也就更多。尽管不是“极其困难”,但是所需的步骤更多,需要仔细看看。 +

    +

    + 三维法向量的求法原则上与二维一样——将规范化的切向量旋转90度。然而这就是情况变得略微复杂的地方:因为三维的“法向量”是法平面上的任意一个向量,所以可以旋转的方向并不唯一,因此需要定义三维情形中“唯一的”法向量是什么。 +

    +

    + “朴素”的方法是构造弗勒内法向量,而以下采用的简单做法在很多情况下都可行(但在其他情况下会得到极其怪异的结果)。思路是虽然有无穷多个向量与切向量垂直(即与之成90度角),但是切向量本身已差不多位于自带的平面上——因为曲线上的每一点(无论间隔多小)都有自己的切向量,所以可以说每个点都位于此处的切向量和“近旁”的切向量所在的平面上。 +

    +

    + 即使这两个切向量的差微乎其微,只要“有差”就可求出这一平面,或者说求出垂直于平面的向量。计算出这一向量之后,因为切向量在平面上,所以将切向量绕垂直向量旋转即可。计算这一法向量的逻辑与二维情形相同——“直接旋转90度”。 +

    +

    那么开始吧!令人意外的是四行就做完了:

    +
      +
    • a = normalize(B'(t))
    • +
    • b = normalize(a + B''(t))
    • +
    • r = normalize(b × a)
    • +
    • normal = normalize(r × a)
    • +
    +

    展开说几句:

    +
      +
    • + 先将曲线上一点的导数规范化得到单位向量。规范化可以减少计算量,而计算量越少越好。 +
    • +
    • 再计算b。假如曲线从这个点开始不再变化,保持导数和二阶导数不变,则b表示下一个点处的切向量。
    • +
    • + 得到两个共面向量后(导数、导数与二阶导数的和),用叉积这一基本的向量运算可以求出与这一平面垂直的向量。(注意这一运算使用的符号×绝非乘法运算!)叉积所得向量可以当做“旋转轴”,像二维情形一样将切向量旋转90度得到法向量。 +
    • +
    • 既然由叉积可得垂直于由两个向量所确定的平面的另一向量,而法向量又与切向量和旋转轴所在平面垂直,那么再用一次叉积即得法向量。
    • +
    +

    + 这样就求出了三维曲线“唯一”的法向量。以一条曲线为例看看效果如何?从左往右拖动滚动条,根据鼠标位置所确定的t值显示在此处的法向量——最左为0,最右为1,中间为0.5,等等: +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    然而摆弄图像一阵之后可能会察觉到异样——法向量似乎在t=0.65和t=0.75之间“绕着曲线急转弯”……为什么会这样?

    +

    + 其实出现这种现象是因为数学公式就是这样推导的,所以弗勒内法向量的问题就在于此:虽然“从数学上看”是对的,但是“从实际上看”有问题。因此为了让任何图像都不出问题,所真正需要的是只要……看起来不错就好的方法。 +

    +

    + 还好不只有弗勒内法向量这一种选择。另一种选择是采用稍微偏算法的方式计算一种形式的旋转最小化标架(亦称“平行输运标架”或“比舍标架”),此处“标架”是以线上点为原点,由切向量、旋转轴和法向量构成的集合。 +

    +

    + 因为计算这种类型的标架依赖于“上一个标架”,所以无法像弗勒内标架一样“按需”对单独的点直接计算,而是需要对整条曲线进行计算。好在计算过程相当简单,而且可以与曲线查询表的构建同时进行。 +

    +

    + 思路是在t=0处取一个由切向量、旋转轴、法向量构成的初始标架,再使用一定的规则计算下一标架“应有”的形式。上文链接的旋转最小化标架论文给出的规则为: +

    +
      +
    • 取曲线上已经求出旋转最小化标架的一个点,
    • +
    • 取曲线上尚未求出旋转最小化标架的下一个点,
    • +
    • 再以上一个点和下一个点的中垂面为镜面,将已有标架翻转到下一个点上。
    • +
    • 翻转后的切向量方向与下一个点的切向量方向大致相反,而且法向量也略有歪斜。
    • +
    • 于是再以翻转后的切向量和下一个点的切向量所确定的平面为镜面,将翻转后的标架再次翻转。
    • +
    • 切向量和法向量修正完毕,所得即为好用的标架。
    • +
    +

    来写点代码吧!

    +
    +

    实现旋转最小化标架

    +

    首先假设已有函数用于计算上文提及的指定点的弗勒内标架,输出的标架具有如下性质:

    -
    -
    -

    三维法向量

    -

    在进入下一章之前需要花点时间探究二维和三维的区别。尽管这一区别影响的范围不大无关且两种情形的做法相同(比如求三维切向量与二维情形所做的一样,不过所求为x、y、z而不只是x、y),但是法向量的情况有点复杂,所做的也就更多。尽管不是“极其困难”,但是所需的步骤更多,需要仔细看看。

    -

    三维法向量的求法原则上与二维一样——将规范化的切向量旋转90度。然而这就是情况变得略微复杂的地方:因为三维的“法向量”是法平面上的任意一个向量,所以可以旋转的方向并不唯一,因此需要定义三维情形中“唯一的”法向量是什么。

    -

    “朴素”的方法是构造弗勒内法向量,而以下采用的简单做法在很多情况下都可行(但在其他情况下会得到极其怪异的结果)。思路是虽然有无穷多个向量与切向量垂直(即与之成90度角),但是切向量本身已差不多位于自带的平面上——因为曲线上的每一点(无论间隔多小)都有自己的切向量,所以可以说每个点都位于此处的切向量和“近旁”的切向量所在的平面上。

    -

    即使这两个切向量的差微乎其微,只要“有差”就可求出这一平面,或者说求出垂直于平面的向量。计算出这一向量之后,因为切向量在平面上,所以将切向量绕垂直向量旋转即可。计算这一法向量的逻辑与二维情形相同——“直接旋转90度”。

    -

    那么开始吧!令人意外的是四行就做完了:

    -
      -
    • a = normalize(B'(t))
    • -
    • b = normalize(a + B''(t))
    • -
    • r = normalize(b × a)
    • -
    • normal = normalize(r × a)
    • -
    -

    展开说几句:

    -
      -
    • 先将曲线上一点的导数规范化得到单位向量。规范化可以减少计算量,而计算量越少越好。
    • -
    • 再计算b。假如曲线从这个点开始不再变化,保持导数和二阶导数不变,则b表示下一个点处的切向量。
    • -
    • 得到两个共面向量后(导数、导数与二阶导数的和),用叉积这一基本的向量运算可以求出与这一平面垂直的向量。(注意这一运算使用的符号×绝非乘法运算!)叉积所得向量可以当做“旋转轴”,像二维情形一样将切向量旋转90度得到法向量。
    • -
    • 既然由叉积可得垂直于由两个向量所确定的平面的另一向量,而法向量又与切向量和旋转轴所在平面垂直,那么再用一次叉积即得法向量。
    • -
    -

    这样就求出了三维曲线“唯一”的法向量。以一条曲线为例看看效果如何?从左往右拖动滚动条,根据鼠标位置所确定的t值显示在此处的法向量——最左为0,最右为1,中间为0.5,等等:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    然而摆弄图像一阵之后可能会察觉到异样——法向量似乎在t=0.65和t=0.75之间“绕着曲线急转弯”……为什么会这样?

    -

    其实出现这种现象是因为数学公式就是这样推导的,所以弗勒内法向量的问题就在于此:虽然“从数学上看”是对的,但是“从实际上看”有问题。因此为了让任何图像都不出问题,所真正需要的是只要……看起来不错就好的方法。

    -

    还好不只有弗勒内法向量这一种选择。另一种选择是采用稍微偏算法的方式计算一种形式的旋转最小化标架(亦称“平行输运标架”或“比舍标架”),此处“标架”是以线上点为原点,由切向量、旋转轴和法向量构成的集合。

    -

    因为计算这种类型的标架依赖于“上一个标架”,所以无法像弗勒内标架一样“按需”对单独的点直接计算,而是需要对整条曲线进行计算。好在计算过程相当简单,而且可以与曲线查询表的构建同时进行。

    -

    思路是在t=0处取一个由切向量、旋转轴、法向量构成的初始标架,再使用一定的规则计算下一标架“应有”的形式。上文链接的旋转最小化标架论文给出的规则为:

    -
      -
    • 取曲线上已经求出旋转最小化标架的一个点,
    • -
    • 取曲线上尚未求出旋转最小化标架的下一个点,
    • -
    • 再以上一个点和下一个点的中垂面为镜面,将已有标架翻转到下一个点上。
    • -
    • 翻转后的切向量方向与下一个点的切向量方向大致相反,而且法向量也略有歪斜。
    • -
    • 于是再以翻转后的切向量和下一个点的切向量所确定的平面为镜面,将翻转后的标架再次翻转。
    • -
    • 切向量和法向量修正完毕,所得即为好用的标架。
    • -
    -

    来写点代码吧!

    -
    - -

    实现旋转最小化标架

    -

    首先假设已有函数用于计算上文提及的指定点的弗勒内标架,输出的标架具有如下性质:

    -
    {
    +						
    +							
    +								
    +								
    +							
    +							
    +								
    +							
    +							
    +								
    +							
    +							
    +								
    +							
    +							
    +								
    +							
    +							
    +								
    +							
    +						
    1 + +
    2
    3
    4
    5
    6
    + +

    再如下写出生成一系列旋转最小化标架的函数:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    1 + +
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    -

    提到长得更好,这样的标架到底是什么样子?下面回顾之前的那条曲线,但这次用的不是弗勒内标架而是旋转最小化标架:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    看起来好多了!

    -

    给看过代码的读者的话:严格来说一开始甚至不需要弗勒内标架。比方说可以将z轴当作初始旋转轴,于是初始法向量为 (0,0,1) × 切向量,然后再继续下去。不过求出“数学上正确”的初始标架,从而让初始法向量的方向符合曲线在三维空间中的定向,这总归是不错的。

    +

    即使忽略注释,代码也明显比计算单个弗勒内标架的多,但也没有多得离谱,而且得到了长得更好的法向量。

    +
    -
    -
    -

    分量函数

    -

    当人们开始在自己的程序中使用贝塞尔曲线时,首先遇到的问题之一是:“我虽然知道怎么画曲线,但是怎么确定包围盒?”其实做法颇为直接,但需要知道如何利用一些数学知识得到所需的值。对于包围盒而言,所需的其实并不是曲线本身,而只是曲线的“极值”——曲线的x轴和y轴分量的最小值和最大值。如果还记得微积分的话(前提是学过微积分,否则更难记),那么函数的极值可以用函数的一阶导数所确定,但由于“曲线函数”有不只一个分量,这就产生了一个问题——每个分量都有自己的函数。

    -

    解决办法:对每个分量分别计算导数,再按照原来的分量顺序重新拼在一起。

    -

    以下演示参数化的贝塞尔曲线如何“分解”为两个正常的函数,一个对应于x轴,一个对应于y轴。注意左侧的图像依然是可交互的曲线,但没有标出坐标轴(坐标显示在图中);中间和右侧的图像是分量函数,分别对应于指定t值(介于0和1之间,含端点)后求出的x轴和y轴分量。

    -

    如果水平移动曲线上的点,那么应当只有中间的图像在变化;同样,如果竖直移动曲线上的点,那么应当只有右侧的图像在变化。

    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - - -
    -
    -

    Finding extremities: root finding

    -

    Now that we understand (well, superficially anyway) the component functions, we can find the extremities of our Bézier curve by finding maxima and minima on the component functions, by solving the equation B'(t) = 0. We've already seen that the derivative of a Bézier curve is a simpler Bézier curve, but how do we solve the equality? Fairly easily, actually, until our derivatives are 4th order or higher... then things get really hard. But let's start simple:

    -

    Quadratic curves: linear derivatives.

    -

    The derivative of a quadratic Bézier curve is a linear Bézier curve, interpolating between just two terms, which means finding the solution for "where is this line 0" is effectively trivial by rewriting it to a function of t and solving. First we turn our quadratic Bézier function into a linear one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then we turn this into our solution for t using basic arithmetics:

    - - -

    Done.

    -

    Although with the caveat that if b-a is zero, there is no solution and we probably shouldn't try to perform that division.

    -

    Cubic curves: the quadratic formula.

    -

    The derivative of a cubic Bézier curve is a quadratic Bézier curve, and finding the roots for a quadratic polynomial means we can apply the Quadratic formula. If you've seen it before, you'll remember it, and if you haven't, it looks like this:

    - - -

    So, if we can rewrite the Bézier component function as a plain polynomial, we're done: we just plug in the values into the quadratic formula, check if that square root is negative or not (if it is, there are no roots) and then just compute the two values that come out (because of that plus/minus sign we get two). Any value between 0 and 1 is a root that matters for Bézier curves, anything below or above that is irrelevant (because Bézier curves are only defined over the interval [0,1]). So, how do we convert?

    -

    First we turn our cubic Bézier function into a quadratic one, by following the rule mentioned at the end of the derivatives section:

    - - -

    And then, using these v values, we can find out what our a, b, and c should be:

    - - -

    This gives us three coefficients {a, b, c} that are expressed in terms of v values, where the v values are expressions of our original coordinate values, so we can do some substitution to get:

    - - -

    Easy-peasy. We can now almost trivially find the roots by plugging those values into the quadratic formula.

    -

    And as a cubic curve, there is also a meaningful second derivative, which we can compute by simple taking the derivative of the derivative.

    -

    Quartic curves: Cardano's algorithm.

    -

    We haven't really looked at them before now, but the next step up would be a Quartic curve, a fourth degree Bézier curve. As expected, these have a derivative that is a cubic function, and now things get much harder. Cubic functions don't have a "simple" rule to find their roots, like the quadratic formula, and instead require quite a bit of rewriting to a form that we can even start to try to solve.

    -

    Back in the 16th century, before Bézier curves were a thing, and even before calculus itself was a thing, Gerolamo Cardano figured out that even if the general cubic function is really hard to solve, it can be rewritten to a form for which finding the roots is "easier" (even if not "easy"):

    - - -

    We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather than three: this makes things considerably easier to solve because it lets us use regular calculus to find the values that satisfy the equation.

    -

    Now, there is one small hitch: as a cubic function, the solutions may be complex numbers rather than plain numbers... And Cardano realised this, centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number.

    -

    So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at Ken J. Ward's page for solving the cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a read-through.

    -
    + +

    + We can see that the easier formula only has two constants, rather than four, and only two expressions involving t, rather + than three: this makes things considerably easier to solve because it lets us use + regular calculus to find the values that satisfy the equation. +

    +

    + Now, there is one small hitch: as a cubic function, the solutions may be + complex numbers rather than plain numbers... And Cardano realised this, + centuries before complex numbers were a well-understood and established part of number theory. His interpretation of them was "these + numbers are impossible but that's okay because they disappear again in later steps", allowing him to not think about them too much, but we + have it even easier: as we're trying to find the roots for display purposes, we don't even care about complex numbers: we're + going to simplify Cardano's approach just that tiny bit further by throwing away any solution that's not a plain number. +

    +

    + So, how do we rewrite the hard formula into the easier formula? This is explained in detail over at + Ken J. Ward's page for solving the + cubic equation, so instead of showing the maths, I'm simply going to show the programming code for solving the cubic equation, with the + complex roots getting totally ignored, but if you're interested you should definitely head over to Ken's page and give the procedure a + read-through. +

    +
    +

    Implementing Cardano's algorithm for finding all real roots

    +

    + The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" + number space (denoted with ℝ), this is perfectly fine in the + "complex" number space (denoted with ℂ). And, as it so happens, Cardano is + also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm! +

    -

    Implementing Cardano's algorithm for finding all real roots

    -

    The "real roots" part is fairly important, because while you cannot take a square, cube, etc. root of a negative number in the "real" number space (denoted with ℝ), this is perfectly fine in the "complex" number space (denoted with ℂ). And, as it so happens, Cardano is also attributed as the first mathematician in history to have made use of complex numbers in his calculations. For this very algorithm!

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    20
    21
    22
    23
    24
    25
    26
    27
    28
    29
    30
    31
    32
    33
    34
    35
    36
    37
    38
    39
    40
    41
    42
    43
    44
    45
    46
    47
    48
    49
    50
    51
    52
    53
    54
    55
    56
    57
    58
    59
    60
    61
    62
    63
    64
    65
    66
    67
    68
    69
    70
    71
    72
    73
    74
    75
    76
    77
    78
    79
    80
    81
    - -
    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + + 20 + + + 21 + + + 22 + + + 23 + + + 24 + + + 25 + + + 26 + + + 27 + + + 28 + + + 29 + + + 30 + + + 31 + + + 32 + + + 33 + + + 34 + + + 35 + + + 36 + + + 37 + + + 38 + + + 39 + + + 40 + + + 41 + + + 42 + + + 43 + + + 44 + + + 45 + + + 46 + + + 47 + + + 48 + + + 49 + + + 50 + + + 51 + + + 52 + + + 53 + + + 54 + + + 55 + + + 56 + + + 57 + + + 58 + + + 59 + + + 60 + + + 61 + + + 62 + + + 63 + + + 64 + + + 65 + + + 66 + + + 67 + + + 68 + + + 69 + + + 70 + + + 71 + + + 72 + + + 73 + + + 74 + + + 75 + + + 76 + + + 77 + + + 78 + + + 79 + + + 80 + + + 81 + + +
    -

    And that's it. The maths is complicated, but the code is pretty much just "follow the maths, while caching as many values as we can to prevent recomputing things as much as possible" and now we have a way to find all roots for a cubic function and can just move on with using that to find extremities of our curves.

    -

    And of course, as a quartic curve also has meaningful second and third derivatives, we can quite easily compute those by using the derivative of the derivative (of the derivative), just as for cubic curves.

    -

    Quintic and higher order curves: finding numerical solutions

    -

    And this is where thing stop, because we cannot find the roots for polynomials of degree 5 or higher using algebra (a fact known as the Abel–Ruffini theorem). Instead, for occasions like these, where algebra simply cannot yield an answer, we turn to numerical analysis.

    -

    That's a fancy term for saying "rather than trying to find exact answers by manipulating symbols, find approximate answers by describing the underlying process as a combination of steps, each of which can be assigned a number via symbolic manipulation". For example, trying to mathematically compute how much water fits in a completely crazy three dimensional shape is very hard, even if it got you the perfect, precise answer. A much easier approach, which would be less perfect but still entirely useful, would be to just grab a buck and start filling the shape until it was full: just count the number of buckets of water you used. And if we want a more precise answer, we can use smaller buckets.

    -

    So that's what we're going to do here, too: we're going to treat the problem as a sequence of steps, and the smaller we can make each step, the closer we'll get to that "perfect, precise" answer. And as it turns out, there is a really nice numerical root-finding algorithm, called the Newton-Raphson root finding method (yes, after that Newton), which we can make use of. The Newton-Raphson approach consists of taking our impossible-to-solve function f(x), picking some initial value x (literally any value will do), and calculating f(x). We can think of that value as the "height" of the function at x. If that height is zero, we're done, we have found a root. If it isn't, we calculate the tangent line at f(x) and calculate at which x value its height is zero (which we've already seen is very easy). That will give us a new x and we repeat the process until we find a root.

    -

    Mathematically, this means that for some x, at step n=1, we perform the following calculation until fy(x) is zero, so that the next t is the same as the one we already have:

    - - -

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    -

    Now, this works well only if we can pick good starting points, and our curve is continuously differentiable and doesn't have oscillations. Glossing over the exact meaning of those terms, the curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: which starting points do we pick?

    -

    As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as high as you'll ever need to go.

    -

    In conclusion:

    -

    So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red:

    - - - 脚本被禁用。显示备用图片。 - - - -

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    - - - 脚本被禁用。显示备用图片。 - - - + +

    (The Wikipedia article has a decent animation for this process, so I will not add a graphic for that here)

    +

    + Now, this works well only if we can pick good starting points, and our curve is + continuously differentiable and doesn't have + oscillations. Glossing over the exact meaning of those terms, the + curves we're dealing with conform to those constraints, so as long as we pick good starting points, this will work. So the question is: + which starting points do we pick? +

    +

    + As it turns out, Newton-Raphson is so blindingly fast that we could get away with just not picking: we simply run the algorithm from + t=0 to t=1 at small steps (say, 1/200th) and the result will be all the roots we want. Of course, this may + pose problems for high order Bézier curves: 200 steps for a 200th order Bézier curve is going to go wrong, but that's okay: + there is no reason (at least, none that I know of) to ever use Bézier curves of crazy high orders. You might use a fifth order + curve to get the "nicest still remotely workable" approximation of a full circle with a single Bézier curve, but that's pretty much as + high as you'll ever need to go. +

    +

    In conclusion:

    +

    + So now that we know how to do root finding, we can determine the first and second derivative roots for our Bézier curves, and show those + roots overlaid on the previous graphics. For the quadratic curve, that means just the first derivative, in red: +

    + + + 脚本被禁用。显示备用图片。 + + +

    And for cubic curves, that means first and second derivatives, in red and purple respectively:

    + + + 脚本被禁用。显示备用图片。 + + +
    +
    +

    + + Bounding boxes +

    +

    + If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four + values we need to box in our curve: +

    +

    Computing the bounding box for a Bézier curve:

    +
      +
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. +
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. +
    5. + Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original + functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct. +
    6. +
    +

    + Applying this approach to our previous root finding, we get the following + axis-aligned bounding boxes (with all curve extremity points + shown on the curve): +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    -
    -
    -

    Bounding boxes

    -

    If we have the extremities, and the start/end points, a simple for-loop that tests for min/max values for x and y means we have the four values we need to box in our curve:

    -

    Computing the bounding box for a Bézier curve:

    -
      -
    1. Find all t value(s) for the curve derivative's x- and y-roots.
    2. -
    3. Discard any t value that's lower than 0 or higher than 1, because Bézier curves only use the interval [0,1].
    4. -
    5. Determine the lowest and highest value when plugging the values t=0, t=1 and each of the found roots into the original functions: the lowest value is the lower bound, and the highest value is the upper bound for the bounding box we want to construct.
    6. -
    -

    Applying this approach to our previous root finding, we get the following axis-aligned bounding boxes (with all curve extremity points shown on the curve):

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    - -

    We can construct even nicer boxes by aligning them along our curve, rather than along the x- and y-axis, but in order to do so we first need to look at how aligning works.

    - -
    -
    -

    Aligning curves

    -

    While there are an incredible number of curves we can define by varying the x- and y-coordinates for the control points, not all curves are actually distinct. For instance, if we define a curve, and then rotate it 90 degrees, it's still the same curve, and we'll find its extremities in the same spots, just at different draw coordinates. As such, one way to make sure we're working with a "unique" curve is to "axis-align" it.

    -

    Aligning also simplifies a curve's functions. We can translate (move) the curve so that the first point lies on (0,0), which turns our n term polynomial functions into n-1 term functions. The order stays the same, but we have less terms. Then, we can rotate the curves so that the last point always lies on the x-axis, too, making its coordinate (...,0). This further simplifies the function for the y-component to an n-2 term function. For instance, if we have a cubic curve such as this:

    - - -

    Then translating it so that the first coordinate lies on (0,0), moving all x coordinates by -120, and all y coordinates by -160, gives us:

    - - -

    If we then rotate the curve so that its end point lies on the x-axis, the coordinates (integer-rounded for illustrative purposes here) become:

    - - -

    If we drop all the zero-terms, this gives us:

    - - -

    We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae:

    - - - 脚本被禁用。显示备用图片。 - - - -

     

    - - - 脚本被禁用。显示备用图片。 - - - + +

    + We can see that our original curve definition has been simplified considerably. The following graphics illustrate the result of aligning + our example curves to the x-axis, with the cubic case using the coordinates that were just used in the example formulae: +

    + + + 脚本被禁用。显示备用图片。 + + +

     

    + + + 脚本被禁用。显示备用图片。 + + +
    +
    +

    + + Tight bounding boxes +

    +

    + With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, + recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding + box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T. +

    +

    We now have nice tight bounding boxes for our curves:

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    -
    -
    -

    Tight bounding boxes

    -

    With our knowledge of bounding boxes, and curve alignment, We can now form the "tight" bounding box for curves. We first align our curve, recording the translation we performed, "T", and the rotation angle we used, "R". We then determine the aligned curve's normal bounding box. Once we have that, we can map that bounding box back to our original curve by rotating it by -R, and then translating it by -T.

    -

    We now have nice tight bounding boxes for our curves:

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    - -

    These are, strictly speaking, not necessarily the tightest possible bounding boxes. It is possible to compute the optimal bounding box by determining which spanning lines we need to effect a minimal box area, but because of the parametric nature of Bézier curves this is actually a rather costly operation, and the gain in bounding precision is often not worth it.

    - -
    -
    -

    Curve inflections

    -

    Now that we know how to align a curve, there's one more thing we can calculate: inflection points. Imagine we have a variable size circle that we can slide up against our curve. We place it against the curve and adjust its radius so that where it touches the curve, the curvatures of the curve and the circle are the same, and then we start to slide the circle along the curve - for quadratic curves, we can always do this without the circle behaving oddly: we might have to change the radius of the circle as we slide it along, but it'll always sit against the same side of the curve.

    -

    But what happens with cubic curves? Imagine we have an S curve and we place our circle at the start of the curve, and start sliding it along. For a while we can simply adjust the radius and things will be fine, but once we get to the midpoint of that S, something odd happens: the circle "flips" from one side of the curve to the other side, in order for the curvatures to keep matching. This is called an inflection, and we can find out where those happen relatively easily.

    -

    What we need to do is solve a simple equation:

    - - -

    What we're saying here is that given the curvature function C(t), we want to know for which values of t this function is zero, meaning there is no "curvature", which will be exactly at the point between our circle being on one side of the curve, and our circle being on the other side of the curve. So what does C(t) look like? Actually something that seems not too hard:

    - - -

    The function C(t) is the cross product between the first and second derivative functions for the parametric dimensions of our curve. And, as already shown, derivatives of Bézier curves are just simpler Bézier curves, with very easy to compute new coefficients, so this should be pretty easy.

    -

    However as we've seen in the section on aligning, aligning lets us simplify things a lot, by completely removing the contributions of the first coordinate from most mathematical evaluations, and removing the last y coordinate as well by virtue of the last point lying on the x-axis. So, while we can evaluate C(t) = 0 for our curve, it'll be much easier to first axis-align the curve and then evaluating the curvature function.

    -
    - -

    Let's derive the full formula anyway

    -

    Of course, before we do our aligned check, let's see what happens if we compute the curvature function without axis-aligning. We start with the first and second derivatives, given our basis functions:

    - - -

    And of course the same functions for y:

    - - -

    Asking a computer to now compose the C(t) function for us (and to expand it to a readable form of simple terms) gives us this rather overly complicated set of arithmetic expressions:

    - - -

    That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all disappear if we axis-align our curve, which is why aligning is a great idea.

    -
    + +

    + That is... unwieldy. So, we note that there are a lot of terms that involve multiplications involving x1, y1, and y4, which would all + disappear if we axis-align our curve, which is why aligning is a great idea. +

    + -

    Aligning our curve so that three of the eight coefficients become zero, and observing that scale does not affect finding t values, we end up with the following simple term function for C(t):

    - - -

    That's a lot easier to work with: we see a fair number of terms that we can compute and then cache, giving us the following simplification:

    - - -

    This is a plain quadratic curve, and we know how to solve C(t) = 0; we use the quadratic formula:

    - - -

    We can easily compute this value if the discriminator isn't a negative number (because we only want real roots, not complex roots), and if x is not zero, because divisions by zero are rather useless.

    -

    Taking that into account, we compute t, we disregard any t value that isn't in the Bézier interval [0,1], and we now know at which t value(s) our curve will inflect.

    - - - 脚本被禁用。显示备用图片。 - - - - -
    -
    -

    The canonical form (for cubic curves)

    -

    While quadratic curves are relatively simple curves to analyze, the same cannot be said of the cubic curve. As a curvature is controlled by more than one control point, it exhibits all kinds of features like loops, cusps, odd colinear features, and as many as two inflection points because the curvature can change direction up to three times. Now, knowing what kind of curve we're dealing with means that some algorithms can be run more efficiently than if we have to implement them as generic solvers, so is there a way to determine the curve type without lots of work?

    -

    As it so happens, the answer is yes, and the solution we're going to look at was presented by Maureen C. Stone from Xerox PARC and Tony D. deRose from the University of Washington in their joint paper "A Geometric Characterization of Parametric Cubic curves". It was published in 1989, and defines curves as having a "canonical" form (i.e. a form that all curves can be reduced to) from which we can immediately tell what features a curve will have. So how does it work?

    -

    The first observation that makes things work is that if we have a cubic curve with four points, we can apply a linear transformation to these points such that three of the points end up on (0,0), (0,1) and (1,1), with the last point then being "somewhere". After applying that transformation, the location of that last point can then tell us what kind of curve we're dealing with. Specifically, we see the following breakdown:

    - - - 脚本被禁用。显示备用图片。 - - - -

    This is a fairly funky image, so let's see what the various parts of it mean...

    -

    We see the three fixed points at (0,0), (0,1) and (1,1). The various regions and boundaries indicate what property the original curve will have, if the fourth point is in/on that region or boundary. Specifically, if the fourth point is...

    -
      -
    1. ...anywhere inside the red zone, but not on its boundaries, the curve will be self-intersecting (yielding a loop). We won't know where it self-intersects (in terms of t values), but we are guaranteed that it does.

      -
    2. -
    3. ...on the left (red) edge of the red zone, the curve will have a cusp. We again don't know where, but we know there is one. This edge is described by the function:

      - - -
    4. -
    5. ...on the almost circular, lower right (pink) edge, the curve's end point touches the curve, forming a loop. This edge is described by the function:

      - - -
    6. -
    7. ...on the top (blue) edge, the curve's start point touches the curve, forming a loop. This edge is described by the function:

      - - -
    8. -
    9. ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

      -
    10. -
    11. ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two inflections (switching from concave to convex and then back again, or from convex to concave and then back again).

      -
    12. -
    13. ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

      -
    14. -
    -

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    -
    + + +
  • +

    ...inside the lower (green) zone, past y=1, the curve will have a single inflection (switching concave/convex once).

    +
  • +
  • +

    + ...between the left and lower boundaries (below the cusp line but above the single-inflection line), the curve will have two + inflections (switching from concave to convex and then back again, or from convex to concave and then back again). +

    +
  • +
  • ...anywhere on the right of self-intersection zone, the curve will have no inflections. It'll just be a simple arch.

  • + +

    Of course, this map is fairly small, but the regions extend to infinity, with well defined boundaries.

    +
    +

    Wait, where do those lines come from?

    +

    + Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, + and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield + formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic + expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the + properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval. +

    +

    + The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general + cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to + it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general + curve does over the interval t=0 to t=1. +

    +

    + For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you + can probably follow along with this paper, although you might have to read it a few times before all the bits "click". +

    +
    -

    Wait, where do those lines come from?

    -

    Without repeating the paper mentioned at the top of this section, the loop-boundaries come from rewriting the curve into canonical form, and then solving the formulae for which constraints must hold for which possible curve properties. In the paper these functions yield formulae for where you will find cusp points, or loops where we know t=0 or t=1, but those functions are derived for the full cubic expression, meaning they apply to t=-∞ to t=∞... For Bézier curves we only care about the "clipped interval" t=0 to t=1, so some of the properties that apply when you look at the curve over an infinite interval simply don't apply to the Bézier curve interval.

    -

    The right bound for the loop region, indicating where the curve switches from "having inflections" to "having a loop", for the general cubic curve, is actually mirrored over x=1, but for Bézier curves this right half doesn't apply, so we don't need to pay attention to it. Similarly, the boundaries for t=0 and t=1 loops are also nice clean curves but get "cut off" when we only look at what the general curve does over the interval t=0 to t=1.

    -

    For the full details, head over to the paper and read through sections 3 and 4. If you still remember your high school pre-calculus, you can probably follow along with this paper, although you might have to read it a few times before all the bits "click".

    -
    - -

    So now the question becomes: how do we manipulate our curve so that it fits this canonical form, with three fixed points, and one "free" point? Enter linear algebra. Don't worry, I'll be doing all the math for you, as well as show you what the effect is on our curves, but basically we're going to be using linear algebra, rather than calculus, because "it's way easier". Sometimes a calculus approach is very hard to work with, when the equivalent geometrical solution is super obvious.

    -

    The approach is going to start with a curve that doesn't have all-colinear points (so we need to make sure the points don't all fall on a straight line), and then applying three graphics operations that you will probably have heard of: translation (moving all points by some fixed x- and y-distance), scaling (multiplying all points by some x and y scale factor), and shearing (an operation that turns rectangles into parallelograms).

    -

    Step 1: we translate any curve by -p1.x and -p1.y, so that the curve starts at (0,0). We're going to make use of an interesting trick here, by pretending our 2D coordinates are 3D, with the z coordinate simply always being 1. This is an old trick in graphics to overcome the limitations of 2D transformations: without it, we can only turn (x,y) coordinates into new coordinates of the form (ax + by, cx + dy), which means we can't do translation, since that requires we end up with some kind of (x + a, y + b). If we add a bogus z coordinate that is always 1, then we can suddenly add arbitrary values. For example:

    - - -

    Sweet! z stays 1, so we can effectively ignore it entirely, but we added some plain values to our x and y coordinates. So, if we want to subtract p1.x and p1.y, we use:

    - - -

    Running all our coordinates through this transformation gives a new set of coordinates, let's call those U, where the first coordinate lies on (0,0), and the rest is still somewhat free. Our next job is to make sure point 2 ends up lying on the x=0 line, so what we want is a transformation matrix that, when we run it, subtracts x from whatever x we currently have. This is called shearing, and the typical x-shear matrix and its transformation looks like this:

    - - -

    So we want some shearing value that, when multiplied by y, yields -x, so our x coordinate becomes zero. That value is simply -x/y, because *-x/y * y = -x*. Done:

    - - -

    Now, running this on all our points generates a new set of coordinates, let's call those V, which now have point 1 on (0,0) and point 2 on (0, some-value), and we wanted it at (0,1), so we need to do some scaling to make sure it ends up at (0,1). Additionally, we want point 3 to end up on (1,1), so we can also scale x to make sure its x-coordinate will be 1 after we run the transform. That means we'll be x-scaling by 1/point3x, and y-scaling by point2y. This is really easy:

    - - -

    Then, finally, this generates a new set of coordinates, let's call those W, of which point 1 lies on (0,0), point 2 lies on (0,1), and point three lies on (1, ...) so all that's left is to make sure point 3 ends up at (1,1) - but we can't scale! Point 2 is already in the right place, and y-scaling would move it out of (0,1) again, so our only option is to y-shear point three, just like how we x-sheared point 2 earlier. In this case, we do the same trick, but with y/x rather than x/y because we're not x-shearing but y-shearing. Additionally, we don't actually want to end up at zero (which is what we did before) so we need to shear towards an offset, in this case 1:

    - - -

    And this generates our final set of four coordinates. Of these, we already know that points 1 through 3 are (0,0), (0,1) and (1,1), and only the last coordinate is "free". In fact, given any four starting coordinates, the resulting "transformation mapped" coordinate will be:

    - - -

    Okay, well, that looks plain ridiculous, but: notice that every coordinate value is being offset by the initial translation, and also notice that a lot of terms in that expression are repeated. Even though the maths looks crazy as a single expression, we can just pull this apart a little and end up with an easy-to-calculate bit of code!

    -

    First, let's just do that translation step as a "preprocessing" operation so we don't have to subtract the values all the time. What does that leave?

    - - -

    Suddenly things look a lot simpler: the mapped x is fairly straight forward to compute, and we see that the mapped y actually contains the mapped x in its entirety, so we'll have that part already available when we need to evaluate it. In fact, let's pull out all those common factors to see just how simple this is:

    - - -

    That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it look!

    -
    + +

    + That's kind of super-simple to write out in code, I think you'll agree. Coding math tends to be easier than the formulae initially make it + look! +

    +
    +

    How do you track all that?

    +

    + Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply + use Mathematica. Tracking all this math by hand is insane, and we invented computers, + literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the + program do the math for me. And real math, too, with symbols, not with numbers. In fact, + here's the Mathematica notebook if you want to see how + this works for yourself. +

    +

    + Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's + $344 for home use, up from $295 when I original wrote this, but it's + also free when you buy a $35 raspberry pi. Obviously, I bought a + raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do + it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for. +

    +
    -

    How do you track all that?

    -

    Doing maths can be a pain, so whenever possible, I like to make computers do the work for me. Especially for things like this, I simply use Mathematica. Tracking all this math by hand is insane, and we invented computers, literally, to do this for us. I have no reason to use pen and paper when I can write out what I want to do in a program, and have the program do the math for me. And real math, too, with symbols, not with numbers. In fact, here's the Mathematica notebook if you want to see how this works for yourself.

    -

    Now, I know, you're thinking "but Mathematica is super expensive!" and that's true, it's $344 for home use, up from $295 when I original wrote this, but it's also free when you buy a $35 raspberry pi. Obviously, I bought a raspberry pi, and I encourage you to do the same. With that, as long as you know what you want to do, Mathematica can just do it for you. And we don't have to be geniuses to work out what the maths looks like. That's what we have computers for.

    -
    - -

    So, let's write up a sketch that'll show us the canonical form for any curve drawn in blue, overlaid on our canonical map, so that we can immediately tell which features our curve must have, based on where the fourth coordinate is located on the map:

    - - - 脚本被禁用。显示备用图片。 - - - - -
    -
    -

    Finding Y, given X

    -

    One common task that pops up in things like CSS work, or parametric equalizers, or image leveling, or any other number of applications where Bézier curves are used as control curves in a way that there is really only ever one "y" value associated with one "x" value, you might want to cut out the middle man, as it were, and compute "y" directly based on "x". After all, the function looks simple enough, finding the "y" value should be simple too, right? Unfortunately, not really. However, it is possible and as long as you have some code in place to help, it's not a lot of a work either.

    -

    We'll be tackling this problem in two stages: the first, which is the hard part, is figuring out which "t" value belongs to any given "x" value. For instance, have a look at the following graphic. On the left we have a Bézier curve that looks for all intents and purposes like it fits our criteria: every "x" has one and only one associated "y" value. On the right we see the function for just the "x" values: that's a cubic curve, but not a really crazy cubic curve. If you move the graphic's slider, you will see a red line drawn that corresponds to the x coordinate: this is a vertical line in the left graphic, and a horizontal line on the right.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    Now, if you look more closely at that right graphic, you'll notice something interesting: if we treat the red line as "the x axis", then the point where the function crosses our line is really just a root for the cubic function x(t) through a shifted "x-axis"... and we've already seen how to calculate roots, so let's just run cubic root finding - and not even the complicated cubic case either: because of the kind of curve we're starting with, we know there is only root, simplifying the code we need!

    -

    First, let's look at the function for x(t):

    - - -

    We can rewrite this to a plain polynomial form, by just fully writing out the expansion and then collecting the polynomial factors, as:

    - - -

    Nothing special here: that's a standard cubic polynomial in "power" form (i.e. all the terms are ordered by their power of t). So, given that a, b, c, d, and x(t) are all known constants, we can trivially rewrite this (by moving the x(t) across the equal sign) as:

    - - -

    You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using Cardano's algorithm, and we're left with some rather short code:

    + +

    + You might be wondering "where did all the other 'minus x' for all the other values a, b, c, and d go?" and the answer there is that they + all cancel out, so the only one we actually need to subtract is the one at the end. Handy! So now we just solve this equation using + Cardano's algorithm, and we're left with some rather short code: +

    - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    So the procedure is fairly straight forward: pick an x, find the associated t value, evaluate our curve for that t value, which gives us the curve's {x,y} coordinate, which means we know y for this x. Move the slider for the following graphic to see this in action:

    - - - 脚本被禁用。显示备用图片。 - - - - - +y = curve.get(t).y + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    -

    Arc length

    -

    How long is a Bézier curve? As it turns out, that's not actually an easy question, because the answer requires maths that —much like root finding— cannot generally be solved the traditional way. If we have a parametric curve with fx(t) and fy(t), then the length of the curve, measured from start point to some point t = z, is computed using the following seemingly straight forward (if a bit overwhelming) formula:

    - - -

    or, more commonly written using Leibnitz notation as:

    - - -

    This formula says that the length of a parametric curve is in fact equal to the area underneath a function that looks a remarkable amount like Pythagoras' rule for computing the diagonal of a straight angled triangle. This sounds pretty simple, right? Sadly, it's far from simple... cutting straight to after the chase is over: for quadratic curves, this formula generates an unwieldy computation, and we're simply not going to implement things that way. For cubic Bézier curves, things get even more fun, because there is no "closed form" solution, meaning that due to the way calculus works, there is no generic formula that allows you to calculate the arc length. Let me just repeat this, because it's fairly crucial: for cubic and higher Bézier curves, there is no way to solve this function if you want to use it "for all possible coordinates".

    -

    Seriously: It cannot be done.

    -

    So we turn to numerical approaches again. The method we'll look at here is the Gauss quadrature. This approximation is a really neat trick, because for any nth degree polynomial it finds approximated values for an integral really efficiently. Explaining this procedure in length is way beyond the scope of this page, so if you're interested in finding out why it works, I can recommend the University of South Florida video lecture on the procedure, linked in this very paragraph. The general solution we're looking for is the following:

    - - -

    In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the better the approximation:

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    + +

    + In plain text: an integral function can always be treated as the sum of an (infinite) number of (infinitely thin) rectangular strips + sitting "under" the function's plotted graph. To illustrate this idea, the following graph shows the integral for a sinusoid function. The + more strips we use (and of course the more we use, the thinner they get) the closer we get to the true area under the curve, and thus the + better the approximation: +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    -

    Now, infinitely many terms to sum and infinitely thin rectangles are not something that computers can work with, so instead we're going to approximate the infinite summation by using a sum of a finite number of "just thin" rectangular strips. As long as we use a high enough number of thin enough rectangular strips, this will give us an approximation that is pretty close to what the real value is.

    -

    So, the trick is to come up with useful rectangular strips. A naive way is to simply create n strips, all with the same width, but there is a far better way using special values for C and f(t) depending on the value of n, which indicates how many strips we'll use, and it's called the Legendre-Gauss quadrature.

    -

    This approach uses strips that are not spaced evenly, but instead spaces them in a special way based on describing the function as a polynomial (the more strips, the more accurate the polynomial), and then computing the exact integral for that polynomial. We're essentially performing arc length computation on a flattened curve, but flattening it based on the intervals dictated by the Legendre-Gauss solution.

    -
    - -

    Note that one requirement for the approach we'll use is that the integral must run from -1 to 1. That's no good, because we're dealing with Bézier curves, and the length of a section of curve applies to values which run from 0 to "some value smaller than or equal to 1" (let's call that value z). Thankfully, we can quite easily transform any integral interval to any other integral interval, by shifting and scaling the inputs. Doing so, we get the following:

    - - -

    That may look a bit more complicated, but the fraction involving z is a fixed number, so the summation, and the evaluation of the f(t) values are still pretty simple.

    -

    So, what do we need to perform this calculation? For one, we'll need an explicit formula for f(t), because that derivative notation is handy on paper, but not when we have to implement it. We'll also need to know what these Ci and ti values should be. Luckily, that's less work because there are actually many tables available that give these values, for any n, so if we want to approximate our integral with only two terms (which is a bit low, really) then these tables would tell us that for n=2 we must use the following values:

    - - -

    Which means that in order for us to approximate the integral, we must plug these values into the approximate function, which gives us:

    - - -

    We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the Bézier curve functions Bx(t) and By(t).

    -
    + +

    + We can program that pretty easily, provided we have that f(t) available, which we do, as we know the full description for the + Bézier curve functions Bx(t) and By(t). +

    + -

    If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the start/end points on the inside?

    - - - 脚本被禁用。显示备用图片。 - - - +

    + If we use the Legendre-Gauss values for our C values (thickness for each strip) and t values (location of each strip), + we can determine the approximate length of a Bézier curve by computing the Legendre-Gauss sum. The following graphic shows a cubic curve, + with its computed lengths; Go ahead and change the curve, to see how its length changes. One thing worth trying is to see if you can make + a straight line, and see if the length matches what you'd expect. What if you form a line with the control points on the outside, and the + start/end points on the inside? +

    + + + 脚本被禁用。显示备用图片。 + + + +
    +
    +

    + + Approximated arc length +

    +

    + Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length + instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This + will come with an error, but this can be made arbitrarily small by increasing the segment count. +

    +

    + If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal + effort: +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + + + 脚本被禁用。显示备用图片。 + + + + + +
    -
    -
    -

    Approximated arc length

    -

    Sometimes, we don't actually need the precision of a true arc length, and we can get away with simply computing the approximate arc length instead. The by far fastest way to do this is to flatten the curve and then simply calculate the linear distance from point to point. This will come with an error, but this can be made arbitrarily small by increasing the segment count.

    -

    If we combine the work done in the previous sections on curve flattening and arc length computation, we can implement these with minimal effort:

    -
    - - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - -
    - -

    You may notice that even though the error in length is actually pretty significant in absolute terms, even at a low number of segments we get a length that agrees with the true length when it comes to just the integer part of the arc length. Quite often, approximations can drastically speed things up!

    - -
    -
    -

    Curvature of a curve

    -

    If we have two curves, and we want to line them in up in a way that "looks right", what would we use as metric to let a computer decide what "looks right" means?

    -

    For instance, we can start by ensuring that the two curves share an end coordinate, so that there is no "gap" between the end of one and the start of the next curve, but that won't guarantee that things look right: both curves can be going in wildly different directions, and the resulting joined geometry will have a corner in it, rather than a smooth transition from one curve to the next.

    -

    What we want is to ensure that the curvature at the transition from one curve to the next "looks good". So, we start with a shared coordinate, and then also require that derivatives for both curves match at that coordinate. That way, we're assured that their tangents line up, which must mean the curve transition is perfectly smooth. We can even make the second, third, etc. derivatives match up for better and better transitions.

    -

    Problem solved!

    -

    However, there's a problem with this approach: if we think about this a little more, we realise that "what a curve looks like" and its derivative values are pretty much entirely unrelated. After all, the section on reordering curves showed us that the same looking curve can have an infinite number of curve expressions of arbitrarily high Bézier degree, and each of those will have wildly different derivative values.

    -

    So what we really want is some kind of expression that's not based on any particular expression of t, but is based on something that is invariant to the kind of function(s) we use to draw our curve. And the prime candidate for this is our curve expression, reparameterised for distance: no matter what order of Bézier curve we use, if we were able to rewrite it as a function of distance-along-the-curve, all those different degree Bézier functions would end up being the same function for "coordinate at some distance D along the curve".

    -

    We've seen this before... that's the arc length function.

    -

    So you might think that in order to find the curvature of a curve, we now need to solve the arc length function itself, and that this would be quite a problem because we just saw that there is no way to actually do that. Thankfully, we don't. We only need to know the form of the arc length function, which we saw above and is fairly simple, rather than needing to solve the arc length function. If we start with the arc length expression and the run through the steps necessary to determine its derivative (with an alternative, shorter demonstration of how to do this found over on Stackexchange), then the integral that was giving us so much problems in solving the arc length function disappears entirely (because of the fundamental theorem of calculus), and what we're left with us some surprisingly simple maths that relates curvature (denoted as κ, "kappa") to—and this is the truly surprising bit—a specific combination of derivatives of our original function.

    -

    Let me highlight what just happened, because it's pretty special:

    -
      -
    1. we wanted to make curves line up, and initially thought to match the curves' derivatives, but
    2. -
    3. that turned out to be a really bad choice, so instead
    4. -
    5. we picked a function that is basically impossible to work with, and then worked with that, which
    6. -
    7. gives us a simple formula that is and expression using the curves' derivatives.
    8. -
    -

    That's crazy!

    -

    But that's also one of the things that makes maths so powerful: even if your initial ideas are off the mark, you might be much closer than you thought you were, and the journey from "thinking we're completely wrong" to "actually being remarkably close to being right" is where we can find a lot of insight.

    -

    So, what does the function look like? This:

    - - -

    Which is really just a "short form" that glosses over the fact that we're dealing with functions of t, so let's expand that a tiny bit:

    - - -

    And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so evaluating this function for any t value is just a matter of basic arithematics.

    -

    In fact, let's just implement it right now:

    + +

    + And while that's a little more verbose, it's still just as simple to work with as the first function: the curvature at some point on any + (and this cannot be overstated: any) curve is a ratio between the first and second derivative cross product, and something that + looks oddly similar to the standard Euclidean distance function. And nothing in these functions is hard to calculate either: for Bézier + curves, simply knowing our curve coordinates means we know what the first and second derivatives are, and so + evaluating this function for any t value is just a matter of basic arithematics. +

    +

    In fact, let's just implement it right now:

    - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    That was easy! (Well okay, that "not a number" value will need to be taken into account by downstream code, but that's a reality of programming anyway)

    -

    With all of that covered, let's line up some curves! The following graphic gives you two curves that look identical, but use quadratic and cubic functions, respectively. As you can see, despite their derivatives being necessarily different, their curvature (thanks to being derived based on maths that "ignores" specific function derivative, and instead gives a formula that smooths out any differences) is exactly the same. And because of that, we can put them together such that the point where they overlap has the same curvature for both curves, giving us the smoothest transition.

    - - - 脚本被禁用。显示备用图片。 - - - -

    One thing you may have noticed in this sketch is that sometimes the curvature looks fine, but seems to be pointing in the wrong direction, making it hard to line up the curves properly. A way around that, of course, is to show the curvature on both sides of the curve, so let's just do that. But let's take it one step further: we can also compute the associated "radius of curvature", which gives us the implicit circle that "fits" the curve's curvature at any point, using what is possibly the simplest bit of maths found in this entire primer:

    - - -

    So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" our curve at some point that we can control by using a slider:

    - - - 脚本被禁用。显示备用图片。 - - - - - + +

    + So let's revisit the previous graphic with the curvature visualised on both sides of our curves, as well as showing the circle that "fits" + our curve at some point that we can control by using a slider: +

    + + + 脚本被禁用。显示备用图片。 + + + + + +
    +
    +

    + + Tracing a curve at fixed distance intervals +

    +

    + Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance + intervals over time, like a train along a track, and you want to use Bézier curves. +

    +

    Now you have a problem.

    +

    + The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a + track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick + t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In + fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another. +

    +

    + The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be + straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To + make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for + this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length + function, which can have more inflection points. +

    + + + 脚本被禁用。显示备用图片。 + + +

    + So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through + the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and + then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of + points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two + points, but if we have a high enough number of samples, we don't even need to bother. +

    +

    + So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t + curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks + like, by coloring each section of curve between two distance markers differently: +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    +

    + However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to + constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT). +

    +
    +
    +

    + + Intersections +

    +

    + Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to + work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to + perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box + overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the + actual curve. +

    +

    + We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by + implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both + lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution. +

    +

    Line-line intersections

    +

    + If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are + an intervals on by linear algebra, using the procedure outlined in this + top coder + article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line + segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments. +

    +

    + The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on + (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection + point). +

    + + + 脚本被禁用。显示备用图片。 + + + +
    +

    Implementing line-line intersections

    +

    + Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, + but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe + you're using point structs for the line. Let's get coding: +

    -
    -
    -

    Tracing a curve at fixed distance intervals

    -

    Say you want to draw a curve with a dashed line, rather than a solid line, or you want to move something along the curve at fixed distance intervals over time, like a train along a track, and you want to use Bézier curves.

    -

    Now you have a problem.

    -

    The reason you have a problem is that Bézier curves are parametric functions with non-linear behaviour, whereas moving a train along a track is about as close to a practical example of linear behaviour as you can get. The problem we're faced with is that we can't just pick t values at some fixed interval and expect the Bézier functions to generate points that are spaced a fixed distance apart. In fact, let's look at the relation between "distance along a curve" and "t value", by plotting them against one another.

    -

    The following graphic shows a particularly illustrative curve, and its distance-for-t plot. For linear traversal, this line needs to be straight, running from (0,0) to (length,1). That is, it's safe to say, not what we'll see: we'll see something very wobbly, instead. To make matters even worse, the distance-for-t function is also of a much higher order than our curve is: while the curve we're using for this exercise is a cubic curve, which can switch concave/convex form twice at best, the distance function is our old friend the arc length function, which can have more inflection points.

    - - - 脚本被禁用。显示备用图片。 - - - -

    So, how do we "cut up" the arc length function at regular intervals, when we can't really work with it? We basically cheat: we run through the curve using t values, determine the distance-for-this-t-value at each point we generate during the run, and then we find "the closest t value that matches some required distance" using those values instead. If we have a low number of points sampled, we can then even refine which t value "should" work for our desired distance by interpolating between two points, but if we have a high enough number of samples, we don't even need to bother.

    -

    So let's do exactly that: the following graph is similar to the previous one, showing how we would have to "chop up" our distance-for-t curve in order to get regularly spaced points on the curve. It also shows what using those t values on the real curve looks like, by coloring each section of curve between two distance markers differently:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    Use the slider to increase or decrease the number of equidistant segments used to colour the curve.

    -

    However, are there better ways? One such way is discussed in "Moving Along a Curve with Specified Speed" by David Eberly of Geometric Tools, LLC, but basically because we have no explicit length function (or rather, one we don't have to constantly compute for different intervals), you may simply be better off with a traditional lookup table (LUT).

    - -
    -
    -

    Intersections

    -

    Let's look at some more things we will want to do with Bézier curves. Almost immediately after figuring out how to get bounding boxes to work, people tend to run into the problem that even though the minimal bounding box (based on rotation) is tight, it's not sufficient to perform true collision detection. It's a good first step to make sure there might be a collision (if there is no bounding box overlap, there can't be one), but in order to do real collision detection we need to know whether or not there's an intersection on the actual curve.

    -

    We'll do this in steps, because it's a bit of a journey to get to curve/curve intersection checking. First, let's start simple, by implementing a line-line intersection checker. While we can solve this the traditional calculus way (determine the functions for both lines, then compute the intersection by equating them and solving for two unknowns), linear algebra actually offers a nicer solution.

    -

    Line-line intersections

    -

    If we have two line segments with two coordinates each, segments A-B and C-D, we can find the intersection of the lines these segments are an intervals on by linear algebra, using the procedure outlined in this top coder article. Of course, we need to make sure that the intersection isn't just on the lines our line segments lie on, but actually on our line segments themselves. So after we find the intersection, we need to verify that it lies without the bounds of our original line segments.

    -

    The following graphic implements this intersection detection, showing a red point for an intersection on the lines our segments lie on (thus being a virtual intersection point), and a green point for an intersection that lies on both segments (being a real intersection point).

    - - - 脚本被禁用。显示备用图片。 - - - -
    - -

    Implementing line-line intersections

    -

    Let's have a look at how to implement a line-line intersection checking function. The basics are covered in the article mentioned above, but sometimes you need more function signatures, because you might not want to call your function with eight distinct parameters. Maybe you're using point structs for the line. Let's get coding:

    - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    - -
    + return lli4(line1.p1, line1.p2, line2.p1, line2.p2) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + -

    What about curve-line intersections?

    -

    Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the section on finding extremities.

    -
    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -
    +

    What about curve-line intersections?

    +

    + Curve/line intersection is more work, but we've already seen the techniques we need to use in order to perform it: first we + translate/rotate both the line and curve together, in such a way that the line coincides with the x-axis. This will position the curve in + a way that makes it cross the line at points where its y-function is zero. By doing this, the problem of finding intersections between a + curve and a line has now become the problem of performing root finding on our translated/rotated curve, as we already covered in the + section on finding extremities. +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + 脚本被禁用。显示备用图片。 + + + +
    -

    Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and curve splitting.

    +

    + Curve/curve intersection, however, is more complicated. Since we have no straight line to align to, we can't simply align one of the + curves and be left with a simple procedure. Instead, we'll need to apply two techniques we've met before: de Casteljau's algorithm, and + curve splitting. +

    +
    +
    +

    + + Curve/curve intersection +

    +

    + Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" + technique: +

    +
      +
    1. + Take two curves C1 and C2, and treat them as a pair. +
    2. +
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. +
    5. + With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2). +
    6. +
    7. + For each pair, check whether their bounding boxes overlap. +
        +
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. +
      3. If there is overlap, rerun all steps for this pair.
      4. +
      +
    8. +
    9. + Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we + might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value + (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative). +
    10. +
    +

    + This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then + taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections. +

    +

    + The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click + the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value + that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the + algorithm, so you can try this with lots of different curves. +

    +

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    + + + 脚本被禁用。显示备用图片。 + + + + + + +

    + Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into + two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at + t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each + iteration, and each successive step homing in on the curve's self-intersection points. +

    +
    +
    +

    + + The projection identity +

    +

    + De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to + draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. + Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point + around to change the curve's shape. +

    +

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    +

    + In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want + to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know + has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for + reasons that will become obvious. +

    +

    + So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following + graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which + taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what + happens to that value? +

    +
    + + + 脚本被禁用。显示备用图片。 + + + + + + + + 脚本被禁用。显示备用图片。 + + + + + +
    -
    -
    -

    Curve/curve intersection

    -

    Using de Casteljau's algorithm to split the curve we can now implement curve/curve intersection finding using a "divide and conquer" technique:

    -
      -
    1. Take two curves C1 and C2, and treat them as a pair.
    2. -
    3. If their bounding boxes overlap, split up each curve into two sub-curves
    4. -
    5. With C1.1, C1.2, C2.1 and C2.2, form four new pairs (C1.1,C2.1), (C1.1, C2.2), (C1.2,C2.1), and (C1.2,C2.2).
    6. -
    7. For each pair, check whether their bounding boxes overlap.
        -
      1. If their bounding boxes do not overlap, discard the pair, as there is no intersection between this pair of curves.
      2. -
      3. If there is overlap, rerun all steps for this pair.
      4. -
      -
    8. -
    9. Once the sub-curves we form are so small that they effectively occupy sub-pixel areas, we consider an intersection found, noting that we might have a cluster of multiple intersections at the sub-pixel level, out of which we pick one to act as "found" t value (we can either throw all but one away, we can average the cluster's t values, or you can do something even more creative).
    10. -
    -

    This algorithm will start with a single pair, "balloon" until it runs in parallel for a large number of potential sub-pairs, and then taper back down as it homes in on intersection coordinates, ending up with as many pairs as there are intersections.

    -

    The following graphic applies this algorithm to a pair of cubic curves, one step at a time, so you can see the algorithm in action. Click the button to run a single step in the algorithm, after setting up your curves in some creative arrangement. You can also change the value that is used in step 5 to determine whether the curves are small enough. Manipulating the curves or changing the threshold will reset the algorithm, so you can try this with lots of different curves.

    -

    (can you find the configuration that yields the maximum number of intersections between two cubic curves? Nine intersections!)

    - - - 脚本被禁用。显示备用图片。 - - - - - - -

    Finding self-intersections is effectively the same procedure, except that we're starting with a single curve, so we need to turn that into two separate curves first. This is trivially achieved by splitting at an inflection point, or if there are none, just splitting at t=0.5 first, and then running the exact same algorithm as above, with all non-overlapping curve pairs getting removed at each iteration, and each successive step homing in on the curve's self-intersection points.

    - -
    -
    -

    The projection identity

    -

    De Casteljau's algorithm is the pivotal algorithm when it comes to Bézier curves. You can use it not just to split curves, but also to draw them efficiently (especially for high-order Bézier curves), as well as to come up with curves based on three points and a tangent. Particularly this last thing is really useful because it lets us "mold" a curve, by picking it up at some point, and dragging that point around to change the curve's shape.

    -

    How does that work? Succinctly: we run de Casteljau's algorithm in reverse!

    -

    In order to run de Casteljau's algorithm in reverse, we need a few basic things: a start and end point, a point on the curve that we want to be moving around, which has an associated t value, and a point we've not explicitly talked about before, and as far as I know has no explicit name, but lives one iteration higher in the de Casteljau process then our on-curve point does. I like to call it "A" for reasons that will become obvious.

    -

    So let's use graphics instead of text to see where this "A" is, because text only gets us so far: move the sliders for the following graphics to see what, given a specific t value, our A coordinate is. As well as some other coordinates, which taken together let us derive a value that the graphics call "ratio": if you move the curve's points around, A, B, and C will move, what happens to that value?

    -
    - - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - -
    - -

    So these graphics show us several things:

    -
      -
    1. a point at the tip of the curve construction's "hat": let's call that A, as well as
    2. -
    3. our on-curve point give our chosen t value: let's call that B, and finally,
    4. -
    5. a point that we get by projecting A, through B, onto the line between the curve's start and end points: let's call that C.
    6. -
    7. for both quadratic and cubic curves, two points e1 and e2, which represent the single-to-last step in de Casteljau's algorithm: in the last step, we find B at (1-t) * e1 + t * e2.
    8. -
    9. for cubic curves, also the points v1 and v2, which together with A represent the first step in de Casteljau's algorithm: in the next step, we find e1 and e2.
    10. -
    -

    These three values A, B, and C allow us to derive an important identity formula for quadratic and cubic Bézier curves: for any point on the curve with some t value, the ratio of distances from A to B and B to C is fixed: if some t value sets up a C that is 20% away from the start and 80% away from the end, then it doesn't matter where the start, end, or control points are; for that t value, C will always lie at 20% from the start and 80% from the end point. Go ahead, pick an on-curve point in either graphic and then move all the other points around: if you only move the control points, start and end won't move, and so neither will C, and if you move either start or end point, C will move but its relative position will not change.

    -

    So, how can we compute C? We start with our observation that C always lies somewhere between the start and end points, so logically C will have a function that interpolates between those two coordinates:

    - - -

    If we can figure out what the function u(t) looks like, we'll be done. Although we do need to remember that this u(t) will have a different form depending on whether we're working with quadratic or cubic curves. Running through the maths (with thanks to Boris Zbarsky) shows us the following two formulae:

    - - -

    And

    - - -

    So, if we know the start and end coordinates and the t value, we know C without having to calculate the A or even B coordinates. In fact, we can do the same for the ratio function. As another function of t, we technically don't need to know what A or B or C are. It, too, can be expressed as a pure function of t.

    -

    We start by observing that, given A, B, and C, the following always holds:

    - - -

    Working out the maths for this, we see the following two formulae for quadratic and cubic curves:

    - - -

    And

    - - -

    Which now leaves us with some powerful tools: given three points (start, end, and "some point on the curve"), as well as a t value, we can construct curves. We can compute C using the start and end points and our u(t) function, and once we have C, we can use our on-curve point (B) and the ratio(t) function to find A:

    - - -

    With A found, finding e1 and e2 for quadratic curves is a matter of running the linear interpolation with t between start and A to yield e1, and between A and end to yield e2. For cubic curves, there is no single pair of points that can act as e1 and e2 (there are infinitely many, because the tangent at B is a free parameter for cubic curves) so as long as the distance ratio between e1 to B and B to e2 is the Bézier ratio (1-t):t, we are free to pick any pair, after which we can reverse engineer v1 and v2:

    - - -

    And then reverse engineer the curve's control points:

    - - -

    So: if we have a curve's start and end points, as well as some third point B that we want the curve to pass through, then for any t value we implicitly know all the ABC values, which (combined with an educated guess on appropriate e1 and e2 coordinates for cubic curves) gives us the necessary information to reconstruct a curve's "de Casteljau skeleton". Which means that we can now do several things: we can "fit" curves using only three points, which means we can also "mold" curves by moving an on-curve point but leaving its start and end points, and then reconstruct the curve based on where we moved the on-curve point to. These are very useful things, and we'll look at both in the next few sections.

    - -
    -
    -

    Creating a curve from three points

    -

    Given the preceding section, you might be wondering if we can use that knowledge to just "create" curves by placing some points and having the computer do the rest, to which the answer is: that's exactly what we can now do!

    -

    For quadratic curves, things are pretty easy. Technically, we'll need a t value in order to compute the ratio function used in computing the ABC coordinates, but we can just as easily approximate one by treating the distance between the start and B point, and B and end point as a ratio, using

    - - -

    With this code in place, creating a quadratic curve from three points is literally just computing the ABC values, and using A as our curve's control point:

    - - - 脚本被禁用。显示备用图片。 - - - -

    For cubic curves we need to do a little more work, but really only just a little. We're first going to assume that a decent curve through the three points should approximate a circular arc, which first requires knowing how to fit a circle to three points. You may remember (if you ever learned it!) that a line between two points on a circle is called a chord, and that one property of chords is that the line from the center of any chord, perpendicular to that chord, passes through the center of the circle.

    -

    That means that if we have three points on a circle, we have three (different) chords, and consequently, three (different) lines that go from those chords through the center of the circle: if we find two of those lines, then their intersection will be our circle's center, and the circle's radius will—by definition!—be the distance from the center to any of our three points:

    - - - 脚本被禁用。显示备用图片。 - - - -

    With that covered, we now also know the tangent line to our point B, because the tangent to any point on the circle is a line through that point, perpendicular to the line from that point to the center. That just leaves marking appropriate points e1 and e2 on that tangent, so that we can construct a new cubic curve hull. We use the approach as we did for quadratic curves to automatically determine a reasonable t value, and then our e1 and e2 coordinates must obey the standard de Casteljau rule for linear interpolation:

    - - -

    Where d is the total length of the line segment from e1 to e2. So how long do we make that? There are again all kinds of approaches we can take, and a simple-but-effective one is to set the length of that segment to "one third the length of the baseline". This forces e1 and e2 to always be the "linear curve" distance apart, which means if we place our three points on a line, it will actually look like a line. Nice! The last thing we'll need to do is make sure to flip the sign of d depending on which side of the baseline our B is located, so we don't end up creating a funky curve with a loop in it. To do this, we can use the atan2 function:

    - - -

    This angle φ will be between 0 and π if B is "above" the baseline (rotating all three points so that the start is on the left and the end is the right), so we can use a relatively straight forward check to make sure we're using the correct sign for our value d:

    - - -

    The result of this approach looks as follows:

    - - - 脚本被禁用。显示备用图片。 - - - -

    It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; more on that later), we're only trying to come up with some reasonable e1 and e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives us:

    - - - 脚本被禁用。显示备用图片。 - - - -

    That looks perfectly serviceable!

    -

    Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at implementing curve molding in the section after that, so read on!

    + +

    The result of this approach looks as follows:

    + + + 脚本被禁用。显示备用图片。 + + + +

    + It is important to remember that even though we're using a circular arc to come up with decent e1 and e2 terms, + we're not trying to perfectly create a circular arc with a cubic curve (which is good, because we can't; + more on that later), we're only trying to come up with some reasonable e1 and + e2 points so we can construct a new cubic curve... so now that we have those: let's see what kind of cubic curve that gives + us: +

    + + + 脚本被禁用。显示备用图片。 + + + +

    That looks perfectly serviceable!

    +

    + Of course, we can take this one step further: we can't just "create" curves, we also have (almost!) all the tools available to "mold" + curves, where we can reshape a curve by dragging a point on the curve around while leaving the start and end fixed, effectively molding + the shape as if it were clay or the like. We'll see the last tool we need to do that in the next section, and then we'll look at + implementing curve molding in the section after that, so read on! +

    +
    +
    +

    + + Projecting a point onto a Bézier curve +

    +

    + Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're + trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the + curve our cursor will be closest to. So, how do we project points onto a curve? +

    +

    + If the Bézier curve is of low enough order, we might be able to + work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a + numerical approach again. We'll be finding our ideal t value using a + binary search. First, we do a coarse distance-check based on + t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast: +

    -
    -
    -

    Projecting a point onto a Bézier curve

    -

    Before we can move on to actual curve molding, it'll be good if know how to actually be able to find "some point on the curve" that we're trying to click on. After all, if all we have is our Bézier coordinates, that is not in itself enough to figure out which point on the curve our cursor will be closest to. So, how do we project points onto a curve?

    -

    If the Bézier curve is of low enough order, we might be able to work out the maths for how to do this, and get a perfect t value back, but in general this is an incredibly hard problem and the easiest solution is, really, a numerical approach again. We'll be finding our ideal t value using a binary search. First, we do a coarse distance-check based on t values associated with the curve's "to draw" coordinates (using a lookup table, or LUT). This is pretty fast:

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    - -

    After this runs, we know that LUT[i] is the coordinate on the curve in our LUT that is closest to the point we want to project, so that's a pretty good initial guess as to what the best projection onto our curve is. To refine it, we note that LUT[i] is a better guess than both LUT[i-1] and LUT[i+1], but there might be an even better projection somewhere else between those two values, so that's what we're going to be testing for, using a variation of the binary search.

    -
      -
    1. we start with our point p, and the t values t1=LUT[i-1].t and t2=LUT[i+1].t, which span an interval v = t2-t1.
    2. -
    3. we test this interval in five spots: the start, middle, and end (which we already have), and the two points in between the middle and start/end points
    4. -
    5. we then check which of these five points is the closest to our original point p, and then repeat step 1 with the points before and after the closest point we just found.
    6. -
    -

    This makes the interval we check smaller and smaller at each iteration, and we can keep running the three steps until the interval becomes so small as to lead to distances that are, for all intents and purposes, the same for all points.

    -

    So, let's see that in action: in this case, I'm going to arbitrarily say that if we're going to run the loop until the interval is smaller than 0.001, and show you what that means for projecting your mouse cursor or finger tip onto a rather complex Bézier curve (which, of course, you can reshape as you like). Also shown are the original three points that our coarse check finds.

    - - - 脚本被禁用。显示备用图片。 - - - + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + -
    -
    -

    Intersections with a circle

    -

    It might seem odd to cover this subject so much later than the line/line, line/curve, and curve/curve intersection topics from several sections earlier, but the reason we can't cover circle/curve intersections is that we can't really discuss circle/curve intersection until we've covered the kind of lookup table (LUT) walking that the section on projecting a point onto a curve uses. To see why, let's look at what we would have to do if we wanted to find the intersections between a curve and a circle using calculus.

    -

    First, we observe that "finding intersections" in this case means that, given a circle defined by a center point c = (x,y) and a radius r, we want to find all points on the Bezier curve for which the distance to the circle's center point is equal to the circle radius, which by definition means those points lie on the circle, and so count as intersections. In maths, that means we're trying to solve:

    - - -

    Which seems simple enough. Unfortunately, when we expand that dist function, things get a lot more problematic:

    - - -

    And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the Abel-Ruffini theorem that we saw before, we can't solve this by just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left with a monstrous function to solve.

    -

    So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a bit more code to make sure we find all of them, but let's look at the steps involved:

    + +

    + And now we have a problem because that's a sixth degree polynomial inside the square root. So, thanks to the + Abel-Ruffini theorem that we saw before, we can't solve this by + just going "square both sides because we don't care about signs"... we can't solve a sixth degree polynomial. So, we're going to have to + actually evaluate that expression. We can "simplify" this by translating all our coordinates so that the center of the circle is (0,0) and + all our coordinates are shifted accordingly, which makes the cx and cy terms fall away, but then we're still left + with a monstrous function to solve. +

    +

    + So instead, we turn to the same kind of "LUT walking" that we saw for projecting points onto a curve, with a twist: instead of finding the + on-curve point with the smallest distance to our projection point, we want to find the on-curve point that has the exact distance + r to our projection point (namely, our circle center). Of course, there can be more than one such point, so there's also a + bit more code to make sure we find all of them, but let's look at the steps involved: +

    - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    - -

    This is very similar to the code in the previous section, with an extra input r for the circle radius, and a minor change in the "distance for this coordinate": rather than just distance(coordinate, p) we want to know the difference between that distance and the circle radius. After all, if that difference is zero, then the distance from the coordinate to the circle center is exactly the radius, so the coordinate lies on both the curve and the circle.

    -

    So far so good.

    -

    However, we also want to make sure we find all the points, not just a single one, so we need a little more code for that:

    + i = index + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    After running this code, values will be the list of all LUT coordinates that are closest to the distance r: we can use those values to run the same kind of refinement lookup we used for point projection (with the caveat that we're now not checking for smallest distance, but for "distance closest to r"), and we'll have all our intersection points. Of course, that does require explaining what findClosest does: rather than looking for a global minimum, we're now interested in finding a local minimum, so instead of checking a single point and looking at its distance value, we check three points ("current", "previous" and "before previous") and then check whether they form a local minimum:

    + start = i + 2; + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    In words: given a start index, the circle center and radius, and our LUT, we check where (closest to our start index) we can find a local minimum for the difference between "the distance from the curve to the circle center", and the circle's radius. We track this by looking at three values (associated with the indices index-2, index-1, and index), and we know we've found a local minimum if the three values show that the middle value (pd1) is less than either value beside it. When we do, we can set our "best guess, relative to start" as index-1. Of course, since we're now checking values relative to some start value, we might not find another candidate value at all, in which case we return start - 1, so that a simple "is the result less than start?" lets us determine that there are no more intersections to find.

    -

    Finally, while not necessary for point projection, there is one more step we need to perform when we run the binary refinement function on our candidate LUT indices, because we've so far only been testing using distances "closest to the radius of the circle", and that's actually not good enough... we need distances that are the radius of the circle. So, after running the refinement for each of these indices, we need to discard any final value that isn't the circle radius. And because we're working with floating point numbers, what this really means is that we need to discard any value that's a pixel or more "off". Or, if we want to get really fancy, "some small epsilon value".

    -

    Based on all of that, the following graphic shows this off for the standard cubic curve (which you can move the coordinates around for, of course) and a circle with a controllable radius centered on the graphic's center, using the code approach described above.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    And of course, for the full details, click that "view source" link.

    + return start + i + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + -
    -
    -

    Molding a curve

    -

    Armed with knowledge of the "ABC" relation, point-on-curve projection, and guestimating reasonable looking helper values for cubic curve construction, we can finally cover curve molding: updating a curve's shape interactively, by dragging points on the curve around.

    -

    For quadratic curve, this is a really simple trick: we project our cursor onto the curve, which gives us a t value and initial B coordinate. We don't even need the latter: with our t value and "wherever the cursor is" as target B, we can compute the associated C:

    - - -

    And then the associated A:

    - - -

    And we're done, because that's our new quadratic control point!

    - - - 脚本被禁用。显示备用图片。 - - - -

    As before, cubic curves are a bit more work, because while it's easy to find our initial t value and ABC values, getting those all-important e1 and e2 coordinates is going to pose a bit of a problem... in the section on curve creation, we were free to pick an appropriate t value ourselves, which allowed us to find appropriate e1 and e2 coordinates. That's great, but when we're curve molding we don't have that luxury: whatever point we decide to start moving around already has its own t value, and its own e1 and e2 values, and those may not make sense for the rest of the curve.

    -

    For example, let's see what happens if we just "go with what we get" when we pick a point and start moving it around, preserving its t value and e1/e2 coordinates:

    - - - 脚本被禁用。显示备用图片。 - - - -

    That looks reasonable, close to the original point, but the further we drag our point, the less "useful" things become. Especially if we drag our point across the baseline, rather than turning into a nice curve.

    -

    One way to combat this might be to combine the above approach with the approach from the creating curves section: generate both the "unchanged t/e1/e2" curve, as well as the "idealized" curve through the start/cursor/end points, with idealized t value, and then interpolating between those two curves:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    The slide controls the "falloff distance" relative to where the original point on the curve is, so that as we drag our point around, it interpolates with a bias towards "preserving t/e1/e2" closer to the original point, and bias towards "idealized" form the further away we move our point, with anything that's further than our falloff distance simply being the idealized curve. We don't even try to interpolate at that point.

    -

    A more advanced way to try to smooth things out is to implement continuous molding, where we constantly update the curve as we move around, and constantly change what our B point is, based on constantly projecting the cursor on the curve as we're updating it - this is, you won't be surprised to learn, tricky, and beyond the scope of this section: interpolation (with a reasonable distance) will do for now!

    - -
    -
    -

    Curve fitting

    -

    Given the previous section, one question you might have is "what if I don't want to guess t values?". After all, plenty of graphics packages do automated curve fitting, so how can we implement that in a way that just finds us reasonable t values all on its own?

    -

    And really this is just a variation on the question "how do I get the curve through these X points?", so let's look at that. Specifically, let's look at the answer: "curve fitting". This is in fact a rather rich field in geometry, applying to anything from data modelling to path abstraction to "drawing", so there's a fair number of ways to do curve fitting, but we'll look at one of the most common approaches: something called a least squares polynomial regression. In this approach, we look at the number of points we have in our data set, roughly determine what would be an appropriate order for a curve that would fit these points, and then tackle the question "given that we want an nth order curve, what are the coordinates we can find such that our curve is "off" by the least amount?".

    -

    Now, there are many ways to determine how "off" points are from the curve, which is where that "least squares" term comes in. The most common tool in the toolbox is to minimise the squared distance between each point we have, and the corresponding point on the curve we end up "inventing". A curve with a snug fit will have zero distance between those two, and a bad fit will have non-zero distances between every such pair. It's a workable metric. You might wonder why we'd need to square, rather than just ensure that distance is a positive value (so that the total error is easy to compute by just summing distances) and the answer really is "because it tends to be a little better". There's lots of literature on the web if you want to deep-dive the specific merits of least squared error metrics versus least absolute error metrics, but those are well beyond the scope of this material.

    -

    So let's look at what we end up with in terms of curve fitting if we start with the idea of performing least squares Bézier fitting. We're going to follow a procedure similar to the one described by Jim Herold over on his "Least Squares Bézier Fit" article, and end with some nice interactive graphics for doing some curve fitting.

    -

    Before we begin, we're going to use the curve in matrix form. In the section on matrices, I mentioned that some things are easier if we use the matrix representation of a Bézier curve rather than its calculus form, and this is one of those things.

    -

    As such, the first step in the process is expressing our Bézier curve as powers/coefficients/coordinate matrix T x M x C, by expanding the Bézier functions.

    -
    - -

    Revisiting the matrix representation

    -

    Rewriting Bézier functions to matrix form is fairly easy, if you first expand the function, and then arrange them into a multiple line form, where each line corresponds to a power of t, and each column is for a specific coefficient. First, we expand the function:

    - - -

    And then we (trivially) rearrange the terms across multiple lines:

    - - -

    This rearrangement has "factors of t" at each row (the first row is t⁰, i.e. "1", the second row is t¹, i.e. "t", the third row is t²) and "coefficient" at each column (the first column is all terms involving "a", the second all terms involving "b", the third all terms involving "c").

    -

    With that arrangement, we can easily decompose this as a matrix multiplication:

    - - -

    We can do the same for the cubic curve, of course. We know the base function for cubics:

    - - -

    So we write out the expansion and rearrange:

    - - -

    Which we can then decompose:

    - - -

    And, of course, we can do this for quartic curves too (skipping the expansion step):

    - - -

    And so and on so on. Now, let's see how to use these T, M, and C, to do some curve fitting.

    -
    + +

    + And so and on so on. Now, let's see how to use these T, M, and C, to do some curve + fitting. +

    + -

    Let's get started: we're going to assume we picked the right order curve: for n points we're fitting an n-1th order curve, so we "start" with a vector P that represents the coordinates we already know, and for which we want to do curve fitting:

    - - -

    Next, we need to figure out appropriate t values for each point in the curve, because we need something that lets us tie "the actual coordinate" to "some point on the curve". There's a fair number of different ways to do this (and a large part of optimizing "the perfect fit" is about picking appropriate t values), but in this case let's look at two "obvious" choices:

    -
      -
    1. equally spaced t values, and
    2. -
    3. t values that align with distance along the polygon.
    4. -
    -

    The first one is really simple: if we have n points, then we'll just assign each point i a t value of (i-1)/(n-1). So if we have four points, the first point will have t=(1-1)/(4-1)=0/3, the second point will have t=(2-1)/(4-1)=1/3, the third point will have t=2/3, and the last point will be t=1. We're just straight up spacing the t values to match the number of points we have.

    -

    The second one is a little more interesting: since we're doing polynomial regression, we might as well exploit the fact that our base coordinates just constitute a collection of line segments. At the first point, we're fixing t=0, and the last point, we want t=1, and anywhere in between we're simply going to say that t is equal to the distance along the polygon, scaled to the [0,1] domain.

    -

    To get these values, we first compute the general "distance along the polygon" matrix:

    - - -

    Where length() is literally just that: the length of the line segment between the point we're looking at, and the previous point. This isn't quite enough, of course: we still need to make sure that all the values between i=1 and i=n fall in the [0,1] interval, so we need to scale all values down by whatever the total length of the polygon is:

    - - -

    And now we can move on to the actual "curve fitting" part: what we want is a function that lets us compute "ideal" control point values such that if we build a Bézier curve with them, that curve passes through all our original points. Or, failing that, have an overall error distance that is as close to zero as we can get it. So, let's write out what the error distance looks like.

    -

    As mentioned before, this function is really just "the distance between the actual coordinate, and the coordinate that the curve evaluates to for the associated t value", which we'll square to get rid of any pesky negative signs:

    - - -

    Since this function only deals with individual coordinates, we'll need to sum over all coordinates in order to get the full error function. So, we literally just do that; the total error function is simply the sum of all these individual errors:

    - - -

    And here's the trick that justifies using matrices: while we can work with individual values using calculus, with matrices we can compute as many values as we make our matrices big, all at the "same time", We can replace the individual terms pi with the full P coordinate matrix, and we can replace Bézier(si) with the matrix representation T x M x C we talked about before, which gives us:

    - - -

    In which we can replace the rather cumbersome "squaring" operation with a more conventional matrix equivalent:

    - - -

    Here, the letter T is used instead of the number 2, to represent the matrix transpose; each row in the original matrix becomes a column in the transposed matrix instead (row one becomes column one, row two becomes column two, and so on).

    -

    This leaves one problem: T isn't actually the matrix we want: we don't want symbolic t values, we want the actual numerical values that we computed for S, so we need to form a new matrix, which we'll call 𝕋, that makes use of those, and then use that 𝕋 instead of T in our error function:

    - - -

    Which, because of the first and last values in S, means:

    - - -

    Now we can properly write out the error function as matrix operations:

    - - -

    So, we have our error function: we now need to figure out the expression for where that function has minimal value, e.g. where the error between the true coordinates and the coordinates generated by the curve fitting is smallest. Like in standard calculus, this requires taking the derivative, and determining where that derivative is zero:

    - - -
    + +
    +

    Where did this derivative come from?

    +

    + That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I + straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know. +

    +

    + So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific + case, I + posted a question + to Math.stackexchange, and received a answer that goes into way more detail than I had + hoped to receive. +

    +

    + Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean + maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that + true?". There are answers. They might just require some time to come to understand. +

    +
    -

    Where did this derivative come from?

    -

    That... is a good question. In fact, when trying to run through this approach, I ran into the same question! And you know what? I straight up had no idea. I'm decent enough at calculus, I'm decent enough at linear algebra, and I just don't know.

    -

    So I did what I always do when I don't understand something: I asked someone to help me understand how things work. In this specific case, I posted a question to Math.stackexchange, and received a answer that goes into way more detail than I had hoped to receive.

    -

    Is that answer useful to you? Probably: no. At least, not unless you like understanding maths on a recreational level. And I do mean maths in general, not just basic algebra. But it does help in giving us a reference in case you ever wonder "Hang on. Why was that true?". There are answers. They might just require some time to come to understand.

    -
    - -

    Now, given the above derivative, we can rearrange the terms (following the rules of matrix algebra) so that we end up with an expression for C:

    - - -

    Here, the "to the power negative one" is the notation for the matrix inverse. But that's all we have to do: we're done. Starting with P and inventing some t values based on the polygon the coordinates in P define, we can compute the corresponding Bézier coordinates C that specify a curve that goes through our points. Or, if it can't go through them exactly, as near as possible.

    -

    So before we try that out, how much code is involved in implementing this? Honestly, that answer depends on how much you're going to be writing yourself. If you already have a matrix maths library available, then really not that much code at all. On the other hand, if you are writing this from scratch, you're going to have to write some utility functions for doing your matrix work for you, so it's really anywhere from 50 lines of code to maybe 200 lines of code. Not a bad price to pay for being able to fit curves to pre-specified coordinates.

    -

    So let's try it out! The following graphic lets you place points, and will start computing exact-fit curves once you've placed at least three. You can click for more points, and the code will simply try to compute an exact fit using a Bézier curve of the appropriate order. Four points? Cubic Bézier. Five points? Quartic. And so on. Of course, this does break down at some point: depending on where you place your points, it might become mighty hard for the fitter to find an exact fit, and things might actually start looking horribly off once there's enough points for compound floating point rounding errors to start making a difference (which is around 10~11 points).

    - - - 脚本被禁用。显示备用图片。 - - - - - - -

    You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be able to freely manipulate them and see what the effect on your curve is.

    - -
    -
    -

    Bézier curves and Catmull-Rom curves

    -

    Taking an excursion to different splines, the other common design curve is the Catmull-Rom spline, which unlike Bézier curves pass through each control point, so they offer a kind of "built-in" curve fitting.

    -

    In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what looks like a single Catmull-Rom curve is actually a spline: a single curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is defined by the point's coordinates, and the tangent for those points, the latter of which can trivially be derived from knowing the previous and next point:

    - + +

    + You'll note there is a convenient "toggle" buttons that lets you toggle between equidistant t values, and distance ratio + along the polygon formed by the points. Arguably more interesting is that once you have points to abstract a curve, you also get + direct control over the time values through sliders for each, because if the time values are our degree of freedom, you should be + able to freely manipulate them and see what the effect on your curve is. +

    +
    +
    +

    + + Bézier curves and Catmull-Rom curves +

    +

    + Taking an excursion to different splines, the other common design curve is the + Catmull-Rom spline, which unlike Bézier curves + pass through each control point, so they offer a kind of "built-in" curve fitting. +

    +

    + In fact, let's start with just playing with one: the following graphic has a predefined curve that you manipulate the points for, and lets + you add points by clicking/tapping the background, as well as let you control "how fast" the curve passes through its point using the + tension slider. The tenser the curve, the more the curve tends towards straight lines from one point to the next. +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    + Now, it may look like Catmull-Rom curves are very different from Bézier curves, because these curves can get very long indeed, but what + looks like a single Catmull-Rom curve is actually a spline: a single + curve built up of lots of identically-computed pieces, similar to if you just took a whole bunch of Bézier curves, placed them end to end, + and lined up their control points so that things look like a single curve. For a Catmull-Rom curve, each "piece" between two points is + defined by the point's coordinates, and the tangent for those points, the latter of which + can trivially be derived from knowing the + previous and next point: +

    + - -

    One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on that in the next section.

    -

    In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of variations that make things considerable more complex):

    + +

    + One downside of this is that—as you may have noticed from the graphic—the first and last point of the overall curve don't actually join up + with the rest of the curve: they don't have a previous/next point respectively, and so there is no way to calculate what their tangent + should be. Which also makes it rather tricky to fit a Catmull-Rom curve to three points like we were able to do for Bézier curves. More on + that in the next section. +

    +

    + In fact, before we move on, let's look at how to actually draw the basic form of these curves (I say basic, because there are a number of + variations that make things + considerable more + complex): +

    - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Now, since a Catmull-Rom curve is a form of cubic Hermite spline, and as cubic Bézier curves are also a form of cubic Hermite spline, we run into an interesting bit of maths programming: we can convert one to the other and back, and the maths for doing so is surprisingly simple!

    -

    The main difference between Catmull-Rom curves and Bézier curves is "what the points mean":

    -
      -
    • A cubic Bézier curve is defined by a start point, a control point that implies the tangent at the start, a control point that implies the tangent at the end, and an end point, plus a characterizing matrix that we can multiply by that point vector to get on-curve coordinates.
    • -
    • A Catmull-Rom curve is defined by a start point, a tangent that for that starting point, an end point, and a tangent for that end point, plus a characteristic matrix that we can multiple by the point vector to get on-curve coordinates.
    • -
    -

    Those are very similar, so let's see exactly how similar they are. We've already see the matrix form for Bézier curves, so how different is the matrix form for Catmull-Rom curves?:

    - - -

    That's pretty dang similar. So the question is: how can we convert that expression with Catmull-Rom matrix and vector into an expression of the Bézier matrix and vector? The short answer is of course "by using linear algebra", but the longer answer is the rest of this section, and involves some maths that you may not even care for: if you just want to know the (incredibly simple) conversions between the two curve forms, feel free to skip to the end of the following explanation, but if you want to how we can get one from the other... let's get mathing!

    -
    - -

    Deriving the conversion formulae

    -

    In order to convert between Catmull-Rom curves and Bézier curves, we need to know two things. Firstly, how to express the Catmull-Rom curve using a "set of four coordinates", rather than a mix of coordinates and tangents, and secondly, how to convert those Catmull-Rom coordinates to and from Bézier form.

    -

    We start with the first part, to figure out how we can go from Catmull-Rom V coordinates to Bézier P coordinates, by applying "some matrix T". We don't know what that T is yet, but we'll get to that:

    - - -

    So, this mapping says that in order to map a Catmull-Rom "point + tangent" vector to something based on an "all coordinates" vector, we need to determine the mapping matrix such that applying T yields P2 as start point, P3 as end point, and two tangents based on the lines between P1 and P3, and P2 nd P4, respectively.

    -

    Computing T is really more "arranging the numbers":

    - - -

    Thus:

    - - -

    However, we're not quite done, because Catmull-Rom curves have that "tension" parameter, written as τ (a lowercase"tau"), which is a scaling factor for the tangent vectors: the bigger the tension, the smaller the tangents, and the smaller the tension, the bigger the tangents. As such, the tension factor goes in the denominator for the tangents, and before we continue, let's add that tension factor into both our coordinate vector representation, and mapping matrix T:

    - - -

    With the mapping matrix properly done, let's rewrite the "point + tangent" Catmull-Rom matrix form to a matrix form in terms of four coordinates, and see what we end up with:

    - - -

    Replace point/tangent vector with the expression for all-coordinates:

    - - -

    and merge the matrices:

    - - -

    This looks a lot like the Bézier matrix form, which as we saw in the chapter on Bézier curves, should look like this:

    - - -

    So, if we want to express a Catmull-Rom curve using a Bézier curve, we'll need to turn this Catmull-Rom bit:

    - - -

    Into something that looks like this:

    - - -

    And the way we do that is with a fairly straight forward bit of matrix rewriting. We start with the equality we need to ensure:

    - - -

    Then we remove the coordinate vector from both sides without affecting the equality:

    - - -

    Then we can "get rid of" the Bézier matrix on the right by left-multiply both with the inverse of the Bézier matrix:

    - - -

    A matrix times its inverse is the matrix equivalent of 1, and because "something times 1" is the same as "something", so we can just outright remove any matrix/inverse pair:

    - - -

    And now we're basically done. We just multiply those two matrices and we know what V is:

    - - -

    We now have the final piece of our function puzzle. Let's run through each step.

    -
      -
    1. Start with the Catmull-Rom function:
    2. -
    - - -
      -
    1. rewrite to pure coordinate form:
    2. -
    - - -
      -
    1. rewrite for "normal" coordinate vector:
    2. -
    - - -
      -
    1. merge the inner matrices:
    2. -
    - - -
      -
    1. rewrite for Bézier matrix form:
    2. -
    - - -
      -
    1. and transform the coordinates so we have a "pure" Bézier expression:
    2. -
    - - -

    And we're done: we finally know how to convert these two curves!

    -
    + +

    And we're done: we finally know how to convert these two curves!

    + -

    If we have a Catmull-Rom curve defined by four coordinates P1 through P4, then we can draw that curve using a Bézier curve that has the vector:

    - - -

    Similarly, if we have a Bézier curve defined by four coordinates P1 through P4, we can draw that using a standard tension Catmull-Rom curve with the following coordinate values:

    - - -

    Or, if your API allows you to specify Catmull-Rom curves using plain coordinates:

    - - - -
    -
    -

    Creating a Catmull-Rom curve from three points

    -

    Much shorter than the previous section: we saw that Catmull-Rom curves need at least 4 points to draw anything sensible, so how do we create a Catmull-Rom curve from three points?

    -

    Short and sweet: we don't.

    -

    We run through the maths that lets us create a cubic Bézier curve, and then convert its coordinates to Catmull-Rom form using the conversion formulae we saw above.

    - -
    -
    -

    Forming poly-Bézier curves

    -

    Much like lines can be chained together to form polygons, Bézier curves can be chained together to form poly-Béziers, and the only trick required is to make sure that:

    -
      -
    1. the end point of each section is the starting point of the following section, and
    2. -
    3. the derivatives across that dual point line up.
    4. -
    -

    Unless you want sharp corners, of course. Then you don't even need 2.

    -

    We'll cover three forms of poly-Bézier curves in this section. First, we'll look at the kind that just follows point 1. where the end point of a segment is the same point as the start point of the next segment. This leads to poly-Béziers that are pretty hard to work with, but they're the easiest to implement:

    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -

    Dragging the control points around only affects the curve segments that the control point belongs to, and moving an on-curve point leaves the control points where they are, which is not the most useful for practical modelling purposes. So, let's add in the logic we need to make things a little better. We'll start by linking up control points by ensuring that the "incoming" derivative at an on-curve point is the same as it's "outgoing" derivative:

    - - -

    We can effect this quite easily, because we know that the vector from a curve's last control point to its last on-curve point is equal to the derivative vector. If we want to ensure that the first control point of the next curve matches that, all we have to do is mirror that last control point through the last on-curve point. And mirroring any point A through any point B is really simple:

    - - -

    So let's implement that and see what it gets us. The following two graphics show a quadratic and a cubic poly-Bézier curve again, but this time moving the control points around moves others, too. However, you might see something unexpected going on for quadratic curves...

    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -

    As you can see, quadratic curves are particularly ill-suited for poly-Bézier curves, as all the control points are effectively linked. Move one of them, and you move all of them. Not only that, but if we move the on-curve points, it's possible to get a situation where a control point cannot satisfy the constraint that it's the reflection of its two neighbouring control points... This means that we cannot use quadratic poly-Béziers for anything other than really, really simple shapes. And even then, they're probably the wrong choice. Cubic curves are pretty decent, but the fact that the derivatives are linked means we can't manipulate curves as well as we might if we relaxed the constraints a little.

    -

    So: let's relax the requirement a little.

    -

    We can change the constraint so that we still preserve the angle of the derivatives across sections (so transitions from one section to the next will still look natural), but give up the requirement that they should also have the same vector length. Doing so will give us a much more useful kind of poly-Bézier curve:

    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -

    Cubic curves are now better behaved when it comes to dragging control points around, but the quadratic poly-Bézier still has the problem that moving one control points will move the control points and may ending up defining "the next" control point in a way that doesn't work. Quadratic curves really aren't very useful to work with...

    -

    Finally, we also want to make sure that moving the on-curve coordinates preserves the relative positions of the associated control points. With that, we get to the kind of curve control that you might be familiar with from applications like Photoshop, Inkscape, Blender, etc.

    - - - 脚本被禁用。显示备用图片。 - - - - - - 脚本被禁用。显示备用图片。 - - - -

    Again, we see that cubic curves are now rather nice to work with, but quadratic curves have a new, very serious problem: we can move an on-curve point in such a way that we can't compute what needs to "happen next". Move the top point down, below the left and right points, for instance. There is no way to preserve correct control points without a kink at the bottom point. Quadratic curves: just not that good...

    -

    A final improvement is to offer fine-level control over which points behave which, so that you can have "kinks" or individually controlled segments when you need them, with nicely well-behaved curves for the rest of the path. Implementing that, is left as an exercise for the reader.

    - -
    -
    -

    Curve offsetting

    -

    Perhaps you're like me, and you've been writing various small programs that use Bézier curves in some way or another, and at some point you make the step to implementing path extrusion. But you don't want to do it pixel based; you want to stay in the vector world. You find that extruding lines is relatively easy, and tracing outlines is coming along nicely (although junction caps and fillets are a bit of a hassle), and then you decide to do things properly and add Bézier curves to the mix. Now you have a problem.

    -

    Unlike lines, you can't simply extrude a Bézier curve by taking a copy and moving it around, because of the curvatures; rather than a uniform thickness, you get an extrusion that looks too thin in places, if you're lucky, but more likely will self-intersect. The trick, then, is to scale the curve, rather than simply copying it. But how do you scale a Bézier curve?

    -

    Bottom line: you can't. So you cheat. We're not going to do true curve scaling, or rather curve offsetting, because that's impossible. Instead we're going to try to generate 'looks good enough' offset curves.

    -
    - -

    "What do you mean, you can't? Prove it."

    -

    First off, when I say "you can't," what I really mean is "you can't offset a Bézier curve with another Bézier curve", not even by using a really high order curve. You can find the function that describes the offset curve, but it won't be a polynomial, and as such it cannot be represented as a Bézier curve, which has to be a polynomial. Let's look at why this is:

    -

    From a mathematical point of view, an offset curve O(t) is a curve such that, given our original curve B(t), any point on O(t) is a fixed distance d away from coordinate B(t). So let's math that:

    - - -

    However, we're working in 2D, and d is a single value, so we want to turn it into a vector. If we want a point distance d "away" from the curve B(t) then what we really mean is that we want a point at d times the "normal vector" from point B(t), where the "normal" is a vector that runs perpendicular ("at a right angle") to the tangent at B(t). Easy enough:

    - - -

    Now this still isn't very useful unless we know what the formula for N(t) is, so let's find out. N(t) runs perpendicular to the original curve tangent, and we know that the tangent is simply B'(t), so we could just rotate that 90 degrees and be done with it. However, we need to ensure that N(t) has the same magnitude for every t, or the offset curve won't be at a uniform distance, thus not being an offset curve at all. The easiest way to guarantee this is to make sure N(t) always has length 1, which we can achieve by dividing B'(t) by its magnitude:

    - - -

    Determining the length requires computing an arc length, and this is where things get Tricky with a capital T. First off, to compute arc length from some start a to end b, we must use the formula we saw earlier. Noting that "length" is usually denoted with double vertical bars:

    - - -

    So if we want the length of the tangent, we plug in B'(t), with t = 0 as start and -t = 1 as end:

    - - -

    And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is screwing everything up, because it turns our nice polynomials into things that are no longer polynomials.

    -

    There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine because they do, but they don't; remember that only the base function has binomial coefficients. That's before we factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call them circles, and they have much simpler functions than Bézier curves.

    -

    So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve.

    -

    And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the real offset curve would look like, if we could compute it.

    -
    + +

    + And that's where things go wrong. It doesn't even really matter what the second derivative for B(t) is, that square root is + screwing everything up, because it turns our nice polynomials into things that are no longer polynomials. +

    +

    + There is a small class of polynomials where the square root is also a polynomial, but they're utterly useless to us: any polynomial with + unweighted binomial coefficients has a square root that is also a polynomial. Now, you might think that Bézier curves are just fine + because they do, but they don't; remember that only the base function has binomial coefficients. That's before we + factor in our coordinates, which turn it into a non-binomial polygon. The only way to make sure the functions stay binomial is to make + all our coordinates have the same value. And that's not a curve, that's a point. We can already create offset curves for points, we call + them circles, and they have much simpler functions than Bézier curves. +

    +

    + So, since the tangent length isn't a polynomial, the normalised tangent won't be a polynomial either, which means + N(t) won't be a polynomial, which means that d times N(t) won't be a polynomial, which means + that, ultimately, O(t) won't be a polynomial, which means that even if we can determine the function for + O(t) just fine (and that's far from trivial!), it simply cannot be represented as a Bézier curve. +

    +

    + And that's one reason why Bézier curves are tricky: there are actually a lot of curves that cannot be represented as a Bézier + curve at all. They can't even model their own offset curves. They're weird that way. So how do all those other programs do it? Well, + much like we're about to do, they cheat. We're going to approximate an offset curve in a way that will look relatively close to what the + real offset curve would look like, if we could compute it. +

    + -

    So, you cannot offset a Bézier curve perfectly with another Bézier curve, no matter how high-order you make that other Bézier curve. However, we can chop up a curve into "safe" sub-curves (where "safe" means that all the control points are always on a single side of the baseline, and the midpoint of the curve at t=0.5 is roughly in the center of the polygon defined by the curve coordinates) and then point-scale each sub-curve with respect to its scaling origin (which is the intersection of the point normals at the start and end points).

    -

    A good way to do this reduction is to first find the curve's extreme points, as explained in the earlier section on curve extremities, and use these as initial splitting points. After this initial split, we can check each individual segment to see if it's "safe enough" based on where the center of the curve is. If the on-curve point for t=0.5 is too far off from the center, we simply split the segment down the middle. Generally this is more than enough to end up with safe segments.

    -

    The following graphics show off curve offsetting, and you can use the slider to control the distance at which the curve gets offset. The curve first gets reduced to safe segments, each of which is then offset at the desired distance. Especially for simple curves, particularly easily set up for quadratic curves, no reduction is necessary, but the more twisty the curve gets, the more the curve needs to be reduced in order to get segments that can safely be scaled.

    - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - -

    You may notice that this may still lead to small 'jumps' in the sub-curves when moving the curve around. This is caused by the fact that we're still performing a naive form of offsetting, moving the control points the same distance as the start and end points. If the curve is large enough, this may still lead to incorrect offsets.

    - -
    -
    -

    Graduated curve offsetting

    -

    What if we want to do graduated offsetting, starting at some distance s but ending at some other distance e? Well, if we can compute the length of a curve (which we can if we use the Legendre-Gauss quadrature approach) then we can also determine how far "along the line" any point on the curve is. With that knowledge, we can offset a curve so that its offset curve is not uniformly wide, but graduated between with two different offset widths at the start and end.

    -

    Like normal offsetting we cut up our curve in sub-curves, and then check at which distance along the original curve each sub-curve starts and ends, as well as to which point on the curve each of the control points map. This gives us the distance-along-the-curve for each interesting point in the sub-curve. If we call the total length of all sub-curves seen prior to seeing "the current" sub-curve S (and if the current sub-curve is the first one, S is zero), and we call the full length of our original curve L, then we get the following graduation values:

    -
      -
    • start: map S from interval (0,L) to interval (s,e)
    • -
    • c1: map(<strong>S+d1</strong>, 0,L, s,e), d1 = distance along curve to projection of c1
    • -
    • c2: map(<strong>S+d2</strong>, 0,L, s,e), d2 = distance along curve to projection of c2
    • -
    • ...
    • -
    • end: map(<strong>S+length(subcurve)</strong>, 0,L, s,e)
    • -
    -

    At each of the relevant points (start, end, and the projections of the control points onto the curve) we know the curve's normal, so offsetting is simply a matter of taking our original point, and moving it along the normal vector by the offset distance for each point. Doing so will give us the following result (these have with a starting width of 0, and an end width of 40 pixels, but can be controlled with your up and down arrow keys):

    - - - 脚本被禁用。显示备用图片。 - - - - - - - - 脚本被禁用。显示备用图片。 - - - - - - -
    -
    -

    Circles and quadratic Bézier curves

    -

    Circles and Bézier curves are very different beasts, and circles are infinitely easier to work with than Bézier curves. Their formula is much simpler, and they can be drawn more efficiently. But, sometimes you don't have the luxury of using circles, or ellipses, or arcs. Sometimes, all you have are Bézier curves. For instance, if you're doing font design, fonts have no concept of geometric shapes, they only know straight lines, and Bézier curves. OpenType fonts with TrueType outlines only know quadratic Bézier curves, and OpenType fonts with Type 2 outlines only know cubic Bézier curves. So how do you draw a circle, or an ellipse, or an arc?

    -

    You approximate.

    -

    We already know that Bézier curves cannot model all curves that we can think of, and this includes perfect circles, as well as ellipses, and their arc counterparts. However, we can certainly approximate them to a degree that is visually acceptable. Quadratic and cubic curves offer us different curvature control, so in order to approximate a circle we will first need to figure out what the error is if we try to approximate arcs of increasing degree with quadratic and cubic curves, and where the coordinates even lie.

    -

    Since arcs are mid-point-symmetrical, we need the control points to set up a symmetrical curve. For quadratic curves this means that the control point will be somewhere on a line that intersects the baseline at a right angle. And we don't get any choice on where that will be, since the derivatives at the start and end point have to line up, so our control point will lie at the intersection of the tangents at the start and end point.

    -

    First, let's try to fit the quadratic curve onto a circular arc. In the following sketch you can move the mouse around over a unit circle, to see how well, or poorly, a quadratic curve can approximate the arc from (1,0) to where your mouse cursor is:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    As you can see, things go horribly wrong quite quickly; even trying to approximate a quarter circle using a quadratic curve is a bad idea. An eighth of a turns might look okay, but how okay is okay? Let's apply some maths and find out. What we're interested in is how far off our on-curve coordinates are with respect to a circular arc, given a specific start and end angle. We'll be looking at how much space there is between the circular arc, and the quadratic curve's midpoint.

    -

    We start out with our start and end point, and for convenience we will place them on a unit circle (a circle around 0,0 with radius 1), at some angle φ:

    - - -

    What we want to find is the intersection of the tangents, so we want a point C such that:

    - - -

    i.e. we want a point that lies on the vertical line through S (at some distance a from S) and also lies on the tangent line through E (at some distance b from E). Solving this gives us:

    - - -

    First we solve for b:

    - - -

    which yields:

    - - -

    which we can then substitute in the expression for a:

    - - -

    A quick check shows that plugging these values for a and b into the expressions for Cx and Cy give the same x/y coordinates for both "a away from A" and "b away from B", so let's continue: now that we know the coordinate values for C, we know where our on-curve point T for t=0.5 (or angle φ/2) is, because we can just evaluate the Bézier polynomial, and we know where the circle arc's actual point P is for angle φ/2:

    - - -

    We compute T, observing that if t=0.5, the polynomial values (1-t)², 2(1-t)t, and t² are 0.25, 0.5, and 0.25 respectively:

    - - -

    Which, worked out for the x and y components, gives:

    - - -

    And the distance between these two is the standard Euclidean distance:

    - - -

    So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter circle and eighth circle?

    -
    - - plotted for 0 ≤ φ ≤ π: - - - plotted for 0 ≤ φ ≤ ½π: - - - - - plotted for 0 ≤ φ ≤ ¼π: -
    + +

    + So, what does this distance function look like when we plot it for a number of ranges for the angle φ, such as a half circle, quarter + circle and eighth circle? +

    + + + + + + + + +
    + + plotted for 0 ≤ φ ≤ π: + + + plotted for 0 ≤ φ ≤ ½π: + + + + + plotted for 0 ≤ φ ≤ ¼π: +
    -

    We now see why the eighth circle arc looks decent, but the quarter circle arc doesn't: an error of roughly 0.06 at t=0.5 means we're 6% off the mark... we will already be off by one pixel on a circle with pixel radius 17. Any decent sized quarter circle arc, say with radius 100px, will be way off if approximated by a quadratic curve! For the eighth circle arc, however, the error is only roughly 0.003, or 0.3%, which explains why it looks so close to the actual eighth circle arc. In fact, if we want a truly tiny error, like 0.001, we'll have to contend with an angle of (rounded) 0.593667, which equates to roughly 34 degrees. We'd need 11 quadratic curves to form a full circle with that precision! (technically, 10 and ten seventeenth, but we can't do partial curves, so we have to round up). That's a whole lot of curves just to get a shape that can be drawn using a simple function!

    -

    In fact, let's flip the function around, so that if we plug in the precision error, labelled ε, we get back the maximum angle for that precision:

    - - -

    And frankly, things are starting to look a bit ridiculous at this point, we're doing way more maths than we've ever done, but thankfully this is as far as we need the maths to take us: If we plug in the precisions 0.1, 0.01, 0.001 and 0.0001 we get the radians values 1.748, 1.038, 0.594 and 0.3356; in degrees, that means we can cover roughly 100 degrees (requiring four curves), 59.5 degrees (requiring six curves), 34 degrees (requiring 11 curves), and 19.2 degrees (requiring a whopping nineteen curves).

    -

    The bottom line? Quadratic curves are kind of lousy if you want circular (or elliptical, which are circles that have been squashed in one dimension) curves. We can do better, even if it's just by raising the order of our curve once. So let's try the same thing for cubic curves.

    - -
    -
    -

    Circular arcs and cubic Béziers

    -

    Let's look at approximating circles and circular arcs using cubic Béziers. How much better is that?

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    At cursory glance, a fair bit better, but let's find out how much better by looking at how to construct the Bézier curve.

    -

    A construction diagram for a cubic approximation of a circular arc

    -

    The start and end points are trivial, but the mid point requires a bit of work, but it's mostly basic trigonometry once we know the angle θ for our circular arc: if we scale our circular arc to a unit circle, we can always start our arc, with radius 1, at (1,0) and then given our arc angle θ, we also know that the circular arc has length θ (because unit circles are nice that way). We also know our end point, because that's just (cos(θ), sin(θ)), and so the challenge is to figure out what control points we need in order for the curve at t=0.5 to exactly touch the circular arc at the angle θ/2:

    -

    So let's again formally describe this:

    - - -

    Only P3 isn't quite straight-forward here, and its description is based on the fact that the triangle (origin, P4, P3) is a right angled triangle, with the distance between the origin and P4 being 1 (because we're working with a unit circle), and the distance between P4 and P3 being k, so that we can represent P3 as "The point P4 plus the vector from the origin to P4 but then rotated a quarter circle, counter-clockwise, and scaled by k".

    -

    With that, we can determine the y-coordinates for A, B, e1, and e2, after which we have all the information we need to determine what the value of k is. We can find these values by using (no surprise here) linear interpolation between known points, as A is midway between P2 and P3, e1 is between A and "midway between P1 and P2" (which is "half height" P2), and so forth:

    - - -

    Which now gives us two identities for B, because in addition to determining B through linear interpolation, we also know that B's y coordinate is just sin(θ/2): we started this exercise by saying we were going to approximate the circular arc using a Bézier curve that had its midpoint, which is point B, touching the unit circle at the arc's half-angle, by definition making B the point at (cos(θ/2), sin(θ/2)).

    -

    This means we can equate the two identities we now have for By and solve for k.

    -
    - -

    Deriving k

    -

    Solving for k is fairly straight forward, but it's a fair few steps, and if you just the immediate result: using a tool like Wolfram Alpha is definitely the way to go. That said, let's get going:

    - - -

    And finally, we can take further advantage of several trigonometric identities to drastically simplify our formula for k:

    - - -

    And we're done.

    -
    + +

    And we're done.

    + -

    So, the distance of our control points to the start/end points can be expressed as a number that we get from an almost trivial expression involving the circular arc's angle:

    - - -

    Which means that for any circular arc with angle θ and radius r, our Bézier approximation based on three points of incidence is:

    - - -

    Which also gives us the commonly found value of 0.55228 for quarter circles, based on them having an angle of half π:

    - - -

    And thus giving us the following Bézier coordinates for a quarter circle of radius r:

    - - -
    + +
    +

    So, how accurate is this?

    +

    + Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points + that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum + deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but + rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we + get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff. +

    +

    + So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, + finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around + that t value, repeating as many times as necessary to get us an arbitrarily precise value of t: +

    -

    So, how accurate is this?

    -

    Unlike for the quadratic curve, we can't use t=0.5 as our reference point because by its very nature it's one of the three points that are actually guaranteed to be on the circular arc itself. Instead, we need a different t value that will give us the maximum deflection - there are two possible choices (as our curve is still strictly "overshoots" the circular arc, and it's symmetrical) but rather than trying to use calculus to find the perfect t value—which we could! the maths is perfectly reasonable as long as we get to use computers—we can also just perform a binary search for the biggest deflection and not bother with all this maths stuff.

    -

    So let's do that instead: we can run a maximum deflection check that just runs through t from 0 to 1 at some coarse interval, finds a t value that has "the highest deflection of the bunch", then reruns the same check with a much smaller interval around that t value, repeating as many times as necessary to get us an arbitrarily precise value of t:

    - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    - -

    Plus, how often do you get to write a function with that name?

    -

    Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity first, and then coming back down as we approach 2π)

    - - -
    - - - - - -
    - error plotted for 0 ≤ φ ≤ 2π - - error plotted for 0 ≤ φ ≤ π - - error plotted for 0 ≤ φ ≤ ½π -
    + return getMostWrongT(radius, bezier, worst_t - stepsize, worst_t + stepsize) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + -

    That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again:

    -

    +

    Plus, how often do you get to write a function with that name?

    +

    + Using this code, we find that our t values are approximately 0.211325 and 0.788675, so let's pick the lower of the two and see + what the maximum deflection is across our domain of angles, with the original quadratic error show in green (rocketing off to infinity + first, and then coming back down as we approach 2π) +

    + + + + + + + + + + + + + +
    + + + + + +
    + error plotted for 0 ≤ φ ≤ 2π + + error plotted for 0 ≤ φ ≤ π + + error plotted for 0 ≤ φ ≤ ½π +
    -

    Yeah... the error of a cubic approximation for a quarter circle turns out to be two orders of magnitude better. At approximately 0.00027 (or: just shy of being 2.7 pixels off for a circle with a radius of 10,000 pixels) the increase in precision over quadratic curves is quite spectacular - certainly good enough that no one in their right mind should ever use quadratic curves.

    -
    +

    + That last image is probably not quite clear enough: the cubic approximation of a quarter circle is so incredibly much better that we + can't even really see it at the same scale of our quadratic curve. Let's scale the y-axis a little, and try that again: +

    +

    -

    So that's it, kappa is 4/3 · tan(θ/4) , we're done! ...or are we?

    -

    Can we do better?

    -

    Technically: yes, we can. But I'm going to prefix this section with "we can, and we should investigate that possibility, but let me warn you up front that the result is only better if we're going to hard-code the values". We're about to get into the weeds and the standard three-points-of-incidence value is so good already that for most applications, trying to do better won't make any sense at all.

    -

    So with that said: what we calculated above is an upper bound for a best fit Bézier curve for a circular arc: anywhere we don't touch the circular arc in our approximation, we've "overshot" the arc. What if we dropped our value for k just a little, so that the curve starts out as an over-estimation, but then crosses the circular arc, yielding an region of underestimation, and then crosses the circular arc again, with another region of overestimation. This might give us a lower overall error, so let's see what we can do.

    -

    First, let's express the total error (given circular arc angle θ, and some k) using standard calculus notation:

    - - -

    This says that the error function for a given angle and value of k is equal to the "infinite" sum of differences between our curve and the circular arc, as we run t from 0 to 1, using an infinitely small step size. between subsequent t values.

    -

    Now, since we want to find the minimal error, that means we want to know where along this function things go from "error is getting progressively less" to "error is increasing again", which means we want to know where its derivative is zero, which as mathematical expression looks like:

    - - -

    And here we have the most direct application of the Fundamental Theorem of Calculus: the derivative and integral are each other's inverse operations, so they cancel out, leaving us with our original function:

    - - -

    And now we just solve for that... oh wait. We've seen this before. In order to solve this, we'd end up needing to solve this:

    - - -

    And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section on arc lengths—there is no symbolic solution for this equasion. Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer!

    -
    + +

    + And both of those terms on the left of the equal sign are 6th degree polynomials, which means—as we've covered in the section + on arc lengths—there is no symbolic solution for this equasion. + Instead, we'll have to use a numerical approach to find the solutions here, so... to the computer! +

    +
    +

    Iterating on a solution

    +

    + By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the + value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's + center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just + binary search our way to the most accurate value for c that gets us that middle case. +

    +

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    -

    Iterating on a solution

    -

    By which I really mean "to the binary search algorithm", because we're dealing with a reasonably well behaved function: depending on the value for k , we're either going to end up with a Bézier curve that's on average "not at distance r from the arc's center", "exactly distance r from the arc's center", or "more than distance r from the arc's center", so we can just binary search our way to the most accurate value for c that gets us that middle case.

    -

    First our setup, where we determine our upper and lower bounds, before entering our binary search:

    - - - - -
    1 - -
    2
    3
    4
    - -

    And then the binary search algorithm, which can be found in pretty much any CS textbook, as well as more online articles, tutorials, and blog posts than you can ever read in a life time:

    + return binarySearch(radius, angle, points, lowerBound, upperBound) + + + + 2 + + + 3 + + + 4 + + - - - - - - - - - - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    11
    12
    13
    14
    15
    16
    17
    18
    19
    - -

    Using the following radialError function, which samples the curve's approximation of the circular arc over several points (although the first and last point will never contribute anything, so we skip them):

    + return binarySearch(radius, angle, points, value, upperBound) + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + + 11 + + + 12 + + + 13 + + + 14 + + + 15 + + + 16 + + + 17 + + + 18 + + + 19 + + - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    - -

    In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a vector, we can get its length to the origin using a magnitude call.

    -

    Examining the result

    -

    Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, for each angle, plot what the difference between the circular arc and the Bézier approximation looks like:

    -

    image-20210419085430711

    -

    Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from 0 to θ:

    -

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    - - -
    - - - - - -
    - max deflection using unit scale - - max deflection at 10x scale - - max deflection at 100x scale -
    + return err + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + -

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    -

    As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference.

    -
    +

    + In this, getOnCurvePoint is just the standard Bézier evaluation function, yielding a point. Treating that point as a + vector, we can get its length to the origin using a magnitude call. +

    +

    Examining the result

    +

    + Running the above code we can get a list of k values associated with a list of angles θ from 0 to π, and we can use that to, + for each angle, plot what the difference between the circular arc and the Bézier approximation looks like: +

    +

    image-20210419085430711

    +

    + Here we see the difference between an arc and its Bézier approximation plotted as we run t from 0 to 1. Just by looking at the + plot we can tell that there is maximum deflection at t = 0.5, so let's plot the maximum deflection "function", for angles from + 0 to θ: +

    +

    In fact, let's plot the maximum deflections for both approaches as a functions over θ:

    + + + + + + + + + + + + + +
    + + + + + +
    + max deflection using unit scale + + max deflection at 10x scale + + max deflection at 100x scale +
    -

    At this point it should be clear that while, yes, there are improvement to be had, they're essentially insignificant while also being much more computationally expensive.

    -

    TL;DR: just tell me which value I should be using

    -

    It depends on what we need to do. If we just want the best value for quarter circles, and we're going to hard code the value for k, then there is no reason to hard-code the constant k=4/3*tan(pi/8) when you can just as easily hard-code the constant as k=0.551784777779014 instead.

    -

    If you need "the" value for quarter circles, use 0.551785 instead of 0.55228

    -

    However, for dynamic arc approximation, in code that tries to fit circular paths using Bézier paths instead, it should be fairly obvious that the simple function involving a tangent computation, two divisions, and one multiplication, is vastly more performant than running all the code we ended writing just to get a 25% lower error value, and most certainly worth preferring over getting the "more accurate" value.

    -

    If you need to fit Béziers to circular arcs on the fly, use 4/3 * tan(θ/4)

    -

    However, always remember that if you're writing for humans, you can typically use the best of both worlds: as the user interacts with their curves, you should draw their curves instead of drawing approximations of them. If they need to draw circles or circular arcs, draw those, and only approximate them with a Bézier curve when the data needs to be exported to a format that doesn't support those. Ideally with a preview mechanism that highlights where the errors will be, and how large they will be.

    -

    If you're writing code for graphics design by humans, use circular arcs for circular arcs

    -

    And that's it. We have pretty well exhausted this subject. There are different metrics we could use to find "different best k values", like trying to match arc length (e.g. when we're optimizing for material cost), or minimizing the area between the circular arc and the Bézier curve (e.g. when we're optimizing for inking), or minimizing the rate of change of the Bézier's curvature (e.g. when we're optimizing for curve traversal) and they all yield values that are so similar that it's almost certainly not worth it. (For instance, for quarter circle approximations those values are 0.551777, 0.5533344, and 0.552184 respectively. Much like the 0.551785 we get from minimizing the maximum deflection, none of these values are significantly better enough to prefer them over the upper bound value).

    +

    That doesn't actually appear to be all that much better, so let's look at some numbers, to see what the improvement actually is:

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    angle"improved" deflection"upper bound" deflectiondifference
    1/8 π6.202833502388927E-86.657161222278773E-84.5432771988984655E-9
    1/4 π3.978021202111215E-64.246252911066506E-62.68231708955291E-7
    3/8 π4.547652269037972E-54.8397483513262785E-52.9209608228830675E-6
    1/2 π2.569196199214696E-42.7251652752280364E-41.559690760133403E-5
    5/8 π9.877526288810667E-40.00104441758597118025.666495709011343E-5
    3/4 π0.002981649786796270.00314556284145806051.6391305466179062E-4
    7/8 π0.00763231828070198850.0080477779099483734.1545962924638413E-4
    π0.0173621859640437080.0183490165195459029.86830555502194E-4
    +

    + As we can see, the increase in precision is not particularly big: for a quarter circle (π/2) the traditional k will be off by + 2.75 pixels on a circle with radius 10,000 pixels, whereas this "better" fit will be off by 2.56 pixels. And while that's certainly an + almost 10% improvement, it's also nowhere near enough of an improvement to make a discernible difference. +

    +
    -
    -
    -

    Approximating Bézier curves with circular arcs

    -

    Let's look at doing the exact opposite of the previous section: rather than approximating circular arc using Bézier curves, let's approximate Bézier curves using circular arcs.

    -

    We already saw in the section on circle approximation that this will never yield a perfect equivalent, but sometimes you need circular arcs, such as when you're working with fabrication machinery, or simple vector languages that understand lines and circles, but not much else.

    -

    The approach is fairly simple: pick a starting point on the curve, and pick two points that are further along the curve. Determine the circle that goes through those three points, and see if it fits the part of the curve we're trying to approximate. Decent fit? Try spacing the points further apart. Bad fit? Try spacing the points closer together. Keep doing this until you've found the "good approximation/bad approximation" boundary, record the "good" arc, and then move the starting point up to overlap the end point we previously found. Rinse and repeat until we've covered the entire curve.

    -

    We already saw how to fit a circle through three points in the section on creating a curve from three points, and finding the arc through those points is straight-forward: pick one of the three points as start point, pick another as an end point, and the arc has to necessarily go from the start point, to the end point, over the remaining point.

    -

    So, how can we convert a Bézier curve into a (sequence of) circular arc(s)?

    -
      -
    • Start at t=0
    • -
    • Pick two points further down the curve at some value m = t + n and e = t + 2n
    • -
    • Find the arc that these points define
    • -
    • Determine how close the found arc is to the curve:
        -
      • Pick two additional points e1 = t + n/2 and e2 = t + n + n/2.
      • -
      • These points, if the arc is a good approximation of the curve interval chosen, should - lie on the circle, so their distance to the center of the circle should be the - same as the distance from any of the three other points to the center.
      • -
      • For point points, determine the (absolute) error between the radius of the circle, and the -actual distance from the center of the circle to the point on the curve.
      • -
      • If this error is too high, we consider the arc bad, and try a smaller interval.
      • -
      -
    • -
    -

    The result of this is shown in the next graphic: we start at a guaranteed failure: s=0, e=1. That's the entire curve. The midpoint is simply at t=0.5, and then we start performing a binary search.

    -
      -
    1. We start with low=0, mid=0.5 and high=1
    2. -
    3. That'll fail, so we retry with the interval halved: {0, 0.25, 0.5}
        -
      • If that arc's good, we move back up by half distance: {0, 0.375, 0.75}.
      • -
      • However, if the arc was still bad, we move down by half the distance: {0, 0.125, 0.25}.
      • -
      -
    4. -
    5. We keep doing this over and over until we have two arcs, in sequence, of which the first arc is good, and the second arc is bad. When we find that pair, we've found the boundary between a good approximation and a bad approximation, and we pick the good arc.
    6. -
    -

    The following graphic shows the result of this approach, with a default error threshold of 0.5, meaning that if an arc is off by a combined half pixel over both verification points, then we treat the arc as bad. This is an extremely simple error policy, but already works really well. Note that the graphic is still interactive, and you can use your up and down arrow keys keys to increase or decrease the error threshold, to see what the effect of a smaller or larger error threshold is.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    With that in place, all that's left now is to "restart" the procedure by treating the found arc's end point as the new to-be-determined arc's starting point, and using points further down the curve. We keep trying this until the found end point is for t=1, at which point we are done. Again, the following graphic allows for up and down arrow key input to increase or decrease the error threshold, so you can see how picking a different threshold changes the number of arcs that are necessary to reasonably approximate a curve:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    So... what is this good for? Obviously, if you're working with technologies that can't do curves, but can do lines and circles, then the answer is pretty straightforward, but what else? There are some reasons why you might need this technique: using circular arcs means you can determine whether a coordinate lies "on" your curve really easily (simply compute the distance to each circular arc center, and if any of those are close to the arc radii, at an angle between the arc start and end, bingo, this point can be treated as lying "on the curve"). Another benefit is that this approximation is "linear": you can almost trivially travel along the arcs at fixed speed. You can also trivially compute the arc length of the approximated curve (it's a bit like curve flattening). The only thing to bear in mind is that this is a lossy equivalence: things that you compute based on the approximation are guaranteed "off" by some small value, and depending on how much precision you need, arc approximation is either going to be super useful, or completely useless. It's up to you to decide which, based on your application!

    - -
    -
    -

    B-Splines

    -

    No discussion on Bézier curves is complete without also giving mention of that other beast in the curve design space: B-Splines. Easily confused to mean Bézier splines, that's not actually what they are; they are "basis function" splines, which makes a lot of difference, and we'll be looking at those differences in this section. We're not going to dive as deep into B-Splines as we have for Bézier curves (that would be an entire primer on its own) but we'll be looking at how B-Splines work, what kind of maths is involved in computing them, and how to draw them based on a number of parameters that you can pick for individual B-Splines.

    -

    First off: B-Splines are piecewise, polynomial interpolation curves, where the "single curve" is built by performing polynomial interpolation over a set of points, using a sliding window of a fixed number of points. For instance, a "cubic" B-Spline defined by twelve points will have its curve built by evaluating the polynomial interpolation of four points, and the curve can be treated as a lot of different sections, each controlled by four points at a time, such that the full curve consists of smoothly connected sections defined by points {1,2,3,4}, {2,3,4,5}, ..., {8,9,10,11}, and finally {9,10,11,12}, for eight sections.

    -

    What do they look like? They look like this! Tap on the graphic to add more points, and move points around to see how they map to the spline curve drawn.

    - - - 脚本被禁用。显示备用图片。 - - - -

    The important part to notice here is that we are not doing the same thing with B-Splines that we do for poly-Béziers or Catmull-Rom curves: both of the latter simply define new sections as literally "new sections based on new points", so a 12 point cubic poly-Bézier curve is actually impossible, because we start with a four point curve, and then add three more points for each section that follows, so we can only have 4, 7, 10, 13, 16, etc. point Poly-Béziers. Similarly, while Catmull-Rom curves can grow by adding single points, this addition of a single point introduces three implicit Bézier points. Cubic B-Splines, on the other hand, are smooth interpolations of each possible curve involving four consecutive points, such that at any point along the curve except for our start and end points, our on-curve coordinate is defined by four control points.

    -

    Consider the difference to be this:

    -
      -
    • for Bézier curves, the curve is defined as an interpolation of points, but:
    • -
    • for B-Splines, the curve is defined as an interpolation of curves.
    • -
    -

    In fact, let's look at that again, but this time with the base curves shown, too. Each consecutive four points define one curve:

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have a look at how things work.

    -

    How to compute a B-Spline curve: some maths

    -

    Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the end, just like for Bézier curves), by evaluating the following function:

    - + +

    + In order to make this interpolation of curves work, the maths is necessarily more complex than the maths for Bézier curves, so let's have + a look at how things work. +

    +

    How to compute a B-Spline curve: some maths

    +

    + Given a B-Spline of degree d and thus order k=d+1 (so a quadratic B-Spline is degree 2 and order 3, a cubic + B-Spline is degree 3 and order 4, etc) and n control points P0 through Pn-1, we can compute a point on the curve for some value t in the interval [0,1] (where 0 is the start of the curve, and 1 the + end, just like for Bézier curves), by evaluating the following function: +

    + - -

    Which, honestly, doesn't tell us all that much. All we can see is that a point on a B-Spline curve is defined as "a mix of all the control points, weighted somehow", where the weighting is achieved through the N(...) function, subscripted with an obvious parameter i, which comes from our summation, and some magical parameter k. So we need to know two things: 1. what does N(t) do, and 2. what is that k? Let's cover both, in reverse order.

    -

    The parameter k represents the "knot interval" over which a section of curve is defined. As we learned earlier, a B-Spline curve is itself an interpolation of curves, and we can treat each transition where a control point starts or stops influencing the total curvature as a "knot on the curve". -Doing so for a degree d B-Spline with n control point gives us d + n + 1 knots, defining d + n intervals along the curve, and it is these intervals that the above k subscript to the N() function applies to.

    -

    Then the N() function itself. What does it look like?

    - - -

    So this is where we see the interpolation: N(t) for an (i,k) pair (that is, for a step in the above summation, on a specific knot interval) is a mix between N(t) for (i,k-1) and N(t) for (i+1,k-1), so we see that this is a recursive iteration where i goes up, and k goes down, so it seem reasonable to expect that this recursion has to stop at some point; obviously, it does, and specifically it does so for the following i/k values:

    - - -

    And this function finally has a straight up evaluation: if a t value lies within a knot-specific interval once we reach a k=1 value, it "counts", otherwise it doesn't. We did cheat a little, though, because for all these values we need to scale our t value first, so that it lies in the interval bounded by knots[d] and knots[n], which are the start point and end point where curvature is controlled by exactly order control points. For instance, for degree 3 (=order 4) and 7 control points, with knot vector [1,2,3,4,5,6,7,8,9,10,11], we map t from [the interval 0,1] to the interval [4,8], and then use that value in the functions above, instead.

    -

    Can we simplify that?

    -

    We can, yes.

    -

    People far smarter than us have looked at this work, and two in particular — Maurice Cox and Carl de Boor — came to a mathematically pleasing solution: to compute a point P(t), we can compute this point by evaluating d(t) on a curve section between knots i and i+1:

    - - -

    This is another recursive function, with k values decreasing from the curve order to 1, and the value α (alpha) defined by:

    - - -

    That looks complicated, but it's not. Computing alpha is just a fraction involving known, plain numbers. And, once we have our alpha value, we also have (1-alpha) because it's a trivial subtraction. Computing the d() function is thus mostly a matter of computing pretty simple arithmetical statements, with some caching of results so we can refer to them as we recurve. While the recursion might see computationally expensive, the total algorithm is cheap, as each step only involves very simple maths.

    -

    Of course, the recursion does need a stop condition:

    - - -

    So, we actually see two stopping conditions: either i becomes 0, in which case d() is zero, or k becomes zero, in which case we get the same "either 1 or 0" that we saw in the N() function above.

    -

    Thanks to Cox and de Boor, we can compute points on a B-Spline pretty easily using the same kind of linear interpolation we saw in de Casteljau's algorithm. For instance, if we write out d() for i=3 and k=3, we get the following recursion diagram:

    - - -

    That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. It's really quite elegant.

    -

    One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then summing back up "to the left". We can just start on the right and work our way left immediately.

    -

    Running the computation

    -

    Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on this Michigan Tech page, but an easier to read version is implemented by b-spline.js, so we'll look at its code.

    -

    Given an input value t, we first map the input to a value from the domain [0,1] to the domain [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped t value lies on:

    + +

    + That is, we compute d(3,3) as a mixture of d(2,3) and d(2,2), where those two are themselves a + mixture of d(1,3) and d(1,2), and d(1,2) and d(1,1), respectively, which are + themselves a mixture of etc. etc. We simply keep expanding our terms until we reach the stop conditions, and then sum everything back up. + It's really quite elegant. +

    +

    + One thing we need to keep in mind is that we're working with a spline that is constrained by its control points, so even though the + d(..., k) values are zero or one at the lowest level, they are really "zero or one, times their respective control point", so + in the next section you'll see the algorithm for running through the computation in a way that starts with a copy of the control point + vector and then works its way up to that single point, rather than first starting "on the left", working our way "to the right" and then + summing back up "to the left". We can just start on the right and work our way left immediately. +

    +

    Running the computation

    +

    + Unlike the de Casteljau algorithm, where the t value stays the same at every iteration, for B-Splines that is not the case, + and so we end having to (for each point we evaluate) run a fairly involving bit of recursive computation. The algorithm is discussed on + this Michigan Tech page, but an easier to read version + is implemented by b-spline.js, so we'll look at its code. +

    +

    + Given an input value t, we first map the input to a value from the domain [0,1] to the domain + [knots[degree], knots[knots.length - 1 - degree]. Then, we find the section number s that this mapped + t value lies on: +

    - - -
    1 - -
    2
    3
    - -

    after running this code, s is the index for the section the point will lie on. We then run the algorithm mentioned on the MU page (updated to use this description's variable names):

    +} + + + + 2 + + + 3 + + - - - - - - - - - -
    1 - -
    2
    3
    4
    5
    6
    7
    8
    9
    10
    - -

    (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those v[i-1] don't try to use an array index that doesn't exist)

    -

    Open vs. closed paths

    -

    Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need more than just a single point.

    -

    Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than separate coordinate objects.

    -

    Manipulating the curve through the knot vector

    -

    The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at:

    -
      -
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. -
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. -
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. -
    7. we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a uniform vector in between.
    8. -
    -

    Uniform B-Splines

    -

    The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to the curvature.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve.

    -

    The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get rid of these, by being clever about how we apply the following uniformity-breaking approach instead...

    -

    Reducing local curve complexity by collapsing intervals

    -

    Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost and the curve "kinks".

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    Open-Uniform B-Splines

    -

    By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the curve not starting and ending where we'd kind of like it to:

    -

    For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences that determine the curve shape.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    Non-uniform B-Splines

    -

    This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than that any value knots[k+1] should be greater than or equal to knots[k].

    -

    One last thing: Rational B-Splines

    -

    While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like for rational Bézier curves.

    - - - 脚本被禁用。显示备用图片。 - - - - - -

    Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in arbitrary-precision 2D design due to the level of control a NURBS curve offers designers.

    -

    While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the last.

    -

    Extending our implementation to cover rational splines

    -

    The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the extended dimension.

    -

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    -

    We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how many dimensions it needs to interpolate.

    -

    In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight information.

    -

    Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing (x'/w', y'/w'). And that's it, we're done!

    +} + + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 10 + + -
    -
    - +

    + (A nice bit of behaviour in this code is that we work the interpolation "backwards", starting at i=s at each level of the + interpolation, and we stop when i = s - order + level, so we always end up with a value for i such that those + v[i-1] don't try to use an array index that doesn't exist) +

    +

    Open vs. closed paths

    +

    + Much like poly-Béziers, B-Splines can be either open, running from the first point to the last point, or closed, where the first and last + point are the same coordinate. However, because B-Splines are an interpolation of curves, not just points, we can't simply make the first + and last point the same, we need to link as many points as are necessary to form "a curve" that the spline performs interpolation with. As + such, for an order d B-Spline, we need to make the first and last d points the same. This is of course hardly + more work than before (simply append points.splice(0,d) to points) but it's important to remember that you need + more than just a single point. +

    +

    + Of course if we want to manipulate these kind of curves we need to make sure to mark them as "closed" so that we know the coordinate for + points[0] and points[n-k] etc. don't just happen to have the same x/y values, but really are the same + coordinate, so that manipulating one will equally manipulate the other, but programming generally makes this really easy by storing + references to points, rather than copies (or other linked values such as coordinate weights, discussed in the NURBS section) rather than + separate coordinate objects. +

    +

    Manipulating the curve through the knot vector

    +

    + The most important thing to understand when it comes to B-Splines is that they work because of the concept of a knot vector. As + mentioned above, knots represent "where individual control points start/stop influencing the curve", but we never looked at the + values that go in the knot vector. If you look back at the N() and a() functions, you see that interpolations are based on + intervals in the knot vector, rather than the actual values in the knot vector, and we can exploit this to do some pretty interesting + things with clever manipulation of the knot vector. Specifically there are four things we can do that are worth looking at: +

    +
      +
    1. we can use a uniform knot vector, with equally spaced intervals,
    2. +
    3. we can use a non-uniform knot vector, without enforcing equally spaced intervals,
    4. +
    5. we can collapse sequential knots to the same value, locally lowering curve complexity using "null" intervals, and
    6. +
    7. + we can form a special case non-uniform vector, by combining (1) and (3) to for a vector with collapsed start and end knots, with a + uniform vector in between. +
    8. +
    +

    Uniform B-Splines

    +

    + The most straightforward type of B-Spline is the uniform spline. In a uniform spline, the knots are distributed uniformly over the entire + curve interval. For instance, if we have a knot vector of length twelve, then a uniform knot vector would be [0,1,2,3,...,9,10,11]. Or + [4,5,6,...,13,14,15], which defines the same intervals, or even [0,2,3,...,18,20,22], which also defines + the same intervals, just scaled by a constant factor, which becomes normalised during interpolation and so does not contribute to + the curvature. +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    + This is an important point: the intervals that the knot vector defines are relative intervals, so it doesn't matter if every + interval is size 1, or size 100 - the relative differences between the intervals is what shapes any particular curve. +

    +

    + The problem with uniform knot vectors is that, as we need order control points before we have any curve with which we can + perform interpolation, the curve does not "start" at the first point, nor "ends" at the last point. Instead there are "gaps". We can get + rid of these, by being clever about how we apply the following uniformity-breaking approach instead... +

    +

    Reducing local curve complexity by collapsing intervals

    +

    + Collapsing knot intervals, by making two or more consecutive knots have the same value, allows us to reduce the curve complexity in the + sections that are affected by the knots involved. This can have drastic effects: for every interval collapse, the curve order goes down, + and curve continuity goes down, to the point where collapsing order knots creates a situation where all continuity is lost + and the curve "kinks". +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    Open-Uniform B-Splines

    +

    + By combining knot interval collapsing at the start and end of the curve, with uniform knots in between, we can overcome the problem of the + curve not starting and ending where we'd kind of like it to: +

    +

    + For any curve of degree D with control points N, we can define a knot vector of length N+D+1 in + which the values 0 ... D+1 are the same, the values D+1 ... N+1 follow the "uniform" pattern, and the values + N+1 ... N+D+1 are the same again. For example, a cubic B-Spline with 7 control points can have a knot vector + [0,0,0,0,1,2,3,4,4,4,4], or it might have the "identical" knot vector [0,0,0,0,2,4,6,8,8,8,8], etc. Again, it is the relative differences + that determine the curve shape. +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    Non-uniform B-Splines

    +

    + This is essentially the "free form" version of a B-Spline, and also the least interesting to look at, as without any specific reason to + pick specific knot intervals, there is nothing particularly interesting going on. There is one constraint to the knot vector, other than + that any value knots[k+1] should be greater than or equal to knots[k]. +

    +

    One last thing: Rational B-Splines

    +

    + While it is true that this section on B-Splines is running quite long already, there is one more thing we need to talk about, and that's + "Rational" splines, where the rationality applies to the "ratio", or relative weights, of the control points themselves. By introducing a + ratio vector with weights to apply to each control point, we greatly increase our influence over the final curve shape: the more weight a + control point carries, the closer to that point the spline curve will lie, a bit like turning up the gravity of a control point, just like + for rational Bézier curves. +

    + + + 脚本被禁用。显示备用图片。 + + + + + +

    + Of course this brings us to the final topic that any text on B-Splines must touch on before calling it a day: the + NURBS, or Non-Uniform Rational B-Spline (NURBS is not a plural, + the capital S actually just stands for "spline", but a lot of people mistakenly treat it as if it is, so now you know better). NURBS is an + important type of curve in computer-facilitated design, used a lot in 3D modelling (typically as NURBS surfaces) as well as in + arbitrary-precision 2D design due to the level of control a NURBS curve offers designers. +

    +

    + While a true non-uniform rational B-Spline would be hard to work with, when we talk about NURBS we typically mean the Open-Uniform + Rational B-Spline, or OURBS, but that doesn't roll off the tongue nearly as nicely, and so remember that when people talk about NURBS, + they typically mean open-uniform, which has the useful property of starting the curve at the first control point, and ending it at the + last. +

    +

    Extending our implementation to cover rational splines

    +

    + The algorithm for working with Rational B-Splines is virtually identical to the regular algorithm, and the extension to work in the + control point weights is fairly simple: we extend each control point from a point in its original number of dimensions (2D, 3D, etc.) to + one dimension higher, scaling the original dimensions by the control point's weight, and then assigning that weight as its value for the + extended dimension. +

    +

    For example, a 2D point (x,y) with weight w becomes a 3D point (w * x, w * y, w).

    +

    + We then run the same algorithm as before, which will automatically perform weight interpolation in addition to regular coordinate + interpolation, because all we've done is pretended we have coordinates in a higher dimension. The algorithm doesn't really care about how + many dimensions it needs to interpolate. +

    +

    + In order to recover our "real" curve point, we take the final result of the point generation algorithm, and "unweigh" it: we take the + final point's derived weight w' and divide all the regular coordinate dimensions by it, then throw away the weight + information. +

    +

    + Based on our previous example, we take the final 3D point (x', y', w'), which we then turn back into a 2D point by computing + (x'/w', y'/w'). And that's it, we're done! +

    +
    +
    + -

    Comments and questions

    -

    First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how to let me know you appreciated this book, you have two options: you can either head on over to the Patreon page for this book, or if you prefer to make a one-time donation, head on over to the buy Pomax a coffee page. This work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on writing.

    -

    With that said, on to the comments!

    -
    +

    + + Comments and questions +

    +

    + First off, if you enjoyed this book, or you simply found it useful for something you were trying to get done, and you were wondering how + to let me know you appreciated this book, you have two options: you can either head on over to the + Patreon page for this book, or if you prefer to make a one-time donation, head on over to + the buy Pomax a coffee page. This + work has grown from a small primer to a 70-plus print-page-equivalent reader on the subject of Bézier curves over the years, and a lot of + coffee went into the making of it. I don't regret a minute I spent on writing it, but I can always do with some more coffee to keep on + writing. +

    +

    With that said, on to the comments!

    +
    +
    +
    +
    +
    - - - + -
    - - - - - - - - - \ No newline at end of file + + +