Open Educational Resources

Engineering Mechanics: STATICS: Scalars and vectors

Basic definitions

Scalars and vectors are mathematical objects and they are useful for quantifying physical quantities.

Scalar. A scalar is a real number. For example 2.23, 0, -1324, \pi , and so on .

Scalar quantity. A scalar quantity is a quantity that can be described by a single real number. This number specifies the magnitude or size of that quantity. For example mass, speed, area, temperature, and pressure are scalar quantities.

Sometimes a scalar quantity is simply referred to as scalar. For example, “mass is a scalar” is equivalent to stating: “mass is a scalar quantity”.

Mathematical operations on scalars follow the usual rules of arithmetic.

Vector. A vector is a mathematical object that is understood by its size (magnitude) and its direction. This definition perfectly fulfils the requirements of engineering mechanics. However, for a more rigorous mathematical definition, please refer to section 14.2.8.

Vector quantity. A vector quantity is a quantity characterized by both a magnitude and a direction. For example velocity, force, acceleration, and moment are all vector quantities.

The magnitude (sometimes referred to as the norm) of a vector is always taken to be a positive number. For example the magnitude of the velocity of a moving object is the speed of that object. Speed is always measured and reported as a positive number.

The terms vector and vector quantity can be equivalently used.

In this book, we distinguish between a vector and a scalar using special notations. The notation for a vector is a bold upper-case letter \bold{A} or an upper-case letter with a symbolic overhead arrow \overrightarrow A. By convention, the direction of the arrow is from left to right and it does not convey any information about the direction of the vector itself. The magnitude of a vector is denoted by a regular upper-case letter A or the vector notation embedded by two vertical lines \vert \bold A \vert or \vert \overrightarrow A\vert. The embedding vertical line notation is also used to denote the absolute value of a scalar, for example \vert s\vert where s is a real number. In some text the notation of vector magnitude is a double-vertical-line notation \|A\| and the absolute value of a scalar is shown by \vert s\vert. In this book, \vert . \vert is used for both cases.

Using an upper-case letter with a symbolic overhead arrow is recommended when writing notes, assignments, or exams.

Graphical representation of vectors

A graphical or geometrical representation of vectors is by directed line segments or arrows. Actually, a directed line segment is a vector itself.

A line segment is a finite-length piece of a straight line. It is defined by two ending points on a straight line.

A directed line segment (arrow) is a line segment with a defined direction. To define the direction, one of the segment’s end point is distinguished as the tail, and consequently the other one as head. Thereby, the r direction is defined as the sense of direction from the tail to the head of the segment. A directed line segment can be displayed by an arrow such that the arrow head defines the sense of direction. The line that is coincident with an arrow is called the line of action. The orientation or direction of an arrow (its tail is already known or defined) with respect to a fixed axis (a line) is defined by an angle formed between the axis and the line of action of the arrow. Figure 1 shows a directed line segment or arrow representing the vector \bold F with its defined terms.

Figure 1. Geometric representation of a directed line segment (arrow) or a vector.

Figure 1. Geometric representation of a directed line segment (arrow) or a vector.

A directed line segment or simply an arrow has both the direction and length. The length of an arrow is defined as its magnitude. An arrow has both a direction and a magnitude, therefore, an arrow is a vector. To be more precise an arrow is a geometric vector. In this book, vectors and arrow are equivalent.

Zero vector. The zero vector, denoted by \bold 0 or \overrightarrow 0, is a vector of length zero. The zero vector does not point toward any direction, therefore, its direction is undefined.

An important remark with a vector is that the position of a directed line segment or a vector in the space does not change the properties of the vector; because a vector is defined only by its direction and magnitude. As a result, two vectors are equal if they have the same direction and magnitude. Examples of equal vectors are shown in Figure 2 below.

Equivalent vectors

Figure 2. Equivalent vectors.

A vector (an arrow) can be in a plane (a 2-dimensional space) or in a 3-dimensional space.

Vector operations using the parallelogram rule and trigonometry

The following are mathematical operations defined on vectors:

  1. Multiplication (or division) by a scalar.
  2. Vector addition (or subtraction).
  3. Vector products (dot product and cross product).

The first two operations are the fundamental operations. Vector products are special functions on vectors discussed in sections 14.2.6 and 14.27. In this section, the first two operations are discussed.

Multiplication (and division) of a vector by a scalar

A vector can be multiplied by a scalar and the result is another vector. A vector \bold A multiplied by a scalar a will be equal the vector \bold B. This operation is denoted by \bold B = a\bold A. Division of a vector by a scalar a\ne 0 is the same as multiplication by 1/a. Multiplication of a vector by a scalar is implemented by this rule:

Scalar multiplication simply scales the magnitude (length) of a vector if the scalar is a positive number. The operation also reverses the direction of the vector if the scalar is a negative number.

The following figure demonstrates different vectors as the results of multiplication of \bold A by a scalar.

Multiplication of a vector by a scalar.

Figure 3. Multiplication of a vector by a scalar.

Remark: For any non zero scalar a, if a< 0 then a\bold A scales and reverse (switches) the direction of \bold A.

Remark: multiplication of any vector, \bold A, by zero results in the zero vector: 0\bold A=\bold 0.

Remark: multiplying a vector by a scalar scales its magnitude: if \bold B = a\bold A, then \vert \bold B \vert=\vert a \bold A \vert =\vert a\vert \vert \bold A \vert in which \vert a \vert is the absolute value of the scalar a.

Colinear vectors. Vectors on the same straight line are called colinear vectors. In other word, \bold A and \bold B are colinear if and only if there is an scalar s such that \bold A= s\bold B. If \bold B\ne \bold 0 then the absolute value of s is \vert \bold A\vert / \vert \bold B \vert, because \vert \bold A \vert = \vert s \bold B\vert. According to this definition, the zero vector is colinear with all other vectors.

It should be noted that parallel arrows or parallel vectors (vectors with parallel lines of action) are colinear because they can be moved on the same straight line.

Concurrent vectors. Vectors with their tails starting at the same point are called concurrent. This definition is only for the sake of graphical representation of arrows as vectors. Any number of vector can become concurrent once they are moved (re-drawn) such chat they all start from the same point.

Unit vector. A unit vector is a vector of magnitude 1. Any vector can be written as a scalar multiplication of a unit vector with the same direction: \bold A = \vert \bold A \vert \bold u_{\bold A} such that the notation \bold u_{\bold A} indicates the unit vector (\vert \bold u_{\bold u}\vert = 1) in the direction of \bold A. Any vector \bold A\ne \bold 0 can become a unit vector if scaled by 1/\vert \bold A\vert. In this case we write \bold u_{\bold A}=\frac{\bold A}{\vert \bold A \vert} and call \bold u_{\bold A} the unit vector of \bold A. Obviously, \bold u_{\bold A} is colinear with \bold A.

Normalizing a vector. Making a unit vector out of a vector is called normalizing a vector. The resultant unit vector only shows the direction of the original vector.

Vector addition

Vector addition is a vector operation that produces another vector. The addition of two vectors \bold A and \bold B resulting in a vector \bold C is denoted as: \bold A+\bold B=\bold C. There are two equivalent rules (laws) for vector addition: triangle rule, and parallelogram law.

To calculate \bold A+\bold B=\bold C using the triangle rule follow these steps:

  1. Put \bold A and \bold B head to tail (or tail to head), and then,
  2. Close the triangle with the vector \bold C .

Figure 4 demonstrates the triangle rule for vector addition.

Addition of two vectors by The triangle rule

Figure 4. Addition of two vectors by The triangle rule.

The vector addition is commutative meaning \bold A+\bold B= \bold B+\bold A. This can be readily observed in the following figure:

Commutative property of vector addition.

Figure 5. Commutative property of vector addition.

The property \bold A+\bold B= \bold B+\bold A leads to the parallelogram law. To add two vectors using the parallelogram law, follow these steps:

  1. Bring the vectors to join at a point, say O, by their tails (make the vectors concurrent).
  2. From the head of each vector draw a line parallel to the other vector. These two lines intersect at a point P and form two adjacent lines of a parallelogram.
  3. Draw a vector on the diagonal of the parallelogram from point O to the point P. This vector is the resultant vector \bold A + \bold B =\bold C.

The figure below demonstrates vector addition using the parallelogram law.

The parallelogram law.

Figure 6. The parallelogram law.

Vector subtraction. The difference between two vectors, or vector subtraction, exactly follows the rules for vector addition. For two vectors \bold A and \bold B the operation \bold A - \bold B can be written as \bold A + (-1)\bold B =\bold A + (-\bold B) which means adding \bold A to \bold B with its direction revered. An example is shown in Figure 7.

Vector subtraction.

Figure 7. Vector subtraction.

Remark: the term vector addition as an operation naturally includes vector subtraction.

Remark: while vector addition is commutative (\bold A + \bold B=\bold B + \bold A), vector subtraction is not commutative because \bold A - \bold B\ne \bold B - \bold A. In fact, \bold A -\bold B = -(\bold B - \bold A).

Addition of colinear vectors. The addition of colinear vectors are naturally handled by the triangle rule. Examples of vector addition in the case of colinear vectors are demonstrated in Figure 8.

 Vector addition of parallel or colinear vectors.

Figure 8. Vector addition of parallel or colinear vectors.

Addition of several vectors. If more than two vectors are to be added (or subtracted), they will be added successively. For example \bold A + \bold B + \bold C=(\bold A + \bold B) + \bold C . An example is as demonstrated below:

Successive addition of 3 vectors using the triangle rule.

Figure 9. Successive addition of 3 vectors using the triangle rule.

Trigonometry for vector operations

The triangle and parallelogram rules graphically give the resultant vector such that the direction of the vector can be understood. To calculate the magnitude of the resultant vector and the angle it makes with other vectors, trigonometry laws are used. Basic trigonometry laws are:

Basic trigonometry laws.

Figure 10. Basic trigonometry laws.

In addition to the above laws, The Sine and Cosine laws hold for any triangle or any triple of vectors forming a triangle:

 Sine and Cosine Laws

Figure 11. Sine and Cosine Laws.

Example 1

Determine the magnitude of the resultant vector \bold C=\bold A+\bold B and its direction measured counterclockwise from the positive x-axis shown.

Solution

— to be added —

Remark: It should be understood that the magnitude of vector addition is not necessarily equal the addition of the magnitudes. In other words, \vert \bold A + \bold B\vert \neq \vert \bold A \vert + \vert \bold B \vert. In fact, \vert \bold A + \bold B\vert \le \vert \bold A \vert + \vert \bold B \vert always holds. This statement, called the triangle inequality, can be explored or proved by using the Cosine law.

Vector components and Cartesian vector notation

The vector operation \bold F=\bold F_1+\bold F_2 can be considered as decomposing the vector \bold F into two vector components \bold F_1 and \bold F_2. This implies that \bold F has two components along the directions or lines of action of \bold F_1 and \bold F_2 (Figure 12). Given a set of directions, a component of a vector along each directions can be found by decomposing the vectors. Vector decomposition of \bold F can be regarded as the reverse of the vector addition, which means finding vectors with known directions (but unknown magnitude) such that their addition equals \bold F. Decomposing a vector can be performed using the trigonometry laws.

Decomposing a (planar) vector into two components along two directions or lines.

Figure 12. Decomposing a (planar) vector into two components along two directions or lines.

Example 2

The vector \bold F of magnitude \vert \bold F\vert = 100 is to be resolved into two components along the lines a-a and b-b. Determine the angle \alpha if the component of \bold F along the line a-a has a magnitude of 70, i.e. \vert \bold F_a\vert = 70.

example2

Solution

— to be added —

Vector decomposition makes the resultant vector components on each direction collinear. This makes the addition of the vector components on a particular direction straight forward.

Example 3

Determine the resultant vector, \bold R = \bold P + \bold Q, by decomposing the vectors \bold P and \bold Q on the two perpendicular lines (axes) denoted by x and y.

example_3

Solution

— to be added —

In the above example, the vectors are resolved into their components on the given axes and the collinear components of the vectors are added. Then, adding the resultant vectors on the axes gives the final resultant vector. This approach is a systematic approach for adding vectors.

Adding more than two vectors through a direct use of the parallelogram law, the Sine and Cosine laws is cumbersome. To do vector addition efficiently and in a systematic way, vectors are to be decomposed (resolved) onto a set of directions and the vector addition is performed using the colinear components.

In this regard, a set of directions simplifying the equations should be chosen. A convenient set of directions is a set of perpendicular directions called orthogonal axes. Orthogonal means perpendicular and an axis is a line to which a positive direction is associated by a vector colinear with the line. The positive direction of an axis sets a benchmark determining the positive (or negative) directions of the colinear vectors with the axis. For example, considering an axis x shown in Figure 13, we say \bold P is in the positive direction and \bold Q is in the negative direction (with respect to the positive direction defined by the axis).

Definition of an axis.

Figure 13. Definition of an axis.

Decomposing a coplanar vector (a vector in the 2-dimensional plane) needs two orthogonal axes. By the same token, decomposing a spatial vector (a vector in the 3-dimensional space) needs three orthogonal vectors. A set of orthogonal axis (intersecting at a point called the origin), is called a Cartesian coordinate system or Cartesian frame. The components of a vector in a Cartesian coordinate system is referred to as the rectangular components or Cartesian components.

A vector decomposed (resolved) into its rectangular components can be expressed by using two possible notations namely the scalar notation (components) and the Cartesian vector notation. Both notations are explained for the planar conditions, and then expanded to three dimensions.

Rectangular vector components of coplanar vectors

Consider a vector \bold F and its rectangular components resolved in a Cartesian system as shown in Figure 14. The x and y vector components of \bold F are resolved and denoted by \bold F_x and \bold F_y respectively. The subscripts x and y denote the axes with which the components are colinear. The magnitude of the vector components are \vert \bold F_x \vert = \vert \bold F\vert \cos \theta and \vert \bold F_y \vert = \vert \bold F\vert \sin \theta. By the definition the magnitude of the vectors are (has to be) positive. Therefore, the magnitude of the components does not have information about their directions. To include the information about the directions of the vector components in a Cartesian system, scalar notation or scalar components are defined. Scalar components of a vector are signed magnitudes of its rectangular components. A scalar component is positive if the vector component is directed along the positive axis, and negative if the vector component is directed along the negative axis (opposite of the axis positive direction). For the vector components \bold F_x and \bold F_y shown in Figure 14, the scalar components are denoted by F_x and F_y. Note the regular and italic font face used for the notation. The scalar components of the vector shown in Figure 14a are F_x=\vert \bold F_x\vert and F_y=\vert \bold F_y\vert both being positive. The scalar components of the vector shown in Figure 14b, however, are F_x=\vert \bold F_x\vert and F_y=-\vert \bold F_y\vert. The scalar component F_y is negative, because its vector component \bold F_y is directed along the negative direction of the y axis.

Components of vectors resolved in the Cartesian system.

Figure 14. Components of vectors resolved in the Cartesian system.

The scalar components of a vector can be collected in an ordered pair of real numbers denoted as (F_x,F_y). The first element indicated the x component and the second one refers to the y component. The scalar components of a vector are also referred to as the coordinates of a vector.

By the definition (Figure 13), an axis has an associated vector showing the positive direction of the axis. Conventionally, the vector associated with an axis is considered to be a unit vector. Any vector on an axis (or parallel with the axis) is colinear with the unit vector of an axis. Therefore, any vector on an axis can be written as an scalar multiplier of the axis unit vector. The scalar multiplier is equal to the signed magnitude of the vector on the axis. It is positive if the vector is in the direction of the axis unit vector, and negative otherwise.

The unit vectors associated with the Cartesian x and y axes are denoted by bold small letters \bold i and \bold j respectively and their respective small arrows are displayed on or near the Cartesian axes (Figure 15). Another possible notations are \hat i and \hat j.

 The planar Cartesian axes and their unit vectors.

Figure 15. The planar Cartesian axes and their unit vectors.

The rectangular components of any vector \bold F can be now expressed in terms of the Cartesian axis unit vectors: \bold F =\bold F_x + \bold F_y = \vert \bold F_x\vert \bold i +\vert \bold F_y\vert \bold j. The notation -\bold i or -\bold j is used for any vector component is in the opposite direction of its axis unit vector. For example, the vector on Figure 14.b is resolved to its rectangular components as: \bold F = \vert \bold F_x\vert\bold i + \vert\bold F_y\vert (-\bold j) = \vert \bold F_x\vert\bold i - \vert\bold F_y\vert \bold j.

As a general rule any vector \bold F can be written as \bold F = \pm\vert \bold F_x\vert \bold i \pm\vert \bold F_y\vert \bold j in a Cartesian system. The notation \pm means any of the signs are independently possible. Showing a vector by the addition of its rectangular components expressed in terms of the unit vectors \bold i and \bold j is called the Cartesian vector notation (CVN).

Remark: Both the scalar notation and the CVN are equivalent by noting that \bold F = \pm\vert \bold F_x\vert \bold i \pm\vert \bold F_y\vert \bold j=F_x\bold i + F_y\bold j.

Remark: using the CVN is equivalent to resolving a vector in the Cartesian coordinate system.

Remark: components of a vector, referring to the vector components, are vectors, whereas, the scalar components, also known as the coordinates of a vector, are scalar.

Example 4

Determine the rectangular components of \bold F (shown in the figure below) and write them in both the scalar notation and CVN.

Solution

— to be added —

Remark: the apparent location of a vector on a plane does not affect its component. You can always consider (draw) the Cartesian axes at the tail of a vector and calculate its components.

It is worthy to note the relationships among the scalar components, the magnitude, and the direction of a vector. Consider a Cartesian coordinate system in which angles are measured counter clockwise from the x axis as demonstrated in Figure 16. Then the following equations hold:

(1)   \[ \begin{split}F^2&=\vert \bold F\vert^2 =F_x^2 + F_y^2 \quad \text{or}\quad F=\vert \bold F\vert =\sqrt {F_x^2 + F_y^2}\\ \tan \theta &= \frac{F_y}{F_x},\quad \sin\theta=\frac{F_y}{F},\quad\cos\theta=\frac{F_x}{F}\quad\text{for }0^\circ\le \theta < 360^\circ  \end{split}\]

Figure 16. The relationship between the scalar components of a vector and its magnitude, and direction.

Figure 16. The relationship between the scalar components of a vector and its magnitude, and direction.

Specifying the direction of a planar vector in the Cartesian frame can be also specified by the acute angle that the vector makes with any of the Cartesian axes. In this case, the angle is always as 0^\circ \le \theta\le 90^\circ and consequently the absolute values of the scalar components (which are the magnitudes of the vector components) should be used in the trigonometry Laws. Figure 17a demonstrates some examples. Another way of specifying the direction of a planar vector is by a small slope triangle. The small slope triangle conveys the information on \sin\theta, \cos\theta, and \tan\theta of the acute angle that the vector makes with an axis (Figure 17b).

Figure 17. Specifying the direction of a planar vector in the Cartesian frame by the acute angle that the vector makes with any of the Cartesian axes.

Figure 17. Specifying the direction of a planar vector in the Cartesian frame by the acute angle that the vector makes with any of the Cartesian axes.

Using an acute angle to specify the direction of a vector modifies Eq. 1 as:

(2)   \[\begin{split} F^2&=\vert \bold F\vert^2 =F_x^2 + F_y^2 \quad \text{or}\quad F=\vert \bold F\vert =\sqrt {F_x^2 + F_y^2} \\ \tan \theta &= \frac{\mid F_y\mid}{\mid F_x\mid},\quad \sin\theta=\frac{\mid F_y\mid}{F},\quad\cos\theta=\frac{\mid F_x\mid}{F}\quad\text{for }0^\circ\le \theta < 90^\circ \end{split}\]

Remark: be cautious about the way the direction of a vector is specified and the proper formulation you should use for the calculations. If an acute angle is used, you should determine the signs of the scalar components by observing the direction of the vector components.

Remark: using an acute angle to specify the direction of an vector has a calculation advantage that 0^\circ\le\theta\le 90^\circ is in the domain of the inverse trigonometric functions \sin^{-1}, \cos^{-1} and \tan^{-1}.

Planar vector operations using CVN (two dimensions)

Addition of several vectors \bold F_1 + \bold F_2 + \dots +\bold F_n using the CVN takes the following steps:

1- Express each vector in CVN by resolving the vector to its rectangular components:

    \[\begin{split} \bold F_1&=F_{1x}\bold i + F_{1y}\bold j\\ \bold F_2&=F_{2x}\bold i + F_{2y}\bold j\\ &\vdots\\ \bold F_n&=F_{nx}\bold i + F_{ny}\bold j \end{split} \]

2- Add the respective components (components on the same axis):

    \[\begin{split} \bold R_x&= (F_{1x}+F_{2x} + \dots + F_{nx})\bold i =(\sum_i^nF_{ix})\bold i= R_x\bold i\\ \bold R_y&= (F_{1y}+F_{2y} + \dots + F_{ny})\bold j = (\sum_i^nF_{iy})\bold j=R_y\bold j \end{split} \]

in which \bold R_x and \bold R_y are the components of the resultant vector \bold R.

3- Form the resultant vector \bold R = \bold R_x +\bold R_y. The magnitude and direction of \bold R can be obtained by Eq. 1 or 2.

The above steps can be summarized as:

(3)   \[\bold R=(\sum F_x)\bold i + (\sum F_y)\bold j \]

in which \sum F_x and \sum F_y represent the algebraic sums of the scalar components along the x and y axes respectively.

Remark: the apparent locations of vectors on a plane does not affect the vector operations. You can always consider (draw) fixed Cartesian axes, make the vectors concurrent by moving them to the origin of the Cartesian system (tails of the vectors meet at the origin).

Rectangular vector components of spatial vectors

To treat a vectors in three dimensions, a 3-dimensional (3D) Cartesian (rectangular) coordinate system is to be defined. A common 3D rectangular coordinate system is a right-handed coordinate system. A coordinate systems of three orthogonal axes is said to be right-handed if your right-hand thumb points in the z direction when fingers curl from the (positive) x axis to the (positive) y axis (Figure 18a). In three dimensions, the unit vectors of the axes are denoted by \bold i, \bold j, and \bold k as demonstrated in Figure 18b.

Figure 18. A right-handed Cartesian coordinate system.

Figure 18. A right-handed Cartesian coordinate system.

Any vector in three dimensions can be resolved into three rectangular components. A vector expressed by the CVN in 3D is written as \bold A = \bold A_x + \bold A_y + \bold A_z = A_x\bold i + A_y\bold j + A_z\bold k such that \bold A_x, \bold A_y and \bold A_z are the vector components, and A_x, A_y and A_z are the scalar components (Figure 19a). The magnitude of \bold A is: A=\vert \bold A\vert=\sqrt{A_x^2 + A_y^2 + A_z^2}. This can be proved by considering the shaded triangles in Figure 19b and twice using the Pythagoras theorem.

The direction of \bold A in the 3D Cartesian coordinate system can be defined by the coordinate direction angles \alpha, \beta, and \gamma measured from the positive x, y, and z axes respectively to the vector (Figure 19c). These angles are limited as 0^\circ \le \alpha, \beta,\gamma \le 180^\circ. The following relationship holds between the vector magnitude, the scalar components and the coordinate direction angles:

A_x=A\cos \alpha\quad A_y=A\cos \beta\quad A_z=A\cos \gamma

Figure 19. Components of a 3D vector in the 3D Cartesian coordinate system.

Figure 19. Components of a 3D vector in the 3D Cartesian coordinate system.

The three coordinate direction angles are not independent and they are related by an equation. To derive the equation we start by expressing a vector \bold A by its Cartesian components \bold A = A_x\bold i + A_y\bold j + A_z\bold k. Dividing this equation by the magnitude of \bold A gives \bold u_A = \frac{\bold A}{A} = \frac{A_x}{A}\bold i + \frac{A_y}{A}\bold j + \frac{A_z}{A}\bold k which is written as \bold u_A=\cos\alpha\bold i + \cos\beta\bold j + \cos\gamma\bold k. Noting that \bold u_A is a unit vector, and taking the magnitude of the right hand side of the equation, we find the following relationship:

1=\cos^2\alpha + \cos^2\beta + \cos^2\gamma

This means knowing two of the angles, the third one is readily obtainable.

Spatial vector operations using CVN (three dimensions)

Once the vectors to be summed are resolved into their components, the same steps as in the planar case should be followed but with including components in the z direction. This means:

(4)   \[\bold R=(\sum F_x)\bold i + (\sum F_y)\bold j + (\sum F_z)\bold kj \]

in which \sum F_x, \sum F_y, and \sum F_z represent the algebraic sums of the scalar components along the x, y and z axes respectively.

Position vector

Naturally, we can have a perception of a geometric point in space. A point in the space (3D or 2D) can be imagined as an infinitely small dot. The space containing all the geometric points is called the geometric space. In a geometric space, geometric objects (or shapes) are born out of points. For examples, lines, circles, polylines, polygons, surfaces, cubes, etc. We simply refer to the geometric space as the space. The location of a point is the space can be quantitatively determined relative to other points. Thereby, all points can be located relative to one marked point. This point is called the origin of the space and denoted by O. Generally, we form a Cartesian coordinate system in the space and define the origin of the space to be at the intersection of the three (or two) axes of the coordinate system. Therefore, any point in the space is identified by a coordinate triple (x,y,z) in the 3D space, or (x,y) in the 2D space. Obviously, the coordinate of the origin is (0,0,0). This approach was introduced by the French mathematician and philosopher René Descartes.

Another approach to locate (identify) a point in the space is by using a vector. Once a point in the space is marked as the origin, a vector (arrow) can be readily drawn (imagined) from the origin of the space, O, to any point, P, in the space. This vector is called the position vector. The position vector is generally denoted by \bold r. To indicate a particular point located by a position vector, the notation \bold r_{OP} or \bold r_P is used. The former notation explicitly express the sense of the direction from the origin O to the point P, while the latter notation implicitly conveys the sense of the direction. A position vector simply points at a point in the space; is tail is at the origin of the space and its head is at any point to be located (Figure 20a). Constructing a Cartesian coordinate system a the origin as well, the position vector is resolved by the CVN as:

    \[\bold r = x\bold i + y\bold j + z\bold k\]

where (x,y,z) are the coordinates of the point located by the position vector. Figure 20a shows a position vector in a Cartesian coordinate system locating the points of a space. Choosing a point as the origin of the space is arbitrary; however, a wise choice depending on the problem can simplify the equations. Figure 20b shows a point in a space located by two different coordinate systems and position vectors. Note that the point is the same point but looked at from two different outlooks; therefore it has different coordinates and position vectors in different coordinate systems.

Figure 20. Position vector and coordinate systems.

Figure 20. Position vector and coordinate systems.

A position vector is a geometric object because it connects two geometric points in the space. This implies that the magnitude of a geometric vector \vert \bold r\vert = \sqrt {x^2+y^2+z^2} has the dimension and units of length, and it measures the distance between two points.

Remark: unit of a position vector or its length has to be specified and written in the calculations. For example you can write \bold r=\{a\bold i + b\bold j + c\bold k\}\ \text{mm} where (a,b,c) are the scalar components with the dimension and unit of length, or for example r=7.0 \text{ m}.

Remark: the unit vector \hat {\bold r}=\frac{\bold r}{r} has no dimension. The unit vectors (\bold i, \bold j, \bold k) of the coordinate system have no dimension as well.

Using the coordinate direction angles \alpha, \beta, and \gamma (measured from the positive x, y, and z axes respectively), the following equations hold for a position vector in the Cartesian coordinate system:

(5)   \[ \begin{split} \bold r &= x\bold i + y\bold j + z\bold k\quad\quad r=\vert\bold r\vert=\sqrt{x^2+y^2+z^2}\\ x&=r\cos \alpha\quad y=r\cos \beta\quad z=r\cos \gamma\quad \text{for } 0^\circ \le \alpha, \beta,\gamma \le 180^\circ\\ \bold r&=r \hat{\bold r}\ \text{such that }\hat{\bold r}=(\cos\alpha)\bold i +(\cos\beta)\bold j+(\cos\gamma)\bold k \end{split}\]

Any vector in the geometric space is a type of a position vector because it connects two points whether or not either marked as the origin. If a (geometric) vector connects two points, say A and B, in the space such than non of the points are designated as the origin, the vector is denoted by \bold r_{AB} and called a displacement vector. The displacement vector in fact expresses the displacement of the position vector from point A to point B. To understand this concept, imagine two points A and B respectively located by the position vectors \bold r_A and \bold r_b in the space already having a defined origin O and a Cartesian coordinate system (Figure 21). The vector (arrow) from the point A to the point B is the displacement vector \bold r_{AB}. By the rules of vector addition, we can write:

(6)   \[\bold r_A+\bold r_{AB}=\bold r_B \quad\text{or}\quad \bold r_B-\bold r_A=\bold r_{AB}\]

By using the CVN:

(7)   \[\bold r_{AB}= (x_B-x_A)\bold i+(y_B-y_A)\bold j+(z_B-z_A)\bold k\]

Interpreting the above equations geometrically, we understand that if an object is displaced along the vector \bold r_{AB}, its position is changed from the point A, located by \bold r_{AB}, to the point B located by \bold r_B. The quantity \vert\bold r_{AB}\vert is the length of displacement.

Figure 21. Position and displacement vectors.

Figure 21. Position and displacement vectors.

One application of a displacement vector is to determine or set a direction for a line segment formed (imagined or drawn) between two points \bold A and \bold B in the space. This can be easily achieved through normalizing \bold r_{AB} by writing \hat{\bold r}_{AB}=\frac{\bold r_{AB}}{\vert r_{AB}\vert}. It should be noted that another possibility for setting a direction is normalizing \bold r_{BA} which is in the opposite direction of \bold r_{AB}.

Dot product

The dot product, also called the scalar product, is an operation that takes two vectors and returns a scalar. The dot product of vectors \bold A and \bold B, denoted as \bold A \cdot \bold B and read “\bold A dot \bold B” is defined as:

(8)   \[\bold A\cdot\bold B=AB\cos \theta \]

where A=\vert \bold A\vert, B=\vert \bold B\vert, and 0^\circ\le \theta \le 180^\circ is the angle between the two vectors (Figure 22)

Figure 22. Configuration of two vectors for the dot product.

Figure 22. Configuration of two vectors for the dot product.

From the definition, it is obvious that the result of the dot product is a scalar. The dot product has three properties as follows:

  1. Commutativity: \bold A\cdot\bold B = \bold B\cdot\bold A
  2. Associativity (scalar multiplication): a(\bold A\cdot\bold B)=(a\bold A)\cdot \bold B=\bold A\cdot(a\bold B)
  3. Distributivity: \bold A\cdot(\bold B + \bold D)=(\bold A\cdot \bold B)+(\bold A\cdot\bold D)

The proofs of the first two properties are by direct using the dot product definition (Eq. 8). The proof for the third property is by expanding the right hand side of the equation using the CVN notation and factoring the scalar components of \bold A. Dot product using the CVN is explained below.

Other properties of the dot product

  1. Dot product of a vector by itself gives its squared magnitude: \bold A\cdot \bold A=A^2.
  2. Dot product of two perpendicular vectors is zero: \cos \theta=90^\circ \Rightarrow \bold A\cdot \bold B=0.
  3. If 0^\circ \le \theta< 90^\circ then \bold A\cdot \bold B> 0.
  4. If 90^\circ < \theta\le 180^\circ then \bold A\cdot \bold B< 0.
  5. Dot product by the zero vector is zero: \bold A\cdot \bold 0=0.

These properties are easily proved by using Eq. 8.

Formulation of the dot product using CVN

Let \bold A and \bold B be two vectors with their scalar components (A_x, A_y, A_z) and (B_x, B_y, b_z). Using the CVN, we can write:

    \[ \begin{split} \bold A\cdot \bold B&=(A_x\bold i +A_y\bold j+A_z\bold k )\cdot(B_x\bold i +B_y\bold j+B_z\bold k)\\ &\text{by the distributivity property}\\ &=A_xB_x(\bold i\cdot \bold i)+A_xBy(\bold i\cdot \bold j)+A_xB_z(\bold i\cdot \bold k)\\ &+A_yB_x(\bold j\cdot \bold i)+A_yBy(\bold j\cdot \bold j)+A_yB_z(\bold j\cdot \bold k)\\ &+A_zB_x(\bold k\cdot \bold i)+A_zBy(\bold k\cdot \bold j)+A_zB_z(\bold k\cdot \bold k) \end{split}\]

The dot product of the unit vectors, by the dot product properties, are:

    \[ \begin{split} \bold i \cdot \bold i&=\bold j \cdot \bold j=\bold k \cdot \bold k=1\\ \bold i \cdot \bold j&=\bold j \cdot \bold i=\bold i \cdot \bold k=\bold k \cdot \bold i=\bold j \cdot \bold k=\bold k \cdot \bold j=0 \end{split}\]

Therefore,

(9)   \[\bold A\cdot \bold B=A_xB_x+A_yB_y+A_zB_z\]

This result expresses that the dot product of two vectors written in their CVN is obtained by multiplying their corresponding scalar components and summing over these product algebraically. Equation 9 indicates that calculating the dot product (Eq. 8) does not need the magnitudes of two vectors and the angle between them, if the vectors are resolved in the Cartesian system.

Application of the dot product: finding the angle between two vectors

Dot product can be used to find the angle formed between two vectors or two intersecting line. This property facilitate the calculations particularly when solving problems in three dimensions. The angle between two vectors is obtained by solving Eq. 8. for the angle term:

(10)   \[\theta=\cos^{-1}(\frac{\bold A\cdot\bold B}{AB})\quad 0^\circ\le \theta\le 180^\circ \]

in which \bold A\cdot \bold B is found from Eq. 9.

The above equation can be manipulated as:

    \[\theta=\cos^{-1}(\frac{\bold A}{A}\cdot\frac{\bold B}{B})=\cos^{-1}(\bold u_A\cdot\bold u_B)\quad 0^\circ\le \theta\le 180^\circ\]

in which \bold u_a and \bold u_b are the unit vectors of \bold A and \bold B respectively. This result naturally states that the angle between two vectors only depends on their directions and not on their magnitudes.

Application of the dot product: orthogonal projection of a vector

In many problems, we need to resolve a vector on a particular line or lines in the space. To be more precise, the component of a vector along a particular direction or axis is to be found. Decomposing a vector onto the Cartesian axes is already demonstrated. In this section, we explain decomposing a vector on general lines or axes in space using the dot product. Using the dot product makes the calculation easier specially in three dimensions.

Consider a non zero vector \bold A in the three dimensional space and a line aa intersecting the tail of the vector at a point O (Figure 23a). This line accompanies the direction along which we want to decompose the vector. The positive direction of the line is determined by a unit vector say \bold u_a. As previously mentioned, a line along with a colinear vector (determining the positive direction) is called an axis. It is intended to resolve \bold A into two components as (Figure 23b):

(11)   \[ \bold A =\bold A_a + \bold A_{\perp} \]

where \bold A_a is the component of \bold A parallel to (colinear with) the line and, indeed, the unit vector \bold u_a, and \bold A_{\perp} is the component of \bold A perpendicular to the line and, of course, the unit vector \bold u_a. The symbol \perp denotes perpendicularity.

Figure 23. Orthogonal projection.

Figure 23. Orthogonal projection.

The vector \bold A_a is called the parallel component of \bold A . Also, \bold A_a is referred to as the orthogonal projection (or projection) of \bold A onto the line aa or along the direction of \bold u_a. To obtain \bold A_a, it suffices to note that the vectors \bold A, \bold A_a and \bold A_{\perp} form a right-angle triangle (Figure 23b). Therefore by the Pythagorean’s theorem:

(12)   \[ \begin{split} A_a=A\cos \theta&=\bold A\cdot\bold u_a \\ \bold A_a&=A_a\bold u_a\\ \vert\bold A_a \vert&=\mid A_a\mid \end{split}\]

It should be noted that A_a​ is the scalar component of \bold A​ resolved along the direction of \bold u_a​. Using the dot product \bold A \cdot \bold u_a​ to calculate A_a​ may result in a negative scalar if the angle between \bold A​ and \bold u_a​ are larger than 90^\circ​. In such a case, the direction of \bold A_a​ is in the opposite direction of \bold u_a​.

The perpendicular component of \bold A can be then obtain by writing:

    \[ \bold A_{\perp}=\bold A-\bold A_a \]

The magnitude of the perpendicular component can be calculated either by A_\perp=A\sin \theta or A_\perp=\sqrt{A^2-A_a^2}.

In practice, A_a=A\cos \theta and A_\perp=A\sin \theta can be readily used if \theta in known, otherwise A_a=\bold A\cdot\bold u_a, \bold A_{\perp}=\bold A-\bold A_a, and A_\perp=\sqrt{A^2-A_a^2} can be utilized if the components of the vectors in CVN are known.

As a special case, orthogonal projection is used to find the scalar components of a vector in a Cartesian frame. This is done by writing:

(13)   \[ \begin{split} A_x&=\bold A\cdot \bold i\\ A_y&=\bold A\cdot \bold j\\ A_z&=\bold A\cdot \bold k  \end{split}\]

Cross product

The cross product of two vectors is an operation that takes two vectors \bold A and \bold B and returns a vector \bold C. This operation is denoted by \bold C =\bold A\times\bold B and is read “C equals A cross B”. The resultant vector \bold C has a magnitude, C,and a direction denoted by \bold u_C.

For \bold A \times \bold B=\bold C= C\bold u_C The magnitude of \bold C is defined as:

(14)   \[C=AB\sin\theta\quad\quad 0^\circ\le \theta \le 180^\circ\]

in which \theta is the angle between the two vectors.

The definition of the magnitude of the cross product implies that, \bold C=\bold 0 if two non-zero vectors are colinear.

By definition, \bold C is perpendicular to both \bold A and \bold B which are not colinear and it is direction is specified by the right-hand rule. In other words, \bold C is perpendicular to the plane containing the non-colinear vectors \bold A and \bold B such that \bold u_C is in the direction of the thumb following the right-hand rule (Figure 24).

Figure 24. Cross product.

Figure 24. Cross product.

Properties of Cross product

  1. Anti-commutativity: \bold A\times\bold B=-\bold B\times\bold A
  2. Associativity: for a scalar c, c(\bold A\times \bold B)=(c\bold A)\times \bold B=\bold A\times (c\bold B)
  3. Distributivity: \bold A\times(\bold B+\bold C)=(\bold A\times\bold B)+(\bold A\times\bold C)

Cross product in CVN

In a Cartesian frame the following relationships are readily obtained for the mutually perpendicular unit vectors \bold i, \bold j and \bold k:

    \[ \begin{matrix} \bold i\times \bold i=\bold 0 &\bold i\times \bold j=\bold k & \bold i\times \bold k=-\bold j \\ \bold j\times \bold i=-\bold k & \bold j\times \bold j=\bold 0 & \bold j\times \bold k=\bold i \\ \bold k\times \bold i=\bold j & \bold k\times \bold j=-\bold i & \bold k\times \bold k=\bold 0 \\ \end{matrix} \]

Based on the above results and the properties of the cross product, the cross product of two vectors in their CVN is:

    \[ \begin{split} \bold A&=A_x\bold i + A_y\bold j+A_z\bold k\quad , \quad \bold B=B_x\bold i+B_y\bold j+B_z\bold k\\ \bold A\times \bold B&=(A_xB_x)\bold i\times \bold i +(A_xB_y)\bold i\times \bold j +(A_xB_z)\bold i\times \bold k\\ &+(A_yB_x)\bold j\times \bold i +(A_yB_y)\bold j\times \bold j +(A_yB_z)\bold j\times \bold k \\ &+(A_zB_x)\bold k\times \bold i +(A_zB_y)\bold k\times \bold j +(A_zB_z)\bold k\times \bold k \end{split}\]

leading to:

    \[ \bold A\times\bold B=(A_yB_z-A_zB_y)\bold i+(A_zB_x-A_xB_z)\bold j+(A_xB_y-A_yB_x)\bold k \]

This result need not to me memorize and can be obtained by calculating the following determinant symbolically:

(15)   \[ \bold A\times\bold B = \begin{vmatrix} \bold i & \bold j & \bold k \\ A_x & A_y & A_z \\ B_x & B_y & B_z \\ \end{vmatrix}\]