C++标准编程：虚函数与内联

2023-10-09 11:38:40

我们曾经在讨论C++的时候，经常会问到：“虚函数能被声明为内联吗？”现在，我们几乎听不到这个问题了。现在听到的是：“你不应该使print成为内联的。声明一个虚函数为内联是错误的！”

　　这种说法的两个主要的原因是（1）虚函数是在运行期决议而内联是一个编译期动作，所以，我们将虚函数声明为内联并得不到什么效果；（2）声明一个虚函数为内联导致了函数的多分拷贝，而且我们为一个不应该在任何时候内联的函数白白花费了存储空间。这样做很没脑子。

　　不过，事实并不是这样。我们先来看看第一个：许多情况下，虚拟函数都被静态地决议了——比如在派生类虚拟函数中调用基类的虚拟函数的时候。为什么这样做呢？封装。一个比较明显的例子就是派生类析构函数调用链。所有的虚析构函数，除了最初触发这个析构链的虚析构函数，都被静态的决议了。如果不将基类的虚析构函数内联，我们无法从中获利[a]。这和不内联一个虚拟析构函数有什么不同吗？如果继承体系层次比较深并且有许多这样的类的实例要被销毁的话，答案是肯定的。

　　再来看另外一个不用析构函数的例子，想象一下设计一个图书馆类。我们将MaterialLocation作为抽象类LibraryMaterial的一个成员。将它的print成员函数声明为一个纯虚函数，并且提供函数定义：它输出MaterialLocation。

　　class LibraryMaterial

{ 　

private:

　　 MaterialLocation _loc; // shared data

　　// …

　　 public:

　　// declares pure virtual function

　　inline virtual void print(ostream& = cout) = 0;

};

　　// we actually want to encapsulate the handling of the

　　// location of the material within a base class

　　// LibraryMaterial print（） method - we just don’t want it

　　// invoked through the virtual interface. That is, it is

　　// only to be invoked within a derived class print（） method

　　inline void

　　LibraryMaterial::

　　print(ostream &os)

{

os <<_loc;

}

　　接着，我们引入一个Book类，它的print函数输出Title, Author等等。在这之前，它调用基类的print函数（LibraryMaterial::print（））来显示书本位置（MaterialLocation）。如下：

　　inline void

　　Book::

　　print(ostream &os)

　　{

　　// ok, this is resolved statically,

　　// and therefore is inline expanded …

　　LibraryMaterial::print()；

　　os << "title:" << _title

　　<< "author" << _author << endl;

　　}

　　AudioBook类，派生于Book类，并加入附加信息，比如旁述，音频格式等等。这些东西都用它的print函数输出。再这之前，我们需要调用Book::print（）来显示前面的信息。

　　inline void

　　AudioBook::

　　print(ostream &os)

　　{

　　// ok, this is resolved statically,

　　// and therefore is inline expanded …

　　Book::print();

　　 os <<"narrator:"<< _narrator <<endl;

　　}

　　这和虚析构函数调用链的例子一样，都只是最初调用的虚函数没有被静态决议，其它的都被原地展开。This unnamed hierarchical design pattern is significantly less effective if we never declare a virtual function to be inline.

　　那么对于第二个原因中代码膨胀的问题呢？我们来分析一下，如果我们写下：

　　LibraryMaterial *p =

　　new AudioBook("Mason & Dixon",

　　"Thomas Pynchon", "Johnny Depp");

　　// …

　　p->print()；

　　这个print实例是内联的吗？不，当然不是。这样不得不通过虚拟机制在运行期决议。这让print实例放弃了对它的内联声明了吗？也不是。这个调用转换为下面的形式（伪代码）：

　　// Pseudo C++ Code

　　// Possible transformation of p->print（）

　　(*p->_vptr[ 2 ])(p);

　　where 2 represents the location of print within the associated virtual function table.因为调用print是通过函数指针_vptr[2]进行的，所以，编译器不能静态的决定这个调用地址，并且，这个函数也不能内联。

　　当然，虚函数print的内联实体（definition）也必须在某个地方表现出来。即是说，至少有一个函数实体是在virtual table调用的地址原地展开的。编译器是如何决定在何时展开这个函数实体呢？其中一个编译（implementaion）策略是当virtual table生成的同时，生成这个函数实体。这就是说对于每一个派生类的virtual table都会生成一个函数实体。

　　在一个可应用的类[b]中有多少vitrual table会被生成呢？呵呵，这是一个好问题。C++标准中对虚函数行为进行了规定，但是没有对函数实现进行规定。由于virtual table没有在C++标准中进行规定，很明显，究竟这个virtual table怎样生成，和究竟要生成多少个vitrual table也没有规定。多少个？当然，我们只要一个。Stroustrup的cfront编译器，很巧妙的处理了这些情况。（ Stan and Andy Koenig described the algorithm in the March 1990 C++ Report article, "Optimizing Virtual Tables in C++ Release 2.0."）

　　Moreover, the C++ Standard now requires that inline functions behave as though only one definition for an inline function exists in the program even though the function may be defined in different files。新的规则要求编译器只展开一个内联虚函数。如果一点被广泛采用的话，虚函数的内联导致的代码膨胀问题就会消失。

码农公寓

相关文章