我在学习 I/O 完成端口 (IOCP) 时偶然发现了类似的障碍...
我认为问题在于,在 IOCP 套接字模型方案中,最复杂的部分是“套接字接受”的开始阶段。这就是为什么大多数教程跳过它并开始讨论如何处理发送/接收。
如果您想充分了解 IOCP 以便您可以实施生产软件,那么我对您的建议是学习它,直到您完全掌握它(下面的这个答案还不够)。我推荐的一个文档是“Microsoft Windows 网络编程 - 第 2 版”的第 5 章。这本书可能很旧,但对 IOCP 有效。此外,文章“Windows via C/C++: Synchronous and Asynchronous Device I/O”涉及到 IOCP 的某些方面,但信息不足以做生产软件。
我会尽力解释,但是,我必须警告您,这可能还不够。就这样吧……
因此,您缺少的部分是“如何在 IOCP 套接字模型中进行‘套接字接受’”。
首先,让我们检查一下服务器上典型的 Winsock(非 IOCP)调用序列;
// (1) Create listen socket on server.
WSASocket()
// (2) Bind an address to your listen socket.
bind()
// (3) Associate the listen socket with an event object on FD_ACCEPT event.
WSAEventSelect(,, FD_ACCEPT )
// (4) Put socket in listen state - now, Windows listening for new
// connection requests. If new requests comes, the associated
// event object will be set.
listen()
// (5) Wait on the event object associated on listen socket. This
// will get signaled when a new connection request comes.
WaitForSingleObject() {
// (6) A network activity has occurred. Verify that FD_ACCEPT has
// raised the event object. This also resets the event object
// so WaitForSingleObject() does not loop non-stop.
WSAEnumNetworkEvents()
// (7) Understanding this part is important. The WSAAccept() doesn't
// just accept connection, it first creates a new socket and
// then associates it with the newly accepted connection.
WSAAccept()
}
步骤 (7) 适用于非基于 IOCP 的模型。但是,从性能的角度来看 - 套接字创建是昂贵的。而且它会减慢连接接受过程。
在 IOCP 模型中,为新的传入连接请求提前创建了套接字。不仅套接字是预先创建的,它们甚至在连接请求到来之前就与侦听套接字相关联。为此,Microsoft 提供了扩展功能。 IOCP 模型所需的两个这样的函数是 AcceptEx() 和 GetAcceptExSockaddrs()。
注意:使用这些扩展函数时,需要在运行时加载它们以避免性能损失。这可以使用 WSAIoctl() 来实现。如需进一步阅读,请参阅有关 AcceptEx() 的 MSDN 文档。
警告:AcceptEx() 可用于设置新套接字以接收一些数据作为连接接受过程的一部分。需要禁用此功能,因为它使应用程序容易受到 DoS 攻击,即发出连接请求但不发送数据。接收应用程序将无限期地等待该套接字。为避免这种情况,只需为其 'dwReceiveDataLength' 参数传递 0 值。
如何为 IOCP 模型设置连接接受代码?
一种方法是:
// (1) Create IO completion port
CreateIoCompletionPort()
// (2) Have a method that creates worker threads say 'CreateWorkerThreads()'.
// This assign same method (say WorkerThread_Func()) to all worker threads.
// In the WorkerThread_Func() threads are blocked on call to
// GetQueuedCompletionStatus().
CreateWorkerThreads()
// (3) Create listen socket.
WSASocket()
// (4) Associate listen socket to IO Completion Port created earlier.
CreateIoCompletionPort()
// (5) Bind an address to your listen socket.
bind()
// (6) Put socket in listen state - now, Windows listening for new
// connection requests. If a new request comes, GetQueuedCompletionStatus()
// will release a thread.
listen()
// (7) Create sockets in advance and call AcceptEx on each of
// these sockets. If a new connection requests comes
// Windows will pick one of these sockets and associate the
// connection with it.
//
// As an example, below loop will create 1000 sockets.
GUID GuidAcceptEx = WSAID_ACCEPTEX;
DWORD dwBytes;
LPFN_ACCEPTEX lpfnAcceptEx;
// First, load extension method.
int retCode = WSAIoctl(listenSocket,
SIO_GET_EXTENSION_FUNCTION_POINTER,
&GuidAcceptEx,
sizeof(GuidAcceptEx),
&lpfnAcceptEx,
sizeof(lpfnAcceptEx),
&dwBytes,
NULL,
NULL
);
for( /* loop for 1000 times */ ) {
SOCKET preemptiveSocket = WSASocket(, , , , , WSA_FLAG_OVERLAPPED);
lpfnAcceptEx(listenSocket, preemptiveSocket,,,,,,);
}
这基本上让您的应用程序准备好以 IOCP 方式接受套接字。当一个新的连接请求到来时,正在等待 GetQueuedCompletionStatus() 的工作线程之一将被释放并移交给 数据结构 的指针。这将具有由 lpfnAcceptEx() 创建的套接字。
过程是否完整?还没有。通过 AcceptEx() 调用接受的套接字不继承 listenSocket 的属性。为此,您需要致电;
setsockopt( acceptSocket, SOL_SOCKET, SO_UPDATE_ACCEPT_CONTEXT,
(char*)&listenSocket, sizeof(listenSocket) );
现在,acceptSocket 可以很好地用于 WSASend / WSARecv!
有些东西不见了!我跳过了关于工作线程如何从 GetQueuedCompletionStatus() 获取接受套接字的部分?
答案是,通过将您特制的结构传递给 lpfnAcceptEx()。当 GetQueuedCompletionStatus() 返回时,它将具有包含您将传递的套接字的数据结构。
如何制作这样的结构?通过创建一个将“WSAOVERLAPPED”作为其第一个成员的结构。在第一个成员之后,您可以拥有自己的任何成员。例如,我的结构看起来像;
typedef struct _WSAOVERLAPPEDPLUS
{
WSAOVERLAPPED ProviderOverlapped; // 'WSAOVERLAPPED' has to be the first member.
SOCKET client; // Use this to pass preemptive socket.
SOCKET listenSocket; // Use this to pass the listenSocket.
DWORD dwBytes;
SOCKET_OPERATION operation; // Enum to assist in knowing what socket operation ...
} WSAOVERLAPPEDPLUS, *LPWSAOVERLAPPEDPLUS;
...
typedef enum SOCKET_OPERATION {
UNINITIALIZED_ENUM, // To protect against memory leaks and uninitialized buffers.
OP_ACCEPTEX,
OP_RECEIVE,
OP_SEND
};
...
//
// So the previously mentioned for() loop will become;
//
for( /* loop for 1000 times */ ) {
SOCKET preemptiveSocket = WSASocket(, , , , , WSA_FLAG_OVERLAPPED);
LPWSAOVERLAPPEDPLUS pOl = new WSAOVERLAPPEDPLUS();
// Initialize our "extended" overlapped structure
memset(pOl, 0, sizeof(WSAOVERLAPPEDPLUS));
pOl->operation = OP_ACCEPTEX;
pOl->client = preemptiveSocket;
pOl->listenSocket = listenSocket;
int buflen = (sizeof(SOCKADDR_IN) + 16) * 2;
char* pBuf = new char[buflen];
memset(pBuf, 0, buflen);
m_lpfnAcceptEx(listenSocket,
preemptiveSocket,
pBuf,
0, // Passed 0 to avoid reading data on accept which in turn
// avoids DDoS attack i.e., connection attempt without data will
// cause AcceptEx to wait indefinitely.
sizeof(SOCKADDR_IN) + 16,
sizeof(SOCKADDR_IN) + 16,
&pOl->dwBytes,
&pOl->ProviderOverlapped
);
}
...在工作线程中GetQueuedCompletionStatus() 返回时;
while (TRUE)
{
bOk = ::GetQueuedCompletionStatus(hCompPort, &bytes_transferred, &completion_key, &pOverlapped, INFINITE);
if (bOk) {
// Process a successfully completed I/O request
if (completion_key == 0) {
// Safe way to extract the customized structure from pointer
// is to use 'CONTAINING_RECORD'. Read more on 'CONTAINING_RECORD'.
WSAOVERLAPPEDPLUS *pOl = CONTAINING_RECORD(pOverlapped, WSAOVERLAPPEDPLUS, ProviderOverlapped);
if (pOl->operation == OP_ACCEPTEX) {
// Before doing any WSASend/WSARecv, inherit the
// listen socket properties by calling 'setsockopt()'
// as explained earlier.
// The listenSocket and the preemptive socket are available
// in the 'pOl->listenSocket' & 'pOl->client', respectively.
}
delete pOl;
}
}
else {
// Handle error ...
}
我希望这能让您了解 AcceptEx() 如何与 IOCP 一起使用。