c++如何使用std::async实现简单的并行for循环_c++任务并行基础_技术教程

使用std::async可通过拆分循环区间实现并行for，每个区间由异步任务处理；2. 示例中将数组分块并计算各块平方和，最后合并结果；3. 应根据std::thread::hardware_concurrency()控制任务数以避免资源耗尽；4. 优点为代码简洁、无需额外依赖，缺点是缺乏负载均衡且线程开销大；5. 需注意lambda捕获变量的有效性及调用fut.get()获取结果。

在C++中，std::async 提供了一种启动异步任务的简便方式。虽然它不像OpenMP那样直接支持并行for循环，但可以通过手动拆分循环迭代来模拟简单的并行for行为。这种方式适合学习任务并行的基础概念。

基本思路：将循环范围拆分为多个块

要使用 std::async 实现并行for循环，可以把一个大循环分成若干个子区间，每个子区间由一个异步任务处理。所有任务通过 std::async 启动，默认使用 std::launch::async 策略确保真正异步执行。

示例：并行计算数组元素的平方和

假设有一个包含10000个整数的数组，我们想并行计算它们的平方和：

#include 
#include 
#include 
#include 

long long parallel_for_square_sum(const std::vector& data, int num_tasks) {
    int chunk_size = data.size() / num_tasks;
    std::vector> futures;

    for (int i = 0; i < num_tasks; ++i) {
        int start = i * chunk_size;
        int end = (i == num_tasks - 1) ? data.size() : start + chunk_size;

        futures.push_back(std::async(std::launch::async, [&data, start, end]() {
            long long sum = 0;
            for (int j = start; j < end; ++j) {
                sum += static_cast(data[j]) * data[j];
            }
            return sum;
        }));
    }

    long long total = 0;
    for (auto& fut : futures) {
        total += fut.get(); // 等待完成并获取结果
    }

    return total;
}

控制并发数量避免资源耗尽

std::async 如果不加限制地创建大量任务，可能导致线程过多，系统调度开销变大甚至崩溃。建议根据硬件核心数控制任务数。

获取CPU核心数的方法：

```cpp int hardware_threads = std::thread::hardware_concurrency(); int num_tasks = std::min(hardware_threads > 0 ? hardware_threads : 4, static_cast(data.size())); ```

这样既能充分利用多核，又不会过度创建线程。