Tagged articles

OpenCL

14 articles · Page 1 of 1

Jan 14, 2024 · Industry Insights

Can Chinese GPUs Close the Gap with NVIDIA? 2023 GPGPU Landscape Analysis

2023 GPGPU research framework analysis reveals that while Chinese GPUs like BR100 and TianGai100 can match or exceed NVIDIA A100 in FP32, they still lag in FP64 and INT8 performance, and the domestic software ecosystem based on OpenCL trails far behind NVIDIA's CUDA, shaping short‑and‑term market dynamics.

AI computingCUDAChina

0 likes · 6 min read

Can Chinese GPUs Close the Gap with NVIDIA? 2023 GPGPU Landscape Analysis

Baidu Geek Talk

May 30, 2022 · Mobile Development

Advanced OpenCL Optimization Techniques for Qualcomm Adreno GPUs on Mobile Devices

The article presents advanced OpenCL optimization techniques for Qualcomm Adreno mobile GPUs, explaining the programming model, profiling methods, bottleneck identification, and kernel‑level strategies such as fast math, fp16, vectorized memory accesses, and hardware‑specific features to improve compute‑ and memory‑bound performance on Android devices.

AdrenoGPUMobile Computing

0 likes · 12 min read

Advanced OpenCL Optimization Techniques for Qualcomm Adreno GPUs on Mobile Devices

Baidu Geek Talk

May 18, 2022 · Mobile Development

Unlock Mobile GPU Power: A Hands‑On Guide to OpenCL Programming on Android

This article introduces the fundamentals of heterogeneous computing on mobile GPUs, explains OpenCL concepts and its programming model, and provides a step‑by‑step example of adding two arrays with complete OpenCL code for Android devices.

AndroidC#GPU computing

0 likes · 9 min read

Unlock Mobile GPU Power: A Hands‑On Guide to OpenCL Programming on Android

Baidu App Technology

Apr 1, 2022 · Fundamentals

Mastering Mobile OpenCL on Qualcomm Adreno: Architecture & Performance Tips

This article explains OpenCL fundamentals, the Qualcomm Adreno GPU architecture, compatibility considerations, and practical optimization techniques—including profiling, bottleneck identification, and CPU‑to‑GPU conversion tips—to help developers write high‑performance mobile OpenCL code.

AdrenoGPUMobile Computing

0 likes · 13 min read

Mastering Mobile OpenCL on Qualcomm Adreno: Architecture & Performance Tips

Baidu App Technology

Jan 24, 2022 · Mobile Development

Introduction to OpenCL Programming for Mobile GPU Computing

As mobile CPUs plateau, developers increasingly use OpenCL to harness Android GPUs like Qualcomm Adreno and Huawei Mali for heterogeneous computing, leveraging its platform, execution, and memory models to write portable kernels—illustrated by a simple array‑addition example that demonstrates device initialization, kernel creation, buffer management, and parallel execution.

AndroidC ProgrammingGPU computing

0 likes · 8 min read

Introduction to OpenCL Programming for Mobile GPU Computing

Architects' Tech Alliance

Aug 29, 2021 · Fundamentals

GPU Overview: History, Architecture, Processing Workflow, and Acceleration Technologies (CUDA & OpenCL)

This article provides a comprehensive overview of GPUs, covering their history, architecture, processing workflow, and acceleration technologies such as CUDA and OpenCL, while comparing GPU and CPU designs and offering resources for further study.

CUDAGPUOpenCL

0 likes · 14 min read

GPU Overview: History, Architecture, Processing Workflow, and Acceleration Technologies (CUDA & OpenCL)

DataFunTalk

Mar 25, 2021 · Artificial Intelligence

Optimizing MNN Mobile Neural Network Inference on GPU with OpenCL: Memory Objects, Work‑Group Tuning, and Auto‑Tuning

This article explains how the MNN deep‑learning framework leverages OpenCL to achieve high‑performance inference on mobile, PC and embedded GPUs by diversifying memory objects, aligning data, using local‑memory reductions, selecting optimal work‑group sizes, applying pre‑inference auto‑tuning, caching compiled programs, and providing practical GPU‑friendly model design guidelines.

GPU OptimizationMNNOpenCL

0 likes · 20 min read

Optimizing MNN Mobile Neural Network Inference on GPU with OpenCL: Memory Objects, Work‑Group Tuning, and Auto‑Tuning

Java Architect Essentials

Mar 15, 2021 · Industry Insights

Can Apple M1 Macs Mine Ethereum Effectively? A Hands‑On Test

This article documents a technical experiment that runs Ethereum mining software on an M1‑based MacBook Air, detailing the required code patches, build process, performance logs, and the resulting profit of roughly one Chinese yuan per day, while comparing the M1’s capabilities to traditional GPU miners.

Apple SiliconEthereumM1

0 likes · 9 min read

Can Apple M1 Macs Mine Ethereum Effectively? A Hands‑On Test

Architects' Tech Alliance

May 5, 2020 · Fundamentals

Why Heterogeneous Computing Is the Future: CPUs, GPUs, FPGAs, and More Explained

The article provides a comprehensive overview of heterogeneous computing, detailing its definition, real‑world system examples, performance advantages, key programming frameworks such as OpenCL and CUDA, industry trends like SOC integration, and a comparative analysis of CPUs, GPUs, FPGAs and ASICs.

CPUCUDAFPGA

0 likes · 9 min read

Why Heterogeneous Computing Is the Future: CPUs, GPUs, FPGAs, and More Explained

Tencent Music Tech Team

Apr 30, 2020 · Mobile Development

Edge Deep Learning Inference on Mobile Devices: Challenges, Hardware Diversity, and Optimization Strategies

Edge deep learning inference on mobile devices faces hardware and software fragmentation, diverse CPUs, GPUs, DSPs, and NPUs, and limited programmability; optimization techniques such as model selection, quantization, and architecture‑specific tuning enable real‑time performance, with most inference on CPUs, GPUs offering 5–10× speedups, and co‑processor support varying across Android and iOS.

DSPGPU programmingNPU

0 likes · 17 min read

Edge Deep Learning Inference on Mobile Devices: Challenges, Hardware Diversity, and Optimization Strategies

Architects' Tech Alliance

Oct 12, 2019 · Fundamentals

Understanding GPUs: History, Architecture, and Acceleration Technologies (CUDA & OpenCL)

This article explains the history, architecture, and operation of GPUs, and introduces major acceleration frameworks such as CUDA and OpenCL, highlighting their roles in parallel computing and modern graphics processing for scientific and AI workloads.

CUDAComputer ArchitectureGPU

0 likes · 13 min read

Understanding GPUs: History, Architecture, and Acceleration Technologies (CUDA & OpenCL)

Architects' Tech Alliance

Sep 5, 2019 · Fundamentals

GPU Origin, Architecture, and Acceleration Technologies (CUDA & OpenCL)

This article explains the history and origin of GPUs, compares CPU and GPU architectures, describes the GPU processing pipeline, and introduces acceleration technologies such as CUDA and OpenCL, highlighting their programming models, supported languages, and key performance metrics.

CUDAGPUGraphics Processing

0 likes · 14 min read

GPU Origin, Architecture, and Acceleration Technologies (CUDA & OpenCL)

Architects' Tech Alliance

Apr 21, 2019 · Fundamentals

Differences Between CPU and GPU Architectures and the Relationship Between OpenCL and CUDA

This article explains the fundamental architectural differences between CPUs and GPUs, their design goals and performance characteristics, and compares OpenCL and CUDA, highlighting OpenCL’s cross‑platform flexibility versus CUDA’s NVIDIA‑specific optimization, while illustrating how each fits various parallel computing tasks.

CPUCUDAGPU

0 likes · 7 min read

Differences Between CPU and GPU Architectures and the Relationship Between OpenCL and CUDA

Architects' Tech Alliance

Apr 18, 2019 · Fundamentals

What Powers Modern Graphics? A Deep Dive into GPU History and Architecture

This article traces the evolution of GPUs from early graphics chips to modern parallel processors, explains their internal pipeline, compares CPU and GPU architectures, and introduces key acceleration frameworks like CUDA and OpenCL for general‑purpose computing.

CUDAGPUGPU architecture

0 likes · 13 min read

What Powers Modern Graphics? A Deep Dive into GPU History and Architecture