You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- **CPU architecture**: Different processors have varying parallel processing capabilities
169
+
- **Available memory**: Limited RAM may require different optimization strategies
170
+
- **System load**: Other applications competing for resources affect DataFusion performance
171
+
172
+
**Recommendations for Production Use:**
173
+
174
+
To optimize DataFusion for your specific use case, it is strongly recommended to:
175
+
176
+
1. **Create custom benchmarks** using your actual data sources, formats, and query patterns
177
+
2. **Test with representative data volumes** that match your production workloads
178
+
3. **Measure end-to-end performance** including data loading, processing, and result handling
179
+
4. **Evaluate different configuration combinations** for your specific hardware and workload
180
+
5. **Monitor resource utilization** (CPU, memory, I/O) to identify bottlenecks in your environment
181
+
182
+
This approach will provide more accurate insights into how DataFusion configuration options
183
+
will impact your particular applications and infrastructure.
184
+
145
185
For more information about available :py:class:`~datafusion.context.SessionConfig` options, see the `rust DataFusion Configuration guide <https://arrow.apache.org/datafusion/user-guide/configs.html>`_,
146
186
and about :code:`RuntimeEnvBuilder` options in the rust `online API documentation <https://docs.rs/datafusion/latest/datafusion/execution/runtime_env/struct.RuntimeEnvBuilder.html>`_.
0 commit comments