Skip to content

Commit 4ac96b1

Browse files
stevewardmchammer01docs-botsophiethekingalejndr0
authored
GitHub Enterprise Server 3.18 release candidate (#56527)
Co-authored-by: mchammer01 <42146119+mchammer01@users.noreply.github.com> Co-authored-by: docs-bot <77750099+docs-bot@users.noreply.github.com> Co-authored-by: Sophie <29382425+sophietheking@users.noreply.github.com> Co-authored-by: alejndr0 <alejndr0@github.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Felicity Chapman <felicitymay@github.com>
1 parent bb6cc4f commit 4ac96b1

File tree

66 files changed

+991606
-259
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

66 files changed

+991606
-259
lines changed

content/admin/all-releases.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -58,6 +58,7 @@ If you run analysis in an external CI system, we recommend using the same versio
5858

5959
| {% data variables.product.prodname_ghe_server %} version | Recommended {% data variables.product.prodname_codeql_cli %} version |
6060
| ------------------------------------------------- | ---------------------- |
61+
| 3.18 | 2.21.4 ([changelog](https://codeql.github.com/docs/codeql-overview/codeql-changelog/codeql-cli-2.21.4/)) |
6162
| 3.17 | 2.20.7 ([changelog](https://codeql.github.com/docs/codeql-overview/codeql-changelog/codeql-cli-2.20.7/)) |
6263
| 3.16 | 2.20.3 ([changelog](https://codeql.github.com/docs/codeql-overview/codeql-changelog/codeql-cli-2.20.3/)) |
6364
| 3.15 | 2.18.4 ([changelog](https://codeql.github.com/docs/codeql-overview/codeql-changelog/codeql-cli-2.18.4/)) |
@@ -77,6 +78,7 @@ For instances with {% data variables.product.prodname_actions %} enabled, self-h
7778

7879
| {% data variables.product.prodname_ghe_server %} version | Minimum Runner version |
7980
| ------------------------------------------------- | ---------------------- |
81+
| 3.18 | 2.324.0 ([release notes](https://github.com/actions/runner/releases/tag/v2.324.0)) |
8082
| 3.17 | 2.322.0 ([release notes](https://github.com/actions/runner/releases/tag/v2.322.0)) |
8183
| 3.16 | 2.321.0 ([release notes](https://github.com/actions/runner/releases/tag/v2.321.0)) |
8284
| 3.15 | 2.319.1 ([release notes](https://github.com/actions/runner/releases/tag/v2.319.1)) |

content/admin/guides.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -70,7 +70,7 @@ includeGuides:
7070
- /admin/upgrading-your-instance/preparing-to-upgrade/enabling-automatic-update-checks
7171
- /admin/monitoring-and-managing-your-instance/configuring-high-availability/initiating-a-failover-to-your-replica-appliance
7272
- /admin/monitoring-and-managing-your-instance/monitoring-your-instance/recommended-alert-thresholds
73-
- /admin/monitoring-and-managing-your-instance/monitoring-your-instance/setting-up-external-monitoring
73+
- /admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/setting-up-external-monitoring-with-collectd
7474
- /admin/upgrading-your-instance/preparing-to-upgrade/overview-of-the-upgrade-process
7575
- /admin/upgrading-your-instance/preparing-to-upgrade/upgrade-requirements
7676
- /admin/upgrading-your-instance/preparing-to-upgrade/taking-a-snapshot
Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
---
2+
title: About monitoring your instance
3+
intro: 'You can configure monitoring and alerting so that you''re aware of potential issues before they negatively impact application performance or availability.'
4+
versions:
5+
ghes: '*'
6+
topics:
7+
- Enterprise
8+
shortTitle: About monitoring
9+
---
10+
11+
## Metrics
12+
13+
{% ifversion ghes-opentelemetry %}OpenTelemetry metrics were introduced in {% data variables.product.prodname_ghe_server %} 3.18 and are the future foundation for monitoring your {% data variables.product.prodname_ghe_server %} instance. {% endif %}
14+
Collectd metrics {% ifversion ghes-opentelemetry %}will continue to be gathered by default, but we expect collectd metrics to be discontinued in a future release.{% else %}provide monitoring capabilities for your {% data variables.product.prodname_ghe_server %} instance.{% endif %}
15+
16+
{% ifversion ghes-opentelemetry %}For more information about OpenTelemetry metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/opentelemetry-metrics/about-opentelemetry-metrics).{% endif %}
17+
18+
For more information about Collectd metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/about-collectd-metrics).
19+
20+
## Dashboards
21+
22+
The {% data variables.enterprise.management_console %} provides built-in dashboards that visualize metrics to help you troubleshoot performance issues and better understand how your {% data variables.product.prodname_ghe_server %} appliance is being used. The data behind the graphs is gathered by the monitoring services and sampled regularly.
23+
24+
For more information about the monitor dashboards, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/about-the-monitor-dashboards).
25+
26+
## System Logs
27+
28+
{% data variables.product.prodname_ghe_server %} maintains system logs that can be used to monitor system events and troubleshoot issues. These logs provide detailed information about various system processes and can be valuable for debugging and monitoring purposes.
29+
30+
For more information about system logs, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/about-system-logs).
31+
32+
## Troubleshooting resource allocation problems
33+
34+
Resource allocation problems can impact the performance and availability of your {% data variables.product.prodname_ghe_server %} instance. Understanding how to identify and resolve these issues is crucial for maintaining optimal system performance.
35+
36+
For more information about troubleshooting resource allocation problems, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/troubleshooting-resource-allocation-problems).
37+
38+
## Generating a Health Check for your enterprise
39+
40+
A health check provides a comprehensive overview of your {% data variables.product.prodname_ghe_server %} instance's current state and can help identify potential issues before they become critical.
41+
42+
For more information about generating a health check, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/generating-a-health-check-for-your-enterprise).

content/admin/monitoring-and-managing-your-instance/monitoring-your-instance/about-the-monitor-dashboards.md

Lines changed: 26 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -19,8 +19,9 @@ topics:
1919
- Infrastructure
2020
- Monitoring
2121
- Performance
22-
shortTitle: About the monitor {% ifversion ghes > 3.15 %}dashboards{% else %}dashboard{% endif %}
22+
shortTitle: About monitor {% ifversion ghes > 3.15 %}dashboards{% else %}dashboard{% endif %}
2323
---
24+
2425
## Accessing the monitor {% ifversion ghes > 3.15 %}dashboards{% else %}dashboard{% endif %}
2526

2627
{% data reusables.enterprise_site_admin_settings.access-settings %}
@@ -34,12 +35,18 @@ shortTitle: About the monitor {% ifversion ghes > 3.15 %}dashboards{% else %}das
3435

3536
## Using the monitor dashboards
3637

37-
The dashboards visualize metrics which can be useful for troubleshooting performance issues and better understanding how your {% data variables.product.prodname_ghe_server %} appliance is being used. The data behind the graphs is gathered by the `collectd` service and sampled every 10 seconds.
38+
The dashboards visualize metrics which can be useful for troubleshooting performance issues and better understanding how your {% data variables.product.prodname_ghe_server %} appliance is being used. The data behind the graphs is gathered by the monitoring services and sampled regularly.
3839

3940
Within the pre-built dashboards you can find various sections grouping graphs of different types of system resources. Use the links on the page to navigate between the dashboards.
4041

4142
![Screenshot of the {% data variables.enterprise.management_console %} header. The dashboard navigation links provided at the top right are highlighted in orange.](/assets/images/enterprise/management-console/monitor-dash-navigation.png)
4243

44+
{% ifversion ghes-opentelemetry %}
45+
46+
If you have enabled metrics for OpenTelemetry, then additional dashboards are available, see: [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/opentelemetry-metrics/enable-advanced-dashboards).
47+
48+
{% endif %}
49+
4350
### "Operational Health" dashboard
4451

4552
This is the default dashboard displayed on the "Monitor" page. It visualizes key metrics that help you to get a quick overview of the health of your {% data variables.product.prodname_ghe_server %} appliance.
@@ -50,21 +57,35 @@ On this more detailed dashboard you can get further insights into all aspects of
5057

5158
## Creating new dashboards
5259

53-
Building your own dashboard and alerts requires the data to be forwarded to an external instance, by enabling `collectd` forwarding. For more information, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/configuring-collectd-for-your-instance).
60+
Building your own dashboard and alerts requires the data to be forwarded to an external instance. {% ifversion ghes-opentelemetry %}You can use OpenTelemetry and collectd metrics concurrently for external monitoring. During the {% data variables.release-phases.public_preview %}, collectd remains enabled and cannot be disabled.{% else %}You can use collectd metrics for external monitoring.{% endif %}
61+
62+
{% ifversion ghes-opentelemetry %}
63+
For more information about OpenTelemetry metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/opentelemetry-metrics/about-opentelemetry-metrics).
64+
{% endif %}
65+
66+
For more information about collectd metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/about-collectd-metrics).
5467

5568
{% else %}
5669

5770
## Using the monitor dashboard
5871

59-
The page visualizes metrics which can be useful for troubleshooting performance issues and better understanding how your {% data variables.product.prodname_ghe_server %} appliance is being used. The data behind the graphs is gathered by the `collectd` service and sampled every 10 seconds.
72+
The page visualizes metrics which can be useful for troubleshooting performance issues and better understanding how your {% data variables.product.prodname_ghe_server %} appliance is being used. The data behind the graphs is gathered by the monitoring services and sampled regularly.
6073

6174
Within the pre-built dashboard you can find various sections grouping graphs of different types of system resources.
6275

63-
Building your own dashboard and alerts requires the data to be forwarded to an external instance, by enabling `collectd` forwarding. For more information, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/configuring-collectd-for-your-instance).
76+
Building your own dashboard and alerts requires the data to be forwarded to an external instance. You can use collectd forwarding for external monitoring.
77+
78+
{% ifversion ghes-opentelemetry %}
79+
For more information about OpenTelemetry metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/opentelemetry-metrics/about-opentelemetry-metrics).
80+
{% endif %}
81+
82+
For more information about collectd metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/about-collectd-metrics).
6483
{% endif %}
6584

6685
## About the metrics on the monitor dashboards
6786

87+
The monitor dashboards provide a comprehensive view of various metrics that help in understanding the performance and health of your {% data variables.product.prodname_ghe_server %} instance. These metrics are categorized into different sections to make it easier to analyze specific aspects of the system.
88+
6889
### System Health
6990

7091
The system health graphs provide a general overview of services and system resource utilization. The CPU, memory, and load average graphs are useful for identifying trends or times where provisioned resource saturation has occurred. For more information, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/recommended-alert-thresholds).
Lines changed: 83 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,83 @@
1+
---
2+
title: About Collectd metrics
3+
intro: '{% ifversion ghes-opentelemetry %}Collectd metrics provide a legacy monitoring solution for {% data variables.product.prodname_ghe_server %} instances that is supported alongside a {% data variables.release-phases.public_preview %} of OpenTelemetry metrics.{% else %}Collectd metrics provide a monitoring solution for {% data variables.product.prodname_ghe_server %} instances.{% endif %}'
4+
versions:
5+
ghes: '*'
6+
topics:
7+
- Enterprise
8+
- Fundamentals
9+
- Infrastructure
10+
- Monitoring
11+
- Performance
12+
type: overview
13+
shortTitle: About Collectd metrics
14+
---
15+
16+
{% data reusables.enterprise.collectd-metrics-future %}
17+
18+
## About collectd metrics
19+
20+
Collectd is a daemon that collects system performance statistics periodically and stores them in a variety of ways. For {% data variables.product.prodname_ghe_server %}, collectd gathers metrics from various system components and services, providing insight into system health and performance.
21+
22+
### Key components
23+
24+
The collectd monitoring stack includes the following components:
25+
26+
* **Collectd**: The main daemon that collects system performance statistics
27+
* **Graphite**: Serves as the datasource for dashboard visualizations
28+
29+
### Metrics collection
30+
31+
Collectd gathers metrics from various sources including:
32+
33+
* **System metrics**: CPU usage, memory utilization, disk I/O, network statistics
34+
* **Application metrics**: HAProxy statistics, Resque queue metrics, database performance
35+
* **Custom metrics**: Service-specific metrics through custom plugins and scripts
36+
37+
## Architecture
38+
39+
### Single appliance
40+
41+
In a single appliance deployment, collectd runs locally and stores metrics in RRD (Round Robin Database) files. The {% data variables.enterprise.management_console %} reads these files to display monitoring dashboards.
42+
43+
### Cluster environment
44+
45+
In cluster environments, collectd operates in a distributed manner:
46+
47+
* **Metrics servers**: Designated nodes that collect and store metrics from all cluster nodes
48+
* **Metrics clients**: All other nodes that forward their metrics to the metrics servers
49+
* **Redundancy**: Metrics are duplicated across multiple metrics servers for failover support
50+
51+
## Configuring collectd metrics
52+
53+
Collectd metrics are enabled by default on {% data variables.product.prodname_ghe_server %} instances.
54+
55+
## External monitoring with collectd
56+
57+
You can set up external monitoring systems to collect and analyze collectd metrics from your {% data variables.product.prodname_ghe_server %} instance. This enables integration with existing monitoring infrastructure and provides additional visualization and alerting capabilities.
58+
59+
For more information about setting up external monitoring, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/setting-up-external-monitoring-with-collectd).
60+
61+
## Collectd metrics reference
62+
63+
{% data variables.product.prodname_ghe_server %} collects various metrics through collectd, covering system resources, application performance, and service health. Understanding these metrics is essential for effective monitoring and troubleshooting.
64+
65+
For a comprehensive list of available metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/collectd-metrics-for-github-enterprise-server).
66+
67+
{% ifversion ghes-opentelemetry %}
68+
69+
## Migration considerations
70+
71+
As {% data variables.product.prodname_ghe_server %} transitions to OpenTelemetry metrics, consider the following:
72+
73+
* **Coexistence**: Both collectd and OpenTelemetry metrics can run simultaneously during the transition period
74+
* **Feature parity**: OpenTelemetry metrics provide equivalent and enhanced monitoring capabilities
75+
* **Planning**: Begin evaluating OpenTelemetry metrics for your monitoring workflows
76+
* **Timeline**: Plan for the eventual {% data variables.release-phases.closing_down %} and then {% data variables.release-phases.retired %} of collectd metrics in future releases
77+
78+
{% endif %}
79+
80+
## Next steps
81+
82+
* To set up external monitoring, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/setting-up-external-monitoring-with-collectd).
83+
* To explore available metrics, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/collectd-metrics/collectd-metrics-for-github-enterprise-server).
Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,9 +15,11 @@ redirect_from:
1515
- /admin/monitoring-managing-and-updating-your-instance/monitoring-your-instance/collectd-metrics-for-github-enterprise-server
1616
---
1717

18+
{% data reusables.enterprise.collectd-metrics-future %}
19+
1820
## About `collectd` metrics
1921

20-
By default, `collectd` on {% data variables.location.product_location %} gathers metrics related to the instance's performance. For more information, see [AUTOTITLE](/admin/monitoring-and-managing-your-instance/monitoring-your-instance/configuring-collectd-for-your-instance).
22+
By default, `collectd` on {% data variables.location.product_location %} gathers metrics related to the instance's performance.
2123

2224
You can learn more about the type of data that `collectd` gathers, and you can download a CSV file that contains a full list of metrics.
2325

Lines changed: 12 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -11,8 +11,12 @@ topics:
1111
- Infrastructure
1212
- Monitoring
1313
- Performance
14+
redirect_from:
15+
- /admin/monitoring-and-managing-your-instance/monitoring-your-instance/exporting-and-scraping-prometheus-metrics
1416
---
1517

18+
{% data reusables.enterprise.collectd-metrics-future %}
19+
1620
{% data variables.product.prodname_ghe_server %} includes a Prometheus metrics exporter to simplify observability and monitoring. This feature allows metrics to be scraped directly from the appliance, eliminating the need to forward `collectd` metrics to an external `collectd` server for transformation before sending them to a Prometheus system. By supporting Prometheus-compatible metrics natively, the exporter enables seamless integration with external monitoring dashboards and alerting systems.
1721

1822
## Enabling Prometheus metrics exporter and scraping metrics
@@ -35,8 +39,10 @@ To enable the Prometheus metrics exporter, SSH into the {% data variables.produc
3539

3640
```shell
3741
ghe-config app.prometheus.enabled true
38-
# Specify a comma-separated list of trusted IPv4/IPv6 addresses or CIDR blocks.
39-
ghe-config app.prometheus.trusted-ips "10.0.0.1, 192.168.1.0/8"
42+
# Specify a comma-separated list of
43+
# trusted IPv4/IPv6 addresses or CIDR blocks.
44+
ghe-config app.prometheus.trusted-ips \
45+
"10.0.0.1, 192.168.1.0/8"
4046
ghe-config-apply
4147
```
4248

@@ -45,7 +51,10 @@ ghe-config-apply
4551
You can enable the Prometheus metrics exporter using the API by sending a `PUT` request to update the configuration settings.
4652

4753
```shell
48-
curl -L -H "Content-Type: application/json" -X PUT -u "api_key:xxxxxxx" https://<hostname>/manage/v1/config/settings -d '{"prometheus": {"enabled": true, "trusted_ips": "10.0.0.1, 192.168.1.0/8"}}'
54+
curl -L -H "Content-Type: application/json" \
55+
-X PUT -u "api_key:xxxxxxx" \
56+
https://<hostname>/manage/v1/config/settings \
57+
-d '{"prometheus": {"enabled": true, "trusted_ips": "10.0.0.1, 192.168.1.0/8"}}'
4958
```
5059

5160
## Verifying the Prometheus metrics exporter is enabled
Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,16 @@
1+
---
2+
title: Collectd metrics
3+
intro: 'You can configure monitoring and alerting so that you''re aware of potential issues before they negatively impact application performance or availability.'
4+
versions:
5+
ghes: '*'
6+
topics:
7+
- Enterprise
8+
children:
9+
- /about-collectd-metrics
10+
- /setting-up-external-monitoring-with-collectd
11+
- /collectd-metrics-for-github-enterprise-server
12+
- /exporting-and-scraping-prometheus-metrics
13+
shortTitle: Collectd metrics
14+
---
15+
16+
{% data reusables.enterprise.collectd-metrics-future %}

0 commit comments

Comments
 (0)