Apache Project Website Checks

Checking Project Websites for required and disallowed content

This script periodically crawls all Apache project and podling websites to check them for a few specific links or text blocks that all projects are expected to have. The checks include verifying that all required links appear on a project homepage, along with an "image" check if project logo files are in apache.org/img

The script also checks for 3rd party resource references that might be in conflict with our privacy policy.

View the crawler code, website display code, validation checks details, and raw JSON data.
Last crawl time: Mon, 16 Dec 2024 06:14:20 GMT over 217 websites.

Site Check - All Project Results

  • Data key:
  • # Sites with links to primary ASF page
  • # Sites with link, but not an expected ASF one
  • # Sites with no link for this topic
  • Click column badges to sort
Project Uri
216 1 0
Foundation
184 3 30
Events
134 0 83
License
145 7 65
Thanks
169 3 45
Security
171 7 39
Sponsorship
177 7 33
Trademarks
189 4 24
Copyright
202 1 14
Privacy
98 27 92
Resources
114 103 0
Image
200 0 17
Accumulo
ActiveMQ
AGE
Airavata
Airflow
Allura
Ambari
Ant
APISIX
Portable Runtime (APR)
Aries
Arrow
AsterixDB
Atlas
Attic
Avro
Axis
Beam
Bigtop
BookKeeper
Brand Management
Brooklyn
bRPC
BuildStream
BVal
Calcite
Camel
CarbonData
Cassandra
Causeway
Cayenne
Celeborn
Celix
CloudStack
Cocoon
Community Development
Commons
Conferences
Cordova
CouchDB
Creadur
cTAKES
Curator
CXF
Daffodil
DataFu
DataFusion
Data Privacy
DataSketches
DB
DeltaSpike
Directory
Diversity and Inclusion
DolphinScheduler
Doris
Drill
Druid
Dubbo
ECharts
Empire-db
EventMesh
Felix
Fineract
Flagon
Flex
Flink
Fluo
FreeMarker
Fundraising
Geode
Geronimo
Gobblin
Gora
Griffin
Groovy
Guacamole
Gump
Hadoop
HBase
Helix
Hive
Hop
HttpComponents
HTTP Server
Hudi
Iceberg
Ignite
Impala
Incubator
InLong
IoTDB
Jackrabbit
James
jclouds
Jena
JMeter
Johnzon
JSPWiki
Juneau
Kafka
Karaf
Kibble
Knox
Kudu
Kvrocks
Kylin
Kyuubi
Legal Affairs
Libcloud
Linkis
Logging Services
Lucene
Lucene.Net
MADlib
Mahout
ManifoldCF
Marketing and Publicity
Maven
Mesos
MINA
Mnemonic
MyFaces
Mynewt
NetBeans
NiFi
Nutch
NuttX
OFBiz
Olingo
Oozie
OpenDAL
OpenJPA
OpenMeetings
OpenNLP
OpenOffice
OpenWebBeans
OpenWhisk
ORC
Ozone
Paimon
Parquet
PDFBox
Pekko
Perl
Petri
Phoenix
Pig
Pinot
Pivot
PLC4X
POI
Portals
Public Affairs
Pulsar
Qpid
Ranger
Ratis
RocketMQ
Roller
Royale
Rya
Samza
Santuario
SDAP
SeaTunnel
Security Team
Sedona
Serf
ServiceComb
ServiceMix
ShardingSphere
ShenYu
Shiro
SINGA
SIS
SkyWalking
Sling
Solr
SpamAssassin
Spark
Steve
Storm
StreamPipes
Struts
Subversion
Superset
Synapse
Syncope
SystemDS
Travel Assistance
Tapestry
Tcl
Tez
Thrift
Tika
TinkerPop
Tomcat
TomEE
Traffic Control
Traffic Server
TsFile
Turbine
TVM
UIMA
Unomi
VCL
Velocity
Whimsy
Wicket
Web Services
Xalan
Xerces
XML Graphics
Yetus
YuniKorn
Zeppelin
ZooKeeper