Minimally integrate per-page HTML content into each "index.html" file #1383


Open

d-ronnqvist wants to merge 2 commits into swiftlang:main from d-ronnqvist:output-html-integrate-1

Contributor

d-ronnqvist commented Dec 5, 2025 •


Bug/issue #, if applicable: rdar://163326857

Summary

This is the first (of probably two) integration slices of #1366.

This PR adds a new HTMLRenderer that can render articles and symbols into HTMLRenderer/RenderedPageInfo.

To access information about resolved pages, resolved assets, etc., it uses a private ContextLinkProvider that conforms to the LinkProvider protocol in DocCHTML.

This PR also adds a new HTMLContentConsumer protocol and a concrete FileWritingHTMLContentConsumer implementation that embeds the per-page HTML content inside of the <noscript> tag of the index.html file that DocC would normally make an exact copy of for each page.
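As a rough illustration of the embedding described above, the following is a minimal sketch, not the PR's implementation. `NoscriptTemplate` and `page(embedding:)` are hypothetical names, and the sketch assumes the template contains a plain `<noscript>` ... `</noscript>` pair without attributes. It splits the template once and then concatenates per page.

```swift
import Foundation

// Hypothetical stand-in for the PR's template type; the real code is not shown in this PR page.
struct NoscriptTemplate {
    let prefix: String   // everything up to and including "<noscript>"
    let suffix: String   // "</noscript>" and everything after it

    // Returns nil when the template has no "<noscript>...</noscript>" pair.
    init?(_ html: String) {
        guard let open = html.range(of: "<noscript>"),
              let close = html.range(of: "</noscript>", range: open.upperBound..<html.endIndex)
        else { return nil }
        prefix = String(html[..<open.upperBound])
        suffix = String(html[close.lowerBound...])
    }

    // Builds one page by placing the per-page content between the two halves.
    // Whatever the template originally had inside <noscript> is dropped.
    func page(embedding content: String) -> String {
        prefix + content + suffix
    }
}
```

Splitting once up front means each page costs only a string concatenation, which matters when the same template is reused for every page.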

The main integration of these new types happens in the ConvertActionConsumer. If it is passed an HTMLContentConsumer, it will create an HTMLRenderer for each DocumentationNode that it processes. This means that the output is both a JSON file with content and an HTML file with content.

Notably missing from this PR are:

A user-facing CLI feature flag so that developers can enable this feature
An entry in features.json so that tools can know if DocC supports the new flag/feature
Most of the per-page content. This is waiting on this change and a number of smaller HTML rendering changes to land first:

Dependencies

None.

Testing

Nothing in particular for this PR. It intentionally lacks the CLI feature flag that would allow this to be used in docc convert. See #1366 for how it eventually does get used.

Checklist

Make sure you check off the following items. If they cannot be completed, provide a reason.

Added tests
Ran the ./bin/test script and it succeeded
Updated documentation if necessary


          Minimally integrate per-page HTML content into each "index.html" file

508dba7

rdar://163326857

Contributor Author

d-ronnqvist commented Dec 5, 2025

@swift-ci please test


          Merge branch 'main' into output-html-integrate-1

3a09af6

Contributor Author

d-ronnqvist commented Dec 9, 2025

@swift-ci please test

patshaughnessy reviewed

Contributor

patshaughnessy left a comment

Does this solution consider the --no-transform-for-static-hosting (no index.html files at all) and --experimental-enable-custom-templates CLI options? Does the custom template option allow DocC users to change the contents of index.html? That might cause trouble in FileWritingHTMLContentConsumer.

Otherwise looks great!

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                  struct RenderedPageInfo {
+                      /// The HTML content of the page as an XMLNode hierarchy.
+                      ///
+                      /// The string representation of those node hierarchy is intended to be inserted _somewhere_ inside the `<body>` HTML element.

Contributor

patshaughnessy Dec 10, 2025

Suggested change

-                      /// The string representation of those node hierarchy is intended to be inserted _somewhere_ inside the `<body>` HTML element.
+                      /// The string representation of this node hierarchy is intended to be inserted _somewhere_ inside the `<body>` HTML element.

Also, should we split this doc comment up into an abstract and a note, or overview?

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                          return nil
+                      }
+                      let names: LinkedElement.Names

Contributor

patshaughnessy Dec 10, 2025

We could extract this passage about determining the names of the page into a separate function.

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                              // This symbol has multiple unique names
+                              let titles = [SourceLanguage: String](
+                                  titles.map { trait, title in
+                                      (trait.sourceLanguage ?? .swift, title)

Contributor

patshaughnessy Dec 10, 2025

Strictly speaking, should we ignore traits that are not about source language? (I'm not sure if any such traits exist in practice.) I believe in a similar loop elsewhere you have a guard on the source language and a compactMap.

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                      // A helper function that transforms SymbolKit fragments into renderable identifier/decorator fragments
+                      func convert(_ fragments: [SymbolGraph.Symbol.DeclarationFragments.Fragment]) -> [LinkedElement.SymbolNameFragment] {
+                          func convert(kind: SymbolGraph.Symbol.DeclarationFragments.Fragment.Kind) -> LinkedElement.SymbolNameFragment.Kind {

Contributor

patshaughnessy Dec 10, 2025

Having two nested functions with the same name is a bit confusing. Could the inner function be an extension on the SymbolGraph.Symbol.DeclarationFragments.Fragment.Kind type?
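A sketch of what that suggestion could look like, using hypothetical stand-in enums rather than the real SymbolKit and DocCHTML types. The case list and the mapping below are assumptions for illustration only; the PR's actual mapping is not visible in this excerpt.

```swift
// Hypothetical stand-in for SymbolGraph.Symbol.DeclarationFragments.Fragment.Kind.
enum FragmentKind {
    case identifier, externalParameter, keyword, text
}

// Hypothetical stand-in for LinkedElement.SymbolNameFragment.Kind.
enum NameFragmentKind {
    case identifier, decorator
}

extension FragmentKind {
    // Replaces the inner `convert(kind:)` helper with a computed property,
    // so only one function named `convert` remains at the call site.
    // The mapping is assumed, not taken from the PR.
    var nameFragmentKind: NameFragmentKind {
        switch self {
        case .identifier, .externalParameter:
            return .identifier
        case .keyword, .text:
            return .decorator
        }
    }
}
```

With this shape, the outer function would read `fragment.kind.nameFragmentKind` instead of calling a second `convert`.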

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                          }
+                          // Join together multiple fragments of the same identifier/decorator kind to produce a smaller output.
+                          var result: [LinkedElement.SymbolNameFragment] = []

Contributor

patshaughnessy Dec 10, 2025

We could extract this part into a class method on the LinkedElement.SymbolNameFragment type, or extend Array<LinkedElement.SymbolNameFragment> maybe? Just trying to simplify this code and make it easier to read.
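One possible shape for that extraction, sketched with a hypothetical `NameFragment` stand-in for LinkedElement.SymbolNameFragment. The method name `mergingAdjacentFragmentsOfSameKind()` is invented here, and the real type's properties may differ.

```swift
// Hypothetical stand-in for LinkedElement.SymbolNameFragment.
struct NameFragment: Equatable {
    enum Kind { case identifier, decorator }
    var text: String
    var kind: Kind
}

extension Array where Element == NameFragment {
    // Joins neighboring fragments that share a kind, which produces a smaller output
    // without changing the concatenated text.
    func mergingAdjacentFragmentsOfSameKind() -> [NameFragment] {
        var result: [NameFragment] = []
        for fragment in self {
            if let lastIndex = result.indices.last, result[lastIndex].kind == fragment.kind {
                // Same kind as the previous fragment: extend it in place.
                result[lastIndex].text += fragment.text
            } else {
                result.append(fragment)
            }
        }
        return result
    }
}
```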

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                      // Title
+                      let titleVariants = symbol.titleVariants.allValues.sorted(by: { $0.trait < $1.trait })
+                      for (trait, languageSpecificTitle) in titleVariants {
+                          guard let language = trait.sourceLanguage else { continue }

Contributor

patshaughnessy Dec 10, 2025

This guard statement is what I was referring to earlier. We ignore traits here that don't refer to source language - should we do that above also?
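To make the difference between the two approaches concrete, here is a small sketch with hypothetical stand-in types (`Trait`, plain string language identifiers, and a first-value-wins merge policy are all assumptions, not the PR's code). Defaulting a missing language to Swift can let an unrelated variant take over the Swift title, while filtering with `compactMap` drops it.

```swift
// Hypothetical stand-in for DocumentationDataVariantsTrait.
struct Trait: Hashable {
    var sourceLanguage: String?
}

let titles: [(Trait, String)] = [
    (Trait(sourceLanguage: nil), "Unrelated variant"),
    (Trait(sourceLanguage: "swift"), "init(name:)"),
    (Trait(sourceLanguage: "occ"), "initWithName:"),
]

// Approach 1: treat a missing source language as Swift.
// The language-less variant collides with the real Swift title.
let defaulted: [String: String] = Dictionary(
    titles.map { trait, title in (trait.sourceLanguage ?? "swift", title) },
    uniquingKeysWith: { first, _ in first }
)

// Approach 2: ignore traits that don't specify a source language.
let filtered: [String: String] = Dictionary(
    titles.compactMap { trait, title in trait.sourceLanguage.map { ($0, title) } },
    uniquingKeysWith: { first, _ in first }
)
```

Whether such language-less traits occur in practice is the open question in this thread; the sketch only shows what would happen if they did.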

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+                              attributes = nil
+                          }
+                          hero.addChild(

Contributor

patshaughnessy Dec 10, 2025

It would be great to see an example of what the markup would look like for a symbol page with multiple titles. Since DocC Render doesn't render that normally, this would be covering new ground.

Sources/SwiftDocC/Model/Rendering/HTML/HTMLRenderer.swift

+              // Note; this isn't a Comparable conformance so that it can remain private to this file.
+              private extension DocumentationDataVariantsTrait {
+                  static func < (lhs: DocumentationDataVariantsTrait, rhs: DocumentationDataVariantsTrait) -> Bool {

Contributor

patshaughnessy Dec 10, 2025

Maybe add a generic map function here that you could use in various places above.

Sources/SwiftDocCUtilities/Action/Actions/Convert/FileWritingHTMLContentConsumer.swift

+                      init(data: Data) throws {
+                          let content = String(decoding: data, as: UTF8.self)
+                          // ???: Should we parse the content with XMLParser instead? If so, what do we do if it's not valid XHTML?

Contributor

patshaughnessy Dec 10, 2025

This strikes me as a bit dangerous. It's probably a safe assumption that each index.html file will contain these tags, but we shouldn't crash if they do not. Or if there are multiple <noscript> tags for some reason.

Could we check the ranges a bit more robustly, and skip the pages that have an unexpected index.html file? And maybe write a warning in this case?

Also, we should handle the case where the index.html is missing entirely for some reason.
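A sketch of the kind of range checking being asked for, under stated assumptions: `TemplateProblem`, `occurrences(of:in:)`, and `noscriptContentRange(in:)` are hypothetical names, and the check only looks for a plain `<noscript>` tag without attributes. A caller could catch the error, emit a warning, and skip the page instead of crashing.

```swift
import Foundation

// Hypothetical error type describing why a template can't be used.
enum TemplateProblem: Error, Equatable {
    case missingNoscript
    case multipleNoscript(count: Int)
    case unclosedNoscript
}

// Finds every non-overlapping occurrence of `needle` in `text`.
func occurrences(of needle: String, in text: String) -> [Range<String.Index>] {
    var found: [Range<String.Index>] = []
    var searchStart = text.startIndex
    while let match = text.range(of: needle, range: searchStart..<text.endIndex) {
        found.append(match)
        searchStart = match.upperBound
    }
    return found
}

// Returns the range between "<noscript>" and "</noscript>",
// or throws a descriptive problem when the template is unexpected.
func noscriptContentRange(in html: String) throws -> Range<String.Index> {
    let opens = occurrences(of: "<noscript>", in: html)
    guard let open = opens.first else {
        throw TemplateProblem.missingNoscript
    }
    guard opens.count == 1 else {
        throw TemplateProblem.multipleNoscript(count: opens.count)
    }
    guard let close = html.range(of: "</noscript>", range: open.upperBound..<html.endIndex) else {
        throw TemplateProblem.unclosedNoscript
    }
    return open.upperBound..<close.lowerBound
}
```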

Sources/SwiftDocCUtilities/Action/Actions/Convert/FileWritingHTMLContentConsumer.swift

+                  ) throws {
+                      self.targetFolder = targetFolder
+                      self.fileManager = fileManager
+                      self.htmlTemplate = try HTMLTemplate(data: fileManager.contents(of: htmlTemplate))

Contributor

patshaughnessy Dec 10, 2025

If the file is missing, should we catch this Foundation error, and wrap it in a custom error that would emit an error message that is more specific and descriptive?
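A minimal sketch of such a wrapper, assuming a hypothetical `MissingHTMLTemplateError` type and using Foundation's `Data(contentsOf:)` as a stand-in for the PR's `fileManager.contents(of:)` call.

```swift
import Foundation

// Hypothetical error type; the name and message wording are illustrative only.
struct MissingHTMLTemplateError: Error, CustomStringConvertible {
    let path: String
    let underlying: Error

    var description: String {
        "Unable to read the HTML template at '\(path)': \(underlying.localizedDescription)"
    }
}

// Reads the template and rethrows any Foundation error with more specific context.
// `Data(contentsOf:)` stands in for the file manager abstraction used in the PR.
func loadTemplate(at url: URL) throws -> Data {
    do {
        return try Data(contentsOf: url)
    } catch {
        throw MissingHTMLTemplateError(path: url.path, underlying: error)
    }
}
```

Keeping the underlying error around preserves the original diagnostic while the description names the template path.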

