Difference between revisions of "Talos II/Building FAQ"

From RCS Wiki
Jump to navigation Jump to search
(23 intermediate revisions by 5 users not shown)
Line 25: Line 25:
  
 
== CPU/HSF installation ==
 
== CPU/HSF installation ==
 +
 +
=== How far should the HSF screw be tightened? ===
 +
 +
The screw has a hardstop; you turn until you can't.
  
 
=== What is an indium pad? Does the stock HSF include it? ===
 
=== What is an indium pad? Does the stock HSF include it? ===
Line 31: Line 35:
 
4-core and 8-core CPUs do not require them (and do not ship with them).
 
4-core and 8-core CPUs do not require them (and do not ship with them).
 
More powerful CPUs should ship with them if required (TBD whether pre-applied to the HSF, or separately).
 
More powerful CPUs should ship with them if required (TBD whether pre-applied to the HSF, or separately).
 +
 +
=== Should thermal paste be used? ===
 +
 +
The use of thermal paste is '''not''' recommended under any circumstance. The heatsink is attached using an unusual high-pressure mounting system which places over 200lbs of force on the CPU module, making thermal paste unnecessary.
 +
 +
Optionally, an indium pad may be placed between the heatsink and the CPU heatspreader to enhance dissipation. Testing found this to make no difference to temperatures for 4-core and 8-core CPUs. An indium pad is included with 18 and 22-core CPUs and its use is recommended for those CPUs.
  
 
=== Should I remove the label/sticker from the HSF? ===
 
=== Should I remove the label/sticker from the HSF? ===
Line 36: Line 46:
 
No.
 
No.
 
Do not remove the label/sticker, or you will void the warranty of the HSF.
 
Do not remove the label/sticker, or you will void the warranty of the HSF.
 +
 +
=== Can I use 4mm hex driver? ===
 +
Yes, 5/32" = 3.97 mm.
 +
 +
=== Removing the HSF from a CPU with an indium pad ===
 +
The heat emitted during the operation of the CPU may cause the indium pad to stick to the HSF and the CPU. If the HSF is removed, there is a possibility that the CPU and HSF may stick together, only to separate once the HSF has been partially removed. This could cause the CPU to fall downwards (onto the socket) at an angle, which may damage the socket. For this reason, excercise extreme caution when removing the HSF from a CPU with an indium pad which has been run at load.
  
 
== Front panel I/O ==
 
== Front panel I/O ==
Line 41: Line 57:
 
=== Which is the other side of the buttons? ===
 
=== Which is the other side of the buttons? ===
  
Ground? (guess)
+
Typically ground, though there is nothing mandating this in the general case.  ATX case switches normally short out two adjacent pins when depressed.
 +
 
 +
FIXME: Confirm this is the case for Talos specifically.
  
 
=== Are the LED "cathode" pins the plus or minus side? ===
 
=== Are the LED "cathode" pins the plus or minus side? ===
  
(Unknown)
+
Minus.
  
=== What should the other side of the LED be connected to? ===
+
=== What should the plus side of the LED be connected to? ===
  
(Unknown)
+
The associated Anode pin.
 +
 
 +
{| class="wikitable"
 +
! Purpose || - || +
 +
|-
 +
| Fan fail || 6 || 8
 +
|-
 +
| NIC 2 || 10 || 9
 +
|-
 +
| NIC 1 || 12 || 11
 +
|-
 +
| HD* || 14 || 15
 +
|-
 +
| Power || 16 || 15
 +
|}
  
 
=== What does the Identify button do? ===
 
=== What does the Identify button do? ===
  
(Unknown)
+
Turns on and off the Identify LEDs.  This is mainly useful in server farms, as the ID LED status can be both read and set via software (IPMI).  The main use is making sure that the correct server is unplugged, restarted, upgraded, etc. by datacenter staff.
  
 
=== What does the NMI button do? ===
 
=== What does the NMI button do? ===
  
(Unknown)
+
As of this writing the NMI button is ignored by the BMC.  It may be used to generate an NMI in future firmware revisions, or serve another purpose entirely.
 +
 
 +
=== The HD activity LED doesn't work! ===
 +
 
 +
The integrated Microsemi controller does not report activity (yet).  A much-belated SAS controller firmware update from Microsemi is expected by 04/20/2018 to enable this functionality.
 +
 
 +
In the interim, J10115 can be connected to other hardware to control the HD activity LED.
 +
 
 +
=== What is J10115? ===
 +
 
 +
Something related to HD activity LED. :)
 +
 
 +
== BMC serial port J7701 ==
 +
When buying the "serial port bracket" you will need one with Intel/TDK (DTK)/Tyan style, not AT/Everex/Gigabyte, see http://pinoutguide.com/Motherboard/rs232_header_pinout.shtml for differences.
 +
Intel is https://iczc.cz/8fi3g7r5amg33a1pjn9tl4v9r8_7/obrazek while the other one is https://iczc.cz/5dessg9ns0ht49fed64jmrsita_7/obrazek.
 +
The proof is on page 77 of the schematics.
 +
 
 +
Be careful when looking at specification pages on item listings, some of the wrong ones are sold as "Intel" compatible despite being the other style.
 +
 
 +
See [[Talos II/Hardware Compatibility List#Serial_Adapters_for_J7701_Header]] for a list of known compatible and incompatible adapters.
  
 
== What is OCC mode? ==
 
== What is OCC mode? ==
  
(Unknown)
+
The On Chip Controller (OCC) is a clock / thermal management engine.
 +
 
 +
The OCC can enter a safe mode if external hardware detects a condition that would require power throttling.  This feature is not active in firmware on Talos II, but the wiring required to support it is present for future expansion.
  
 
== What are the effects of the "CPU secure mode disable" jumpers? ==
 
== What are the effects of the "CPU secure mode disable" jumpers? ==
  
(Unknown)
+
When secure mode is disabled, the on-board SBE will not halt IPL if the next stage (hostboot) fails security verification.  When secure mode is enabled, each step of the IPL process verifies the next, and will halt IPL if a discrepancy (hash difference, invalid signature, etc.) is found.  Talos II ships with secure mode disabled as of this writing.
  
 
== How do I verify the PGP key that signed the DVD? ==
 
== How do I verify the PGP key that signed the DVD? ==
  
(Unknown)
+
See the page on [[Verifying DVDs]].
 +
 
 +
== What is micro PCI-e? ==
 +
 
 +
Unknown.
 +
 
 +
== How to get versions of firmware components? ==
 +
* run <code>lsprop</code> under <code>/proc/device-tree/ibm,firmware-versions</code>
 +
* run <code>lsmcode</code> (available in <code>lsvpd</code> package)
 +
* run <code>ipmitool fru print 47</code>
 +
 
 +
== How to change BMC hostname ==
 +
* run <code>hostnamectl set-hostname talos-bmc</code>

Revision as of 02:06, 20 August 2018

Where is the installation manual online?

File:T2P9D01 users guide version 1 0.pdf

My motherboard bag's seal/labels are broken! Has it been compromised?

This is normal for now. (It may have been compromised still, but the broken labels don't indicate that.)

Mounting in case

Where do I get the stand-offs and screws?

They should come with your case. (Check inside drive bays and such.)

Should I use rubber spacers with the stand-offs?

Stand-offs are supposed to help ground the motherboard, so it's better not to.

My case doesn't have holes for some stand-offs!

Not necessarily a big deal, especially for the top-left where the I/O plate helps hold it in place.

However, note that without stand-offs, you may accidentally bend the board when inserting CPUs, RAM, or other components. Such bending may damage the board!

CPU/HSF installation

How far should the HSF screw be tightened?

The screw has a hardstop; you turn until you can't.

What is an indium pad? Does the stock HSF include it?

Indium pads help heat transfer from the CPU to the HSF. 4-core and 8-core CPUs do not require them (and do not ship with them). More powerful CPUs should ship with them if required (TBD whether pre-applied to the HSF, or separately).

Should thermal paste be used?

The use of thermal paste is not recommended under any circumstance. The heatsink is attached using an unusual high-pressure mounting system which places over 200lbs of force on the CPU module, making thermal paste unnecessary.

Optionally, an indium pad may be placed between the heatsink and the CPU heatspreader to enhance dissipation. Testing found this to make no difference to temperatures for 4-core and 8-core CPUs. An indium pad is included with 18 and 22-core CPUs and its use is recommended for those CPUs.

Should I remove the label/sticker from the HSF?

No. Do not remove the label/sticker, or you will void the warranty of the HSF.

Can I use 4mm hex driver?

Yes, 5/32" = 3.97 mm.

Removing the HSF from a CPU with an indium pad

The heat emitted during the operation of the CPU may cause the indium pad to stick to the HSF and the CPU. If the HSF is removed, there is a possibility that the CPU and HSF may stick together, only to separate once the HSF has been partially removed. This could cause the CPU to fall downwards (onto the socket) at an angle, which may damage the socket. For this reason, excercise extreme caution when removing the HSF from a CPU with an indium pad which has been run at load.

Front panel I/O

Which is the other side of the buttons?

Typically ground, though there is nothing mandating this in the general case. ATX case switches normally short out two adjacent pins when depressed.

FIXME: Confirm this is the case for Talos specifically.

Are the LED "cathode" pins the plus or minus side?

Minus.

What should the plus side of the LED be connected to?

The associated Anode pin.

Purpose - +
Fan fail 6 8
NIC 2 10 9
NIC 1 12 11
HD* 14 15
Power 16 15

What does the Identify button do?

Turns on and off the Identify LEDs. This is mainly useful in server farms, as the ID LED status can be both read and set via software (IPMI). The main use is making sure that the correct server is unplugged, restarted, upgraded, etc. by datacenter staff.

What does the NMI button do?

As of this writing the NMI button is ignored by the BMC. It may be used to generate an NMI in future firmware revisions, or serve another purpose entirely.

The HD activity LED doesn't work!

The integrated Microsemi controller does not report activity (yet). A much-belated SAS controller firmware update from Microsemi is expected by 04/20/2018 to enable this functionality.

In the interim, J10115 can be connected to other hardware to control the HD activity LED.

What is J10115?

Something related to HD activity LED. :)

BMC serial port J7701

When buying the "serial port bracket" you will need one with Intel/TDK (DTK)/Tyan style, not AT/Everex/Gigabyte, see http://pinoutguide.com/Motherboard/rs232_header_pinout.shtml for differences. Intel is https://iczc.cz/8fi3g7r5amg33a1pjn9tl4v9r8_7/obrazek while the other one is https://iczc.cz/5dessg9ns0ht49fed64jmrsita_7/obrazek. The proof is on page 77 of the schematics.

Be careful when looking at specification pages on item listings, some of the wrong ones are sold as "Intel" compatible despite being the other style.

See Talos II/Hardware Compatibility List#Serial_Adapters_for_J7701_Header for a list of known compatible and incompatible adapters.

What is OCC mode?

The On Chip Controller (OCC) is a clock / thermal management engine.

The OCC can enter a safe mode if external hardware detects a condition that would require power throttling. This feature is not active in firmware on Talos II, but the wiring required to support it is present for future expansion.

What are the effects of the "CPU secure mode disable" jumpers?

When secure mode is disabled, the on-board SBE will not halt IPL if the next stage (hostboot) fails security verification. When secure mode is enabled, each step of the IPL process verifies the next, and will halt IPL if a discrepancy (hash difference, invalid signature, etc.) is found. Talos II ships with secure mode disabled as of this writing.

How do I verify the PGP key that signed the DVD?

See the page on Verifying DVDs.

What is micro PCI-e?

Unknown.

How to get versions of firmware components?

  • run lsprop under /proc/device-tree/ibm,firmware-versions
  • run lsmcode (available in lsvpd package)
  • run ipmitool fru print 47

How to change BMC hostname

  • run hostnamectl set-hostname talos-bmc