To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???W}???W{^ 0011111100111111001111110101011101111101001111110011111100111111010101110111101101011110 3f3f3f577d3f3f3f577b5e
SJIS-WIN 谷端測W}谷端測W{^ 1001001001001010100100100101101110010001101010100101011101111101100100100100101010010010010110111001000110101010010101110111101101011110 924a925b91aa577d924a925b91aa577b5e
EUC-JP 谷端測W}谷端測W{^ 1100001110101011110000111011110011000010101011000101011101111101110000111010101111000011101111001100001010101100010101110111101101011110 c3abc3bcc2ac577dc3abc3bcc2ac577b5e
UTF-8 谷端測W}谷端測W{^ 1110100010110000101101111110011110101011101011111110011010111000101011000101011101111101111010001011000010110111111001111010101110101111111001101011100010101100010101110111101101011110 e8b0b7e7abafe6b8ac577de8b0b7e7abafe6b8ac577b5e
UHC 谷端測W}谷端測W{^ 1100110111011011110100111010111011110110101101000101011101111101110011011101101111010011101011101111011010110100010101110111101101011110 cddbd3aef6b4577dcddbd3aef6b4577b5e
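
The table above can be approximated with a short script. This is a minimal sketch, not the converter's actual implementation: the function name `encode_bits` and the `CHARSETS` mapping are hypothetical, and the choice of Python codec for each label is an assumption (`cp932` for SJIS-WIN, `cp949` for UHC). Python's codec tables may differ from the converter's for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The codec chosen for each label is an assumption, not taken from the tool.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "谷端測W}"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Multi-byte charsets produce two or three bytes per kanji, while the ASCII characters `W`, `}`, `{`, and `^` encode to the same single byte in every charset listed.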

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).