What bit string are characters encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 谷歎他谷臓端谷歎他谷臓端B 10010010010010101001001001010110100100011011110010010010010010101001000110011111100100100101101110010010010010101001001001010110100100011011110010010010010010101001000110011111100100100101101101000010 924a925691bc924a919f925b924a925691bc924a919f925b42
EUC-JP 谷歎他谷臓端谷歎他谷臓端B 11000011101010111100001110110111110000101011111011000011101010111100001010100001110000111011110011000011101010111100001110110111110000101011111011000011101010111100001010100001110000111011110001000010 c3abc3b7c2bec3abc2a1c3bcc3abc3b7c2bec3abc2a1c3bc42
UTF-8 谷歎他谷臓端谷歎他谷臓端B 11101000101100001011011111100110101011011000111011100100101110111001011011101000101100001011011111101000100001111001001111100111101010111010111111101000101100001011011111100110101011011000111011100100101110111001011011101000101100001011011111101000100001111001001111100111101010111010111101000010 e8b0b7e6ad8ee4bb96e8b0b7e88793e7abafe8b0b7e6ad8ee4bb96e8b0b7e88793e7abaf42
UHC 谷歎他谷?端谷歎他谷?端B 1100110111011011111101111010011111110110111000101100110111011011001111111101001110101110110011011101101111110111101001111111011011100010110011011101101100111111110100111010111001000010 cddbf7a7f6e2cddb3fd3aecddbf7a7f6e2cddb3fd3ae42

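SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f), as in the ISO-8859-1 and UHC rows.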
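
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The codec names are Python's, and the mapping from the page's labels to them (SJIS-WIN to cp932, UHC to cp949) is an assumption, as is the helper name encode_table. Unencodable characters are replaced with "?", which matches the 3f bytes in the table.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# The label-to-codec mapping below is an assumption, not taken from the page.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), binary, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("谷歎他谷臓端谷歎他谷臓端B"):
        print(*row)
```

Each byte contributes exactly eight bits to the binary column and two digits to the hexadecimal column, so the binary string is always four times as long as the hexadecimal one.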