To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???巽??遺??沃??悠??矜裕?ぜ蹂 00111111001111110011111110010010010001100011111100111111100010001110001000111111001111111001011110000000001111110011111110010111010010010011111100111111111000011110000010010111010101000011111110000010101110101110011011111000 3f3f3f92463f3f88e23f3f97803f3f97493f3fe1e097543f82bae6f8
EUC-JP ???巽??遺??沃??悠??矜裕?ぜ蹂 00111111001111110011111111000011101001110011111100111111101100001110010000111111001111111100110111100000001111110011111111001101101010100011111100111111111000101110001011001101101101010011111110100100101111001110110011111010 3f3f3fc3a73f3fb0e43f3fcde03f3fcdaa3f3fe2e2cdb53fa4bcecfa
UTF-8 樂낅뜄巽섇젆遺삠걶沃섅끉悠드슫矜裕뉓ぜ蹂 111011111010011010111111111010111000001010000101111010111001110010000100111001011011011110111101111011001000010010000111111011001010000010000110111010011000000110111010111011001000001010100000111010101011000110110110111001101011001010000011111011001000010010000101111010111000000110001001111001101000001010100000111010111001001110011100111011001000101010101011111001111001111110011100111010001010001110010101111010111000100110010011111000111000000110011100111010001011100110000010 efa6bfeb8285eb9c84e5b7bdec8487eca086e981baec82a0eab1b6e6b283ec8485eb8189e682a0eb939cec8aabe79f9ce8a395eb8993e3819ce8b982
UHC 樂낅뜄巽섇젆遺삠걶沃섅끉悠드슫矜裕뉓ぜ蹂 11101000111110011000010111101011100011011000100011100001110111101001100011100101101000001000100111101011101101101011101111100011100000011001110011101000101010101001100011100011100001011011110011101010111011011011010111100101100110101011010011010000111010001110101110101110100001111110100010101010101111001110101110110011 e8f985eb8d88e1de98e5a089ebb6bbe3819ce8aa98e385bceaedb5e59ab4d0e8ebae87e8aabcebb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)