What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???意????????誼??碎?????裕 0011111100111111001111111000100011010011001111110011111100111111001111110011111100111111001111110011111110001011011000100011111100111111111000011110101000111111001111110011111100111111001111111001011101010100 3f3f3f88d33f3f3f3f3f3f3f3f8b623f3fe1ea3f3f3f3f3f9754
EUC-JP ???意????????誼??碎??倻??裕 00111111001111110011111110110000110101010011111100111111001111110011111100111111001111110011111100111111101101011100001100111111001111111110001011101100001111110011111110001111101100011111011000111111001111111100110110110101 3f3f3fb0d53f3f3f3f3f3f3f3fb5c33f3fe2ec3f3f8fb1f63f3fcdb5
UTF-8 列룸씈意곫걗栒뀀뙓列룸씈誼끻룛碎몌폇倻딅쑐裕 111011111010011010011100111010111010001110111000111011001001010010001000111001101000010010001111111010101011001110101011111010101011000110010111111001101010000010010010111010111000000010000000111010111001100110010011111011111010011010011100111010111010001110111000111011001001010010001000111010001010101010111100111010111000000110111011111010111010001110011011111001111010001010001110111010111010101010001100111011011000111110000111111001011000000010111011111010111001010010000101111011001001000110010000111010001010001110010101 efa69ceba3b8ec9488e6848feab3abeab197e6a092eb8080eb9993efa69ceba3b8ec9488e8aabceb81bbeba39be7a28eebaa8ced8f87e580bbeb9485ec9190e8a395
UHC 列룸씈意곫걗栒뀀뙓列룸씈誼끻룛碎몌폇倻딅쑐裕 1110011011101010101101111110101110011101101000001110101111110010100000011110011010000001100000101110001011100011101100101110101110001100100110001110011011101010101101111110101110011101101000001110101111111110100001011110010110001111100101111110000111101111101110001110111110111100100101001110010110100110100010101110101110011100101011111110101110101110 e6eab7eb9da0ebf281e68182e2e3b2eb8c98e6eab7eb9da0ebfe85e58f97e1efb8efbc94e5a68aeb9cafebae

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
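
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper name `encode_all` are assumptions chosen for illustration, and Python's codecs may not match the page's converter for every character. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the rows above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_all("意").items():
        print(label, binary, hexadecimal)
```

For the single character 意, this yields 88d3 for SJIS-WIN, b0d5 for EUC-JP, and e6848f for UTF-8, the same byte sequences that appear for that character in the table rows above.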