To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??邑??閻??裕??兢悠??柔?? 0011111100111111001111111110100011101000001111110011111110010111010101110011111100111111111010001000010100111111001111111001011101010100001111110011111110011001010111011001011101001001001111110011111110001111010111110011111100111111 3f3f3fe8e83f3f97573f3fe8853f3f97543f3f995d97493f3f8f5f3f3f
EUC-JP ???韋??邑??閻??裕??兢悠??柔?? 0011111100111111001111111111000011101010001111110011111111001101101110000011111100111111111011111110010100111111001111111100110110110101001111110011111111010001101111101100110110101010001111110011111110111101110000000011111100111111 3f3f3ff0ea3f3fcdb83f3fefe53f3fcdb53f3fd1becdaa3f3fbdc03f3f
UTF-8 捻뀁궠韋껃젆邑룔럹閻롫쓹裕드슫兢悠껓쭪柔곗쵁 111011111010011010100100111010111000000010000001111010101011011010100000111010011001111110001011111010101011101110000011111011001010000010000110111010011000001010010001111010111010001110010100111010111001111110111001111010011001011010111011111010111010000110101011111011001001001110111001111010001010001110010101111010111001001110011100111011001000101010101011111001011000010110100010111001101000001010100000111010101011101110010011111011001010110110101010111001101001111110010100111010101011001110010111111011001011010110000001 efa6a4eb8081eab6a0e99f8beabb83eca086e98291eba394eb9fb9e996bbeba1abec93b9e8a395eb939cec8aabe585a2e682a0eabb93ecadaae69f94eab397ecb581
UHC 捻뀁궠韋껃젆邑룔럹閻롫쓹裕드슫兢悠껓쭪柔곗쵁 1110011011110111101100101110110010000010101100111110101011011111100000111110010110100000100010011110101111101001101101111110001110001110100110001110011110100010100011101110101110011101100101011110101110101110101101011110010110011010101101001101000011100111111010101110110110000011111011111010011110011110111010101111010110110000111011001010110010000011 e6f7b2ec82b3eadf83e5a089ebe9b7e38e98e7a28eeb9d95ebaeb5e59ab4d0e7eaed83efa79eeaf5b0ecac83

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)