To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????厓??節ц?節??躍????? 0011111100111111001111110011111100111111001111111111101010001101001111110011111110010000110111111000010010001000001111111001000011011111001111110011111110010110111101000011111100111111001111110011111100111111 3f3f3f3f3f3ffa8d3f3f90df84883f90df3f3f96f43f3f3f3f3f
EUC-JP 旿??縕??厓??節ц?節??躍??饔?? 100011111100000111110100001111110011111110001111110101001100001000111111001111111000111110110100110001110011111100111111110000001110000110100111111010000011111111000000111000010011111100111111110011001111011000111111001111111000111111101000111011110011111100111111 8fc1f43f3f8fd4c23f3f8fb4c73f3fc0e1a7e83fc0e13f3fccf63f3f8fe8ef3f3f
UTF-8 旿딉슁縕됧툤厓됭쾫節ц쐥節녘쾳躍쎾룊饔ㅿ쉭 1110011010010111101111111110101110010100100010011110110010001010100000011110011110111000100101011110101110010000101001111110110110001000101001001110010110001110100100111110101110010000101011011110110010111110101010111110011110101111100000001101000110000110111011001001000010100101111001111010111110000000111010111000010110011000111011001011111010110011111010001011101010001101111011001000111010111110111010111010001110001010111010011010010110010100111000111000010110111111111011001000100110101101 e697bfeb9489ec8a81e7b895eb90a7ed88a4e58e93eb90adecbeabe7af80d186ec90a5e7af80eb8598ecbeb3e8ba8dec8ebeeba38ae9a594e385bfec89ad
UHC 旿딉슁縕됧툤厓됭쾫節ц쐥節녘쾳躍쎾룊饔ㅿ쉭 111001111111101010001010111011111011110110110011111010001011001010001001111001011011100010011011111001001110110110001001111010001011001010000010111011111011110110101100111010001001110010001010111011111011110110110011111010001011001010001001111001011011100010011011111001011000111110001001111010001011110110100100111011111011110110101101 e7fa8aefbdb3e8b289e5b89be4ed89e8b282efbdace89c8aefbdb3e8b289e5b89be58f89e8bda4efbdad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)