To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鴨佃?冀?????腎?◆佃ゆ暄??? 0011111100111111001111111000101010011011100100101100111100111111100110010110001000111111001111110011111100111111001111111001000001110100001111111000000110011111100100101100111110000010111001001001110111110101001111110011111100111111 3f3f3f8a9b92cf3f99623f3f3f3f3f90743f819f92cf82e49df53f3f3f
EUC-JP ???鴨佃?冀?????腎?◆佃ゆ暄??? 0011111100111111001111111011001111111011110001001101000100111111110100011100001100111111001111110011111100111111001111111011111111010101001111111010001010100001110001001101000110100100111001101101101011110111001111110011111100111111 3f3f3fb3fbc4d13fd1c33f3f3f3f3fbfd53fa2a1c4d1a4e6daf73f3f3f
UTF-8 쒔롍뤎鴨佃쳩冀쏜렠쒀롊뤎腎퀚◆佃ゆ暄춲햵쥙 111011001001001010010100111010111010000110001101111010111010010010001110111010011011010010101000111001001011110110000011111011001011001110101001111001011000011010000000111011001000111110011100111010111010000010100000111011001001001010000000111010111010000110001010111010111010010010001110111010001000010110001110111011011000000010011010111000101001011110000110111001001011110110000011111000111000001010000110111001101001101010000100111011001011011010110010111011011001011010110101111011001010010110011001 ec9294eba18deba48ee9b4a8e4bd83ecb3a9e58680ec8f9ceba0a0ec9280eba18aeba48ee8858eed809ae29786e4bd83e38286e69a84ecb6b2ed96b5eca599
UHC 쒔롍뤎鴨佃쳩冀쏜렠쒀롊뤎腎퀚◆佃ゆ暄춲햵쥙 101111101010110110001110110100111000111110111110111001001110010111101110111011001010101110001110110100001110110110111101111100001000111010110001101111101010110010001110110100001000111110111110111000111110110010110011100011101010000111011111111011101110110010101010111001101111110110111110101011011000111011000001100011101010001010001110 bead8ed38fbee4e5eeecab8ed0edbdf08eb1beac8ed08fbee3ecb38ea1dfeeecaae6fdbead8ec18ea28e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)