To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?厓ヘ??ギ??ぃ?厓ヘ??ギ??ぃB 001111111111101010001101100000110111011100111111001111111000001101001101001111110011111110000010101000010011111111111010100011011000001101110111001111110011111110000011010011010011111100111111100000101010000101000010 3ffa8d83773f3f834d3f3f82a13ffa8d83773f3f834d3f3f82a142
EUC-JP ?厓ヘ??ギ??ぃ?厓ヘ??ギ??ぃB 0011111110001111101101001100011110100101110110000011111100111111101001011010111000111111001111111010010010100011001111111000111110110100110001111010010111011000001111110011111110100101101011100011111100111111101001001010001101000010 3f8fb4c7a5d83f3fa5ae3f3fa4a33f8fb4c7a5d83f3fa5ae3f3fa4a342
UTF-8 룶厓ヘ룶欄ギ룫휁ぃ룶厓ヘ룶欄ギ룫휁ぃB 11101011101000111011011011100101100011101001001111100011100000111001100011101011101000111011011011101111101001001001110111100011100000101010111011101011101000111010101111101101100111001000000111100011100000011000001111101011101000111011011011100101100011101001001111100011100000111001100011101011101000111011011011101111101001001001110111100011100000101010111011101011101000111010101111101101100111001000000111100011100000011000001101000010 eba3b6e58e93e38398eba3b6efa49de382aeeba3abed9c81e38183eba3b6e58e93e38398eba3b6efa49de382aeeba3abed9c81e3818342
UHC 룶厓ヘ룶欄ギ룫휁ぃ룶厓ヘ룶欄ギ룫휁ぃB 10001111101010111110010011101101101010111101100010001111101010111101000111101101101010111010111010001111101000101100010010001111101010101010001110001111101010111110010011101101101010111101100010001111101010111101000111101101101010111010111010001111101000101100010010001111101010101010001101000010 8fabe4edabd88fabd1edabae8fa2c48faaa38fabe4edabd88fabd1edabae8fa2c48faaa342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
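
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's closest equivalents and are assumptions, and they may differ from the converter's own tables for a few vendor-specific characters. The helper names `CHARSETS`, `encode_row`, and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Build one (label, bits, hex) row per charset, in table order."""
    return [(label, *encode_row(text, codec)) for label, codec in CHARSETS.items()]


if __name__ == "__main__":
    for label, bits, hexstr in convert("ぃB"):
        print(label, bits, hexstr)
```

For the input "ぃB", the bytes for "ぃ" differ from charset to charset, while "B" is the single byte 0x42 in all five, as in the last byte of every row of the table.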