To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 際?爾?蕪應賂??際?爾?蕪應賂??B 1000110111011011001111111000111010100010001111111001010110010011100111001110010010011000010001110011111100111111100011011101101100111111100011101010001000111111100101011001001110011100111001001001100001000111001111110011111101000010 8ddb3f8ea23f95939ce498473f3f8ddb3f8ea23f95939ce498473f3f42
EUC-JP 際?爾?蕪應賂??際?爾?蕪應賂??B 1011101011011101001111111011110010100100001111111100100111110011110110001110011011001111101010000011111100111111101110101101110100111111101111001010010000111111110010011111001111011000111001101100111110101000001111110011111101000010 badd3fbca43fc9f3d8e6cfa83f3fbadd3fbca43fc9f3d8e6cfa83f3f42
UTF-8 際렑爾잭蕪應賂렰렞際렑爾잭蕪應賂렰렞B 11101001100110101001101111101011101000001001000111100111100010001011111011101100100111101010110111101000100101011010101011100110100001111000100111101000101100111000001011101011101000001011000011101011101000001001111011101001100110101001101111101011101000001001000111100111100010001011111011101100100111101010110111101000100101011010101011100110100001111000100111101000101100111000001011101011101000001011000011101011101000001001111001000010 e99a9beba091e788beec9eade895aae68789e8b382eba0b0eba09ee99a9beba091e788beec9eade895aae68789e8b382eba0b0eba09e42
UHC 際렑爾잭蕪應賂렰렞際렑爾잭蕪應賂렰렞B 11110000101101111000111010100110111011001011001111000000111010001101100111110011111010111110101111010110111100011000111010111101100011101010111111110000101101111000111010100110111011001011001111000000111010001101100111110011111010111110101111010110111100011000111010111101100011101010111101000010 f0b78ea6ecb3c0e8d9f3ebebd6f18ebd8eaff0b78ea6ecb3c0e8d9f3ebebd6f18ebd8eaf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)