To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?厓ち?釐ち??び?厓ち?釐ち??びB 0011111111111010100011011000001010111111001111111110011111011000100000101011111100111111001111111000001011010001001111111111101010001101100000101011111100111111111001111101100010000010101111110011111100111111100000101101000101000010 3ffa8d82bf3fe7d882bf3f3f82d13ffa8d82bf3fe7d882bf3f3f82d142
EUC-JP ?厓ち?釐ち??び?厓ち?釐ち??びB 00111111100011111011010011000111101001001100000100111111111011101101101010100100110000010011111100111111101001001101001100111111100011111011010011000111101001001100000100111111111011101101101010100100110000010011111100111111101001001101001101000010 3f8fb4c7a4c13feedaa4c13f3fa4d33f8fb4c7a4c13feedaa4c13f3fa4d342
UTF-8 룵厓ち룶釐ち룶欄び룵厓ち룶釐ち룶欄びB 11101011101000111011010111100101100011101001001111100011100000011010000111101011101000111011011011101001100001111001000011100011100000011010000111101011101000111011011011101111101001001001110111100011100000011011001111101011101000111011010111100101100011101001001111100011100000011010000111101011101000111011011011101001100001111001000011100011100000011010000111101011101000111011011011101111101001001001110111100011100000011011001101000010 eba3b5e58e93e381a1eba3b6e98790e381a1eba3b6efa49de381b3eba3b5e58e93e381a1eba3b6e98790e381a1eba3b6efa49de381b342
UHC 룵厓ち룶釐ち룶欄び룵厓ち룶釐ち룶欄びB 10001111101010101110010011101101101010101100000110001111101010111101011111101101101010101100000110001111101010111101000111101101101010101101001110001111101010101110010011101101101010101100000110001111101010111101011111101101101010101100000110001111101010111101000111101101101010101101001101000010 8faae4edaac18fabd7edaac18fabd1edaad38faae4edaac18fabd7edaac18fabd1edaad342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)