What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????zW????zzK}????zW????zzK{^ 0011111100111111001111110011111101111010010101110011111100111111001111110011111101111010011110100100101101111101001111110011111100111111001111110111101001010111001111110011111100111111001111110111101001111010010010110111101101011110 3f3f3f3f7a573f3f3f3f7a7a4b7d3f3f3f3f7a573f3f3f3f7a7a4b7b5e
SJIS-WIN 稟頃??zW稟頃??zzK}稟頃??zW稟頃??zzK{^ 11100010011001111000110110100000001111110011111101111010010101111110001001100111100011011010000000111111001111110111101001111010010010110111110111100010011001111000110110100000001111110011111101111010010101111110001001100111100011011010000000111111001111110111101001111010010010110111101101011110 e2678da03f3f7a57e2678da03f3f7a7a4b7de2678da03f3f7a57e2678da03f3f7a7a4b7b5e
EUC-JP 稟頃??zW稟頃??zzK}稟頃??zW稟頃??zzK{^ 11100011110010001011101010100010001111110011111101111010010101111110001111001000101110101010001000111111001111110111101001111010010010110111110111100011110010001011101010100010001111110011111101111010010101111110001111001000101110101010001000111111001111110111101001111010010010110111101101011110 e3c8baa23f3f7a57e3c8baa23f3f7a7a4b7de3c8baa23f3f7a57e3c8baa23f3f7a7a4b7b5e
UTF-8 稟頃렰렔zW稟頃렰렔zzK}稟頃렰렔zW稟頃렰렔zzK{^ 11100111101010001001111111101001101000001000001111101011101000001011000011101011101000001001010001111010010101111110011110101000100111111110100110100000100000111110101110100000101100001110101110100000100101000111101001111010010010110111110111100111101010001001111111101001101000001000001111101011101000001011000011101011101000001001010001111010010101111110011110101000100111111110100110100000100000111110101110100000101100001110101110100000100101000111101001111010010010110111101101011110 e7a89fe9a083eba0b0eba0947a57e7a89fe9a083eba0b0eba0947a7a4b7de7a89fe9a083eba0b0eba0947a57e7a89fe9a083eba0b0eba0947a7a4b7b5e
UHC 稟頃렰렔zW稟頃렰렔zzK}稟頃렰렔zW稟頃렰렔zzK{^ 111110011010001011001100111100011000111010111101100011101010100101111010010101111111100110100010110011001111000110001110101111011000111010101001011110100111101001001011011111011111100110100010110011001111000110001110101111011000111010101001011110100101011111111001101000101100110011110001100011101011110110001110101010010111101001111010010010110111101101011110 f9a2ccf18ebd8ea97a57f9a2ccf18ebd8ea97a7a4b7df9a2ccf18ebd8ea97a57f9a2ccf18ebd8ea97a7a4b7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
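
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents from Python's standard library, and the helper names `encode_to_bits` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f) here, which is consistent with the runs of 3f in the table, though the tool may handle them differently.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_to_bits(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("zW").items():
        print(name, binary, hexadecimal)
```

For ASCII-only input such as "zW", every charset listed produces the same two bytes (7a 57). The rows differ only for characters outside ASCII.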