To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????b[?????????b[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001001011011001111110011111100111111001111110011111100111111001111110011111100111111011000100101101101011110 3f3f3f3f3f3f3f3f3f625b3f3f3f3f3f3f3f3f3f625b5e
SJIS-WIN 厭レ?愿??矣??b[厭レ?愿??矣??b[^ 10001001011111011000001110001100001111111001110011000011001111110011111111100001111000010011111100111111011000100101101110001001011111011000001110001100001111111001110011000011001111110011111111100001111000010011111100111111011000100101101101011110 897d838c3f9cc33f3fe1e13f3f625b897d838c3f9cc33f3fe1e13f3f625b5e
EUC-JP 厭レ?愿??矣??b[厭レ?愿??矣??b[^ 10110001110111101010010111101100001111111101100011000101001111110011111111100010111000110011111100111111011000100101101110110001110111101010010111101100001111111101100011000101001111110011111111100010111000110011111100111111011000100101101101011110 b1dea5ec3fd8c53f3fe2e33f3f625bb1dea5ec3fd8c53f3fe2e33f3f625b5e
UTF-8 厭レ슃愿뚨뼨矣뺤릉b[厭レ슃愿뚨뼨矣뺤릉b[^ 1110010110001110101011011110001110000011101011001110110010001010100000111110011010000100101111111110101110011010101010001110101110111100101010001110011110011111101000111110101110111010101001001110101110100110100010010110001001011011111001011000111010101101111000111000001110101100111011001000101010000011111001101000010010111111111010111001101010101000111010111011110010101000111001111001111110100011111010111011101010100100111010111010011010001001011000100101101101011110 e58eade383acec8a83e684bfeb9aa8ebbca8e79fa3ebbaa4eba689625be58eade383acec8a83e684bfeb9aa8ebbca8e79fa3ebbaa4eba689625b5e
UHC 厭レ슃愿뚨뼨矣뺤릉b[厭レ슃愿뚨뼨矣뺤릉b[^ 1110011011110100101010111110110010011010100101011110101010110100100011001110011110010110101010111110101111111000100101011110110010111000101010100110001001011011111001101111010010101011111011001001101010010101111010101011010010001100111001111001011010101011111010111111100010010101111011001011100010101010011000100101101101011110 e6f4abec9a95eab48ce796abebf895ecb8aa625be6f4abec9a95eab48ce796abebf895ecb8aa625b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)