To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 厭レ?幼??矣?┓i厭レ?幼??矣?┓iB 10001001011111011000001110001100001111111001011101100011001111110011111111100001111000010011111110000100101011010110100110001001011111011000001110001100001111111001011101100011001111110011111111100001111000010011111110000100101011010110100101000010 897d838c3f97633f3fe1e13f84ad69897d838c3f97633f3fe1e13f84ad6942
EUC-JP 厭レ?幼??矣?┓i厭レ?幼??矣?┓iB 10110001110111101010010111101100001111111100110111000100001111110011111111100010111000110011111110101000101011110110100110110001110111101010010111101100001111111100110111000100001111110011111111100010111000110011111110101000101011110110100101000010 b1dea5ec3fcdc43f3fe2e33fa8af69b1dea5ec3fcdc43f3fe2e33fa8af6942
UTF-8 厭レ슌幼뚨뼨矣쒖┓i厭レ슌幼뚨뼨矣쒖┓iB 111001011000111010101101111000111000001110101100111011001000101010001100111001011011100110111100111010111001101010101000111010111011110010101000111001111001111110100011111011001001001010010110111000101001010010010011011010011110010110001110101011011110001110000011101011001110110010001010100011001110010110111001101111001110101110011010101010001110101110111100101010001110011110011111101000111110110010010010100101101110001010010100100100110110100101000010 e58eade383acec8a8ce5b9bceb9aa8ebbca8e79fa3ec9296e2949369e58eade383acec8a8ce5b9bceb9aa8ebbca8e79fa3ec9296e294936942
UHC 厭レ슌幼뚨뼨矣쒖┓i厭レ슌幼뚨뼨矣쒖┓iB 111001101111010010101011111011001001101010011100111010101110101010001100111001111001011010101011111010111111100010011100111011001010011010101111011010011110011011110100101010111110110010011010100111001110101011101010100011001110011110010110101010111110101111111000100111001110110010100110101011110110100101000010 e6f4abec9a9ceaea8ce796abebf89ceca6af69e6f4abec9a9ceaea8ce796abebf89ceca6af6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)