To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????~ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101111110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7e
SJIS-WIN 厭レ????矣λ?厭レ????矣?┓~ 100010010111110110000011100011000011111100111111001111110011111111100001111000011000001111001001001111111000100101111101100000111000110000111111001111110011111100111111111000011110000100111111100001001010110101111110 897d838c3f3f3f3fe1e183c93f897d838c3f3f3f3fe1e13f84ad7e
EUC-JP 厭レ?堉??矣λ?厭レ?堉??矣?┓~ 10110001110111101010010111101100001111111000111110110111111111010011111100111111111000101110001110100110110010110011111110110001110111101010010111101100001111111000111110110111111111010011111100111111111000101110001100111111101010001010111101111110 b1dea5ec3f8fb7fd3f3fe2e3a6cb3fb1dea5ec3f8fb7fd3f3fe2e33fa8af7e
UTF-8 厭レ슌堉사뼨矣λ츍厭レ슌堉사뼨矣쒖┓~ 111001011000111010101101111000111000001110101100111011001000101010001100111001011010000010001001111011001000001010101100111010111011110010101000111001111001111110100011110011101011101111101100101110001000110111100101100011101010110111100011100000111010110011101100100010101000110011100101101000001000100111101100100000101010110011101011101111001010100011100111100111111010001111101100100100101001011011100010100101001001001101111110 e58eade383acec8a8ce5a089ec82acebbca8e79fa3cebbecb88de58eade383acec8a8ce5a089ec82acebbca8e79fa3ec9296e294937e
UHC 厭レ슌堉사뼨矣λ츍厭レ슌堉사뼨矣쒖┓~ 11100110111101001010101111101100100110101001110011101011101111001011101111100111100101101010101111101011111110001010010111101011101011101000100011100110111101001010101111101100100110101001110011101011101111001011101111100111100101101010101111101011111110001001110011101100101001101010111101111110 e6f4abec9a9cebbcbbe796abebf8a5ebae88e6f4abec9a9cebbcbbe796abebf89ceca6af7e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)