What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 厭レ?魏??矣??厭レ????矣??B 1000100101111101100000111000110000111111111010011011000000111111001111111110000111100001001111110011111110001001011111011000001110001100001111110011111100111111001111111110000111100001001111110011111101000010 897d838c3fe9b03f3fe1e13f3f897d838c3f3f3f3fe1e13f3f42
EUC-JP 厭レ?魏??矣??厭レ?沅??矣??B 10110001110111101010010111101100001111111111001010110010001111110011111111100010111000110011111100111111101100011101111010100101111011000011111110001111110001101110100100111111001111111110001011100011001111110011111101000010 b1dea5ec3ff2b23f3fe2e33f3fb1dea5ec3f8fc6e93f3fe2e33f3f42
UTF-8 厭レ슕魏섊뼨矣뺤끽厭レ슖沅뺟뼨矣뺤퍢B 11100101100011101010110111100011100000111010110011101100100010101001010111101001101011011000111111101100100001001000101011101011101111001010100011100111100111111010001111101011101110101010010011101011100000011011110111100101100011101010110111100011100000111010110011101100100010101001011011100110101100101000010111101011101110101001111111101011101111001010100011100111100111111010001111101011101110101010010011101101100011011010001001000010 e58eade383acec8a95e9ad8fec848aebbca8e79fa3ebbaa4eb81bde58eade383acec8a96e6b285ebba9febbca8e79fa3ebbaa4ed8da242
UHC 厭レ슕魏섊뼨矣뺤끽厭レ슖沅뺟뼨矣뺤퍢B 11100110111101001010101111101100100110101010010011101010111000001001100011100111100101101010101111101011111110001001010111101100101100111010001111100110111101001010101111101100100110101010010111101010101101101001010111100111100101101010101111101011111110001001010111101100101110111001100101000010 e6f4abec9aa4eae098e796abebf895ecb3a3e6f4abec9aa5eab695e796abebf895ecbb9942

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
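
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not this page's actual implementation. The codec names are Python's, and mapping them to the table's labels is an assumption (in particular `cp932` for SJIS-WIN and `cp949` for UHC); the helper names `encode_row` and `convert` are hypothetical. Characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table.

```python
# Table label -> Python codec name. This mapping is an assumption:
# the page's own converter may differ from Python's codecs in edge cases.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("レB").items():
        print(label, bits, hexa)
```

For the input `レB`, the SJIS-WIN, EUC-JP, and UTF-8 hex values produced by this sketch (`838c42`, `a5ec42`, `e383ac42`) agree with the corresponding bytes in the table above.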