To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????BF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
SJIS-WIN 厭レ?宥??矣??厭レ????矣μ?BF 10001001011111011000001110001100001111111001011101000111001111110011111111100001111000010011111100111111100010010111110110000011100011000011111100111111001111110011111111100001111000011000001111001010001111110100001001000110 897d838c3f97473f3fe1e13f3f897d838c3f3f3f3fe1e183ca3f4246
EUC-JP 厭レ?宥??矣??厭レ????矣μ?BF 10110001110111101010010111101100001111111100110110101000001111110011111111100010111000110011111100111111101100011101111010100101111011000011111100111111001111110011111111100010111000111010011011001100001111110100001001000110 b1dea5ec3fcda83f3fe2e33f3fb1dea5ec3f3f3f3fe2e3a6cc3f4246
UTF-8 厭レ슕宥끿뼨矣뺤끽厭レ슌栒롧뼨矣μ럞BF 11100101100011101010110111100011100000111010110011101100100010101001010111100101101011101010010111101011100000011011111111101011101111001010100011100111100111111010001111101011101110101010010011101011100000011011110111100101100011101010110111100011100000111010110011101100100010101000110011100110101000001001001011101011101000011010011111101011101111001010100011100111100111111010001111001110101111001110101110011111100111100100001001000110 e58eade383acec8a95e5aea5eb81bfebbca8e79fa3ebbaa4eb81bde58eade383acec8a8ce6a092eba1a7ebbca8e79fa3cebceb9f9e4246
UHC 厭レ슕宥끿뼨矣뺤끽厭レ슌栒롧뼨矣μ럞BF 1110011011110100101010111110110010011010101001001110101011101001100001011110011110010110101010111110101111111000100101011110110010110011101000111110011011110100101010111110110010011010100111001110001011100011100011101110011110010110101010111110101111111000101001011110110010001110100000010100001001000110 e6f4abec9aa4eae985e796abebf895ecb3a3e6f4abec9a9ce2e38ee796abebf8a5ec8e814246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)