What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????®??????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010111000111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3fae3f3f3f3f3f3f3f
SJIS-WIN 厭レ?艤??矣??厭レ?援??矣??癲?? 100010010111110110000011100011000011111111100100011111100011111100111111111000011110000100111111001111111000100101111101100000111000110000111111100010011000011100111111001111111110000111100001001111110011111111100001100111110011111100111111 897d838c3fe47e3f3fe1e13f3f897d838c3f89873f3fe1e13f3fe19f3f3f
EUC-JP 厭レ?艤??矣??厭レ?援®?矣??癲?? 1011000111011110101001011110110000111111111001111101111100111111001111111110001011100011001111110011111110110001110111101010010111101100001111111011000111100111100011111010001011101110001111111110001011100011001111110011111111100010101000010011111100111111 b1dea5ec3fe7df3f3fe2e33f3fb1dea5ec3fb1e78fa2ee3fe2e33f3fe2a13f3f
UTF-8 厭レ슃艤욜뼨矣ㅻ뮕厭レ슆援®뼨矣섑렦癲잛틳 1110010110001110101011011110001110000011101011001110110010001010100000111110100010001001101001001110110010011010100111001110101110111100101010001110011110011111101000111110001110000101101110111110101110101110100101011110010110001110101011011110001110000011101011001110110010001010100001101110011010001111101101001100001010101110111010111011110010101000111001111001111110100011111011001000010010010001111010111010000010100110111001111001100110110010111011001001111010011011111011011000101110110011 e58eade383acec8a83e889a4ec9a9cebbca8e79fa3e385bbebae95e58eade383acec8a86e68fb4c2aeebbca8e79fa3ec8491eba0a6e799b2ec9e9bed8bb3
UHC 厭レ슃艤욜뼨矣ㅻ뮕厭レ슆援®뼨矣섑렦癲잛틳 111001101111010010101011111011001001101010010101111010111111101010111111111001111001011010101011111010111111100010100100111010111001001010100001111001101111010010101011111011001001101010011000111010101011010110100010111001111001011010101011111010111111100010011000111011011000111010110101111011111010011010011111111011001011101010011011 e6f4abec9a95ebfabfe796abebf8a4eb92a1e6f4abec9a98eab5a2e796abebf898ed8eb5efa69fecba9b

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
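
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The mapping from the table's charset labels to Python codec names (for example, SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the use of errors="replace", which is inferred from the 3f ("?") bytes that appear wherever a character has no mapping. The names CODECS, encode_row and encode_table are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced by "?" (byte 3f),
    which is assumed to match the converter's behaviour.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # "\u53ad\u30ec" is the two-character prefix of the table's sample input.
    for label, (bits, hexstr) in encode_table("\u53ad\u30ec").items():
        print(label, bits, hexstr)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the two output columns always describe the same byte sequence. Results for individual characters may differ from the table where Python's codec tables differ from those used by the converter.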