To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???淫????淫?^ 00111111001111110011111110001000111110100011111100111111001111110011111110001000111110100011111101011110 3f3f3f88fa3f3f3f3f88fa3f5e
EUC-JP ???淫????淫?^ 00111111001111110011111110110000111111000011111100111111001111110011111110110000111111000011111101011110 3f3f3fb0fc3f3f3f3fb0fc3f5e
UTF-8 捻곗넀淫륞捻곗넀淫륞^ 11101111101001101010010011101010101100111001011111101011100001001000000011100110101101111010101111101011101001011001111011101111101001101010010011101010101100111001011111101011100001001000000011100110101101111010101111101011101001011001111001011110 efa6a4eab397eb8480e6b7abeba59eefa6a4eab397eb8480e6b7abeba59e5e
UHC 捻곗넀淫륞捻곗넀淫륞^ 111001101111011110110000111011001000011010010000111010111110001010010000010001001110011011110111101100001110110010000110100100001110101111100010100100000100010001011110 e6f7b0ec8690ebe29044e6f7b0ec8690ebe290445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)