To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 将、宵ォ「娼ホ将「宵苡ォ、宵ォ「娼ホ 100011111010101110100100100011111010101011110010100011111010101110100010100011111010100111001110100011111010101110100010100011111010101011100100100011111010101110100100100011111010101011110010100011111010101110100010100011111010100111001110 8faba48faaf28faba28fa9ce8faba28faae48faba48faaf28faba28fa9ce
EUC-JP 将、宵?ォ「娼ホ将「宵苡ォ、宵?ォ「娼ホ 1011111010101101100011101010010010111110101011000011111110001110101010111000111010100010101111101010101110001110110011101011111010101101100011101010001010111110101011001110011111101111100011101010101110001110101001001011111010101100001111111000111010101011100011101010001010111110101010111000111011001110 bead8ea4beac3f8eab8ea2beab8ecebead8ea2beace7ef8eab8ea4beac3f8eab8ea2beab8ece
UTF-8 将、宵ォ「娼ホ将「宵苡ォ、宵ォ「娼ホ 111001011011000010000110111011111011110110100100111001011010111010110101111011101000011110000110111011111011110110101011111011111011110110100010111001011010100010111100111011111011111010001110111001011011000010000110111011111011110110100010111001011010111010110101111010001000101110100001111011111011110110101011111011111011110110100100111001011010111010110101111011101000011110000110111011111011110110101011111011111011110110100010111001011010100010111100111011111011111010001110 e5b086efbda4e5aeb5ee8786efbdabefbda2e5a8bcefbe8ee5b086efbda2e5aeb5e88ba1efbdabefbda4e5aeb5ee8786efbdabefbda2e5a8bcefbe8e
UHC ??宵???娼???宵苡??宵???娼? 0011111100111111111000011011001000111111001111110011111111110011110111100011111100111111001111111110000110110010111011001011111000111111001111111110000110110010001111110011111100111111111100111101111000111111 3f3fe1b23f3f3ff3de3f3f3fe1b2ecbe3f3fe1b23f3f3ff3de3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)