To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎?????矣??如??修??猿??娃?? 111000001011111000111111001111110011111100111111001111111110000111100001001111110011111110010100010000000011111100111111100011110100001100111111001111111000100110001110001111110011111110001000101000010011111100111111 e0be3f3f3f3f3fe1e13f3f94403f3f8f433f3f898e3f3f88a13f3f
EUC-JP 狎?????矣??如??修??猿??娃?? 111000001100000000111111001111110011111100111111001111111110001011100011001111110011111111000111101000010011111100111111101111011010010000111111001111111011000111101110001111110011111110110000101000110011111100111111 e0c03f3f3f3f3fe2e33f3fc7a13f3fbda43f3fb1ee3f3fb0a33f3f
UTF-8 狎띠떜杻삼㎖矣뺣럡如붴뫖修뗤벧猿딆뒢娃뺣찆 111001111000101110001110111010111001110110100000111010111001011010011100111011111010011110001000111011001000001010111100111000111000111010010110111001111001111110100011111010111011101010100011111010111001111110100001111001011010011010000010111010111011011010110100111010111010101110010110111001001011111110101110111010111001011110100100111010111011001010100111111001111000110010111111111010111001010010000110111010111001001010100010111001011010100010000011111010111011101010100011111011001011000010000110 e78b8eeb9da0eb969cefa788ec82bce38e96e79fa3ebbaa3eb9fa1e5a682ebb6b4ebab96e4bfaeeb97a4ebb2a7e78cbfeb9486eb92a2e5a883ebbaa3ecb086
UHC 狎띠떜杻삼㎖矣뺣럡如붴뫖修뗤벧猿딆뒢娃뺣찆 111001001110010010110110111011001000101110110010111010101111010010111011111011111010011110100010111010111111100010010101111010111000111010000100111001011111110110010100111000101001000110111000111000011111001110001011111001001011101010100110111010101011101110001010111011001000101010011110111010001101111110010101111010111010100110001010 e4e4b6ec8bb2eaf4bbefa7a2ebf895eb8e84e5fd94e291b8e1f38be4baa6eabb8aec8a9ee8df95eba98a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)