To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??誼③?猿??域??移?? 111000011001111100111111001111111000101101100010100001110100001000111111100010011000111000111111001111111000100011100110001111110011111110001000110110100011111100111111 e19f3f3f8b6287423f898e3f3f88e63f3f88da3f3f
EUC-JP 癲??誼??猿??域??移?? 1110001010100001001111110011111110110101110000110011111100111111101100011110111000111111001111111011000011101000001111110011111110110000110111000011111100111111 e2a13f3fb5c33f3fb1ee3f3fb0e83f3fb0dc3f3f
UTF-8 癲뗢뫀誼③뇦猿딅뼬域듈꽒移겻뭄 111001111001100110110010111010111001011110100010111010111010101110000000111010001010101010111100111000101001000110100010111010111000011110100110111001111000110010111111111010111001010010000101111010111011110010101100111001011001111110011111111010111001001110001000111010101011110110010010111001111010011110111011111010101011001010111011111010111010110110000100 e799b2eb97a2ebab80e8aabce291a2eb87a6e78cbfeb9485ebbcace59f9feb9388eabd92e7a7bbeab2bbebad84
UHC 癲뗢뫀誼③뇦猿딅뼬域듈꽒移겻뭄 111011111010011010001011111000101001000110100100111010111111111010101000111010011000011110001110111010101011101110001010111010111001011010101111111001101011010010110101111000101000010010100001111011001011100110110000111001001011100110110011 efa68be291a4ebfea8e9878eeabb8aeb96afe6b4b5e284a1ecb9b0e4b9b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)