Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 玉??一??猿?? 100010111100101000111111001111111000100011101010001111110011111110001001100011100011111100111111 8bca3f3f88ea3f3f898e3f3f
EUC-JP 玉??一??猿?? 101101101100110000111111001111111011000011101100001111110011111110110001111011100011111100111111 b6cc3f3fb0ec3f3fb1ee3f3f
UTF-8 玉좊㈃一뜹푻猿껊렩 111001111000111010001001111011001010001010001010111000111000100010000011111001001011100010000000111010111001110010111001111011011001000110111011111001111000110010111111111010101011101110001010111010111010000010101001 e78e89eca28ae38883e4b880eb9cb9ed91bbe78cbfeabb8aeba0a9
UHC 玉좊㈃一뜹푻猿껊렩 111010001010110010100000111010111010100110110100111011001110100110110110111001011011111010000111111010101011101110000011111010111000111010110111 e8aca0eba9b4ece9b6e5be87eabb83eb8eb7
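A conversion like the one in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function name `to_bit_strings` and the label-to-codec mapping `CODECS` are assumptions made for illustration, and Python's `cp932`, `euc_jp` and `cp949` codecs may differ from the converter used here in some edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# SJIS-WIN is treated as CP932 and UHC as CP949 (assumption for this sketch).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return
    (round-tripped characters, binary bit string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    # Decoding the bytes again shows what actually survived the conversion.
    return data.decode(codec), binary, data.hex()


def convert_all(text):
    """Return one (label, characters, binary, hex) row per charset in CODECS."""
    return [(label, *to_bit_strings(text, codec)) for label, codec in CODECS.items()]
```

For example, `to_bit_strings("一", "utf-8")` returns `("一", "111001001011100010000000", "e4b880")`, the same bytes that appear for that character in the UTF-8 row.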

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).