To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???沆▼?臾?? 001111110011111100111111111110101111011110000001101001010011111111100100011010110011111100111111 3f3f3ffaf781a53fe46b3f3f
EUC-JP ???沆▼?臾?? 00111111001111110011111110001111110001101110101010100010101001110011111111100111110011000011111100111111 3f3f3f8fc6eaa2a73fe7cc3f3f
UTF-8 令⒲꺁沆▼첎臾딅윣 111011111010011010101000111000101001001010110010111010101011101010000001111001101011001010000110111000101001011010111100111011001011001010001110111010001000011110111110111010111001010010000101111011001001110010100011 efa6a8e292b2eaba81e6b286e296bcecb28ee887beeb9485ec9ca3
UHC 令⒲꺁沆▼첎臾딅윣 111001111010100110101001111000111000001110101010111110011111101010100001111001011010101010011011111010111010110010001010111010111001111110100100 e7a9a9e383aaf9faa1e5aa9bebac8aeb9fa4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)