To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 郢樣≠繧幄ュッ 111001111011100110011110111010011000000110000010111000111000001010011011111010001010110110101111 e7b99ee98182e3829be8adaf
EUC-JP 郢樣≠繧幄ュッ 1110111010111011110111001110101110100001111000101110010111100010110101101110101010001110101011011000111010101111 eebbdceba1e2e5e2d6ea8ead8eaf
UTF-8 郢樣≠繧幄ュッ 111010011000001110100010111001101010100010100011111000101000100110100000111001111011100110100111111001011011100110000100111011111011110110101101111011111011110110101111 e983a2e6a8a3e289a0e7b9a7e5b984efbdadefbdaf
UHC ?樣≠?幄?? 00111111111001011100011010100001110000010011111111100100110000010011111100111111 3fe5c6a1c13fe4c13f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)