To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 驍ゑスエ陷搾スコ鬲 1110100110000010100000101110111110111101101101001110100010011100100011011110111110111101101110101110100110101101 e98282efbdb4e89c8defbdbae9ad
EUC-JP 驍ゑスエ陷搾スコ鬲 111100011110001010100100111100011000111010111101100011101011010011101111111111001011101011110001100011101011110110001110101110101111001010101111 f1e2a4f18ebd8eb4effcbaf18ebd8ebaf2af
UTF-8 驍ゑスエ陷搾スコ鬲 111010011010100110001101111000111000001010010001111011111011110110111101111011111011110110110100111010011001100110110111111001101001000010111110111011111011110110111101111011111011110110111010111010011010110010110010 e9a98de38291efbdbdefbdb4e999b7e690beefbdbdefbdbae9acb2
UHC 驍ゑ??陷搾??? 11111101101001001010101011110001001111110011111111111001111010001111001110110110001111110011111100111111 fda4aaf13f3ff9e8f3b63f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)