To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 驍丞ェ帝o蛟洪 11101001100000101000111111100101101010101001001011101001100000101000111111100101100000001000110101011110 e9828fe5aa92e9828fe5808d5e
EUC-JP 驍丞ェ帝o蛟洪 1111000111100010101111101110011110001110101010101100010011101011101000111110111111101001111000001011100110111111 f1e2bee78eaac4eba3efe9e0b9bf
UTF-8 驍丞ェ帝o蛟洪 111010011010100110001101111001001011100010011110111011111011110110101010111001011011100010011101111011111011110110001111111010001001101110011111111001101011010010101010 e9a98de4b89eefbdaae5b89defbd8fe89b9fe6b4aa
UHC 驍丞?帝o蛟洪 11111101101001001110001110101010001111111111000010101000101000111110111111001110111100011111101111110011 fda4e3aa3ff0a8a3efcef1fbf3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)