What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 鈔ア譚大蘒閼ア | 111001111110001010110001111001101001110110010001111001011111101110011111111010001000010010110001 | e7e2b1e69d91e5fb9fe884b1
EUC-JP | 鈔ア譚大?閼ア | 11101110111001001000111010110001111010111111110111000010111001110011111111101111111001001000111010110001 | eee48eb1ebfdc2e73fefe48eb1
UTF-8 | 鈔ア譚大蘒閼ア | 111010011000100010010100111011111011110110110001111010001010110110011010111001011010010010100111111011111010100010100000111010011001011010111100111011111011110110110001 | e98894efbdb1e8ad9ae5a4a7efa8a0e996bcefbdb1
UHC | ??譚大?閼? | 00111111001111111101001111001001110100111101111000111111111001001101100100111111 | 3f3fd3c9d3de3fe4d93f

SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
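
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the converter's actual implementation. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the function names are assumptions. Python's codec tables may also differ from the converter's for some vendor-specific characters, so the bytes are not guaranteed to match the table in every case. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the behaviour visible in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
# The real converter may use different conversion tables.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("A").items():
        print(label, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.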