What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 籤至窟褌順奨赳 1110001011011100100011101000101010001100010000011110010111101100100011111000011110001111101001111110011011100000 e2dc8e8a8c41e5ec8f878fa7e6e0
EUC-JP 籤至窟褌順奨赳 1110010011011110101110111110101010110111101000101110101011101110101111011110011110111110101010011110110011100010 e4debbeab7a2eaeebde7bea9ece2
UTF-8 籤至窟褌順奨赳 111001111011000110100100111010001000011110110011111001111010101010011111111010001010010010001100111010011010000010000110111001011010010110101000111010001011010110110011 e7b1a4e887b3e7aa9fe8a48ce9a086e5a5a8e8b5b3
UHC 籤至窟?順?赳 111101001101100111110010101110001100111111011111001111111110001011110111001111111101000010101111 f4d9f2b8cfdf3fe2f73fd0af
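A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper names (CHARSETS, encode_row, convert) are hypothetical, and it assumes that Python's cp932 codec corresponds to SJIS-WIN and its cp949 codec to UHC. Characters that a charset cannot represent are replaced with "?" (hex 3f), as in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the labels used in the table to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are assumptions of this sketch.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec)
    # Each byte becomes eight binary digits, zero-padded on the left.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Build one row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in convert("籤").items():
        print(label, shown, bits, hexa)
```

Other converters may use slightly different mapping tables for vendor extensions, so rows for SJIS-WIN or UHC produced this way may not match every tool byte for byte.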

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).