What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 吟℡?蘖豆?? | 1000101111100001100001111000010000111111100111110101000010010011101001000011111100111111 | 8be187843f9f5093a43f3f
EUC-JP | 吟??蘖豆?? | 10110110111000110011111100111111110111011011000111000110101001100011111100111111 | b6e33f3fddb1c6a63f3f
UTF-8 | 吟℡썬蘖豆렎렩 | 111001011001000010011111111000101000010010100001111011001000110110101100111010001001100010010110111010001011000110000110111010111010000010001110111010111010000010101001 | e5909fe284a1ec8dace89896e8b186eba08eeba0a9
UHC | 吟℡썬蘖豆렎렩 | 1110101111100001101000101110010110111101111000111110010111101110110101001110011110001110101001001000111010110111 | ebe1a2e5bde3e5eed4e78ea48eb7

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
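
The conversion in the table above can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The function name `encode_report` and the mapping from the page's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the "?" entries in the table. Results for the Japanese and Korean codecs may differ from the table in places, since mapping tables vary between implementations.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# These pairings are assumptions, not taken from the page itself.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


# Print one line per charset for the sample input used in the table.
for label, codec in CHARSETS.items():
    bits, hex_string = encode_report("吟℡썬蘖豆렎렩", codec)
    print(label, bits, hex_string)
```

Keeping the label-to-codec mapping in one dictionary makes it easy to add or swap a charset without touching the encoding logic.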