To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 菫茸羣竏・隱シ 111001001011111110010001111110011110001110111000111000101000100010100101111010001010101010111100 e4bf91f9e3b8e288a5e8aabc
EUC-JP 菫茸羣竏・隱シ 1110100011000001110000101111101111100110101110101110001111101000100011101010010111110000101011001000111010111100 e8c1c2fbe6bae3e88ea5f0ac8ebc
UTF-8 菫茸羣竏・隱シ 111010001000111110101011111010001000110010111000111001111011111010100011111001111010101110001111111011111011110110100101111010011001101010110001111011111011110110111100 e88fabe88cb8e7bea3e7ab8fefbda5e99ab1efbdbc
UHC 菫茸???隱? 11010000110010111110100111000111001111110011111100111111111010111101111100111111 d0cbe9c73f3f3febdf3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)