What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 腸?┠窩??? | 10010010101100000011111110000100101101011110001001111100001111110011111100111111 | 92b03f84b5e27c3f3f3f
EUC-JP | 腸?┠窩??? | 11000100101100100011111110101000101101111110001111011101001111110011111100111111 | c4b23fa8b7e3dd3f3f3f
UTF-8 | 腸흙┠窩븍렜렜 | 111010001000010110111000111011011001110110011001111000101001010010100000111001111010101010101001111010111011100010001101111010111010000010011100111010111010000010011100 | e885b8ed9d99e294a0e7aaa9ebb88deba09ceba09c
UHC | 腸흙┠窩븍렜렜 | 1110110111110011110010001110101110100110101101111110100011000000101110101110101110001110101011101000111010101110 | edf3c8eba6b7e8c0baeb8eae8eae
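
The conversion shown in the table can be approximated outside this page. The following is a minimal Python sketch, not the page's actual implementation. The `CODECS` mapping and the `encode_row` helper are hypothetical names introduced here, and the choice of Python codecs (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption about which codec most closely matches each label. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec.

    Unencodable characters are replaced with '?' rather than raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "腸흙┠窩븍렜렜"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.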

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).