To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 鼎???v鼎???vB 10010011010000110011111100111111001111110111011010010011010000110011111100111111001111110111011001000010 93433f3f3f7693433f3f3f7642
EUC-JP 鼎???v鼎???vB 11000101101001000011111100111111001111110111011011000101101001000011111100111111001111110111011001000010 c5a43f3f3f76c5a43f3f3f7642
UTF-8 鼎陋렊햇v鼎陋렊햇vB 111010011011110010001110111011111010010110010001111010111010000010001010111011011001011010000111011101101110100110111100100011101110111110100101100100011110101110100000100010101110110110010110100001110111011001000010 e9bc8eefa591eba08aed968776e9bc8eefa591eba08aed96877642
UHC 鼎陋렊햇v鼎陋렊햇vB 11110000101000111101001011101011100011101010000111000111110111100111011011110000101000111101001011101011100011101010000111000111110111100111011001000010 f0a3d2eb8ea1c7de76f0a3d2eb8ea1c7de7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)