Which bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???v???vB | 001111110011111100111111011101100011111100111111001111110111011001000010 | 3f3f3f763f3f3f7642 |
| SJIS-WIN | 褫芽瓜v褫芽瓜vB | 111001011111001110001001111010001000100101011010011101101110010111110011100010011110100010001001010110100111011001000010 | e5f389e8895a76e5f389e8895a7642 |
| EUC-JP | 褫芽瓜v褫芽瓜vB | 111010101111010110110010111010101011000110111011011101101110101011110101101100101110101010110001101110110111011001000010 | eaf5b2eab1bb76eaf5b2eab1bb7642 |
| UTF-8 | 褫芽瓜v褫芽瓜vB | 111010001010010010101011111010001000101010111101111001111001001110011100011101101110100010100100101010111110100010001010101111011110011110010011100111000111011001000010 | e8a4abe88abde7939c76e8a4abe88abde7939c7642 |
| UHC | ?芽瓜v?芽瓜vB | 00111111111001001011010011001101111111100111011000111111111001001011010011001101111111100111011001000010 | 3fe4b4cdfe763fe4b4cdfe7642 |

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
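
The page does not say how the conversion is implemented, so the following is only a minimal Python sketch of the same idea: encode the input in each charset, then print the bytes as binary and hexadecimal. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (byte `3f`), as in the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    shown = data.decode(codec, errors="replace")      # what survives the round trip
    return shown, binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in convert("vB").items():
        print(label, shown, binary, hexa)
```

ASCII characters such as `v` and `B` produce the same bytes (`76`, `42`) in all five charsets, which is why every row of the table ends in `7642`.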