To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?æ?v?æ?vB 001111111110011000111111011101100011111111100110001111110111011001000010 3fe63f763fe63f7642
SJIS-WIN ??朋v??朋vB 0011111100111111100101011111110001110110001111110011111110010101111111000111011001000010 3f3f95fc763f3f95fc7642
EUC-JP ?æ朋v?æ朋vB 001111111000111110101001110000011100101011111110011101100011111110001111101010011100000111001010111111100111011001000010 3f8fa9c1cafe763f8fa9c1cafe7642
UTF-8 룶æ朋v룶æ朋vB 11101011101000111011011011000011101001101110011010011100100010110111011011101011101000111011011011000011101001101110011010011100100010110111011001000010 eba3b6c3a6e69c8b76eba3b6c3a6e69c8b7642
UHC 룶æ朋v룶æ朋vB 100011111010101110101001101000011101110111011011011101101000111110101011101010011010000111011101110110110111011001000010 8faba9a1dddb768faba9a1dddb7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)