What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???v???vB | 001111110011111100111111011101100011111100111111001111110111011001000010 | 3f3f3f763f3f3f7642 |
| SJIS-WIN | 悠??v悠??vB | 1001011101001001001111110011111101110110100101110100100100111111001111110111011001000010 | 97493f3f7697493f3f7642 |
| EUC-JP | 悠??v悠??vB | 1100110110101010001111110011111101110110110011011010101000111111001111110111011001000010 | cdaa3f3f76cdaa3f3f7642 |
| UTF-8 | 悠덃늿v悠덃늿vB | 111001101000001010100000111010111000110110000011111010111000101010111111011101101110011010000010101000001110101110001101100000111110101110001010101111110111011001000010 | e682a0eb8d83eb8abf76e682a0eb8d83eb8abf7642 |
| UHC | 悠덃늿v悠덃늿vB | 111010101110110110001000111001101000100010001000011101101110101011101101100010001110011010001000100010000111011001000010 | eaed88e6888876eaed88e688887642 |

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
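
A conversion like the one in the table can be approximated in Python. This is a minimal sketch, not the converter's actual implementation. The mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, the function name `encode_row` is hypothetical, and the sample string is inferred from the UTF-8 row. Python's codecs may not agree with the converter byte for byte for every character.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for any character the
    # target charset cannot represent, like the "?" cells in the table.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "悠덃늿v悠덃늿vB"  # input inferred from the UTF-8 row
    for label, codec in CODECS.items():
        bits, hex_str = encode_row(sample, codec)
        print(label, bits, hex_str)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.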