To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ???楯?)猥 00111111001111110011111110001111011111000011111110000001011010101110000011001110 3f3f3f8f7c3f816ae0ce
EUC-JP ???楯?)猥 00111111001111110011111110111101110111010011111110100001110010111110000011010000 3f3f3fbddd3fa1cbe0d0
UTF-8 琉딁룺楯귣)猥 111011111010011110001100111010111001010010000001111010111010001110111010111001101010010110101111111010101011011110100011111011111011110010001001111001111000110010100101 efa78ceb9481eba3bae6a5afeab7a3efbc89e78ca5
UHC 琉딁룺楯귣)猥 1110101110100100100010101110011110001111101011011110001011100100100000101110101110100011101010011110100011100101 eba48ae78fade2e482eba3a9e8e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)