To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z 00111111001111110011111100111111001111110011111100111111001111110011111101111010 3f3f3f3f3f3f3f3f3f7a
SJIS-WIN ???宥??違??z 001111110011111100111111100101110100011100111111001111111000100011100001001111110011111101111010 3f3f3f97473f3f88e13f3f7a
EUC-JP ???宥??違??z 001111110011111100111111110011011010100000111111001111111011000011100011001111110011111101111010 3f3f3fcda83f3fb0e33f3f7a
UTF-8 劣꾨툙宥랃쭛違곷땷z 11101111101001101001110111101010101111101010100011101101100010001001100111100101101011101010010111101011100111101000001111101100101011011001101111101001100000011001010111101010101100111011011111101011100101011011011101111010 efa69deabea8ed8899e5aea5eb9e83ecad9be98195eab3b7eb95b77a
UHC 劣꾨툙宥랃쭛違곷땷z 11100110111010111000010011101011101110001001000011101010111010011000110111101111101001111001000111101010110111101000000111101011100010111000110101111010 e6eb84ebb890eae98defa791eade81eb8b8d7a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)