To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????W 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f57
SJIS-WIN ?????????????????????W 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f57
EUC-JP ?????????????????????W 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f57
UTF-8 챌혶혴챤혝쨀챤혚혟챈혲쨀챦쩐혺챦쩐혞챤혗쨘W 11101100101100011000110011101101100110001011011011101101100110001011010011101100101100011010010011101101100110001001110111101100101010001000000011101100101100011010010011101101100110001001101011101101100110001001111111101100101100011000100011101101100110001011001011101100101010001000000011101100101100011010011011101100101010011001000011101101100110001011101011101100101100011010011011101100101010011001000011101101100110001001111011101100101100011010010011101101100110001001011111101100101010001001100001010111 ecb18ced98b6ed98b4ecb1a4ed989deca880ecb1a4ed989aed989fecb188ed98b2eca880ecb1a6eca990ed98baecb1a6eca990ed989eecb1a4ed9897eca89857
UHC 챌혶혴챤혝쨀챤혚혟챈혲쨀챦쩐혺챦쩐혞챤혗쨘W 11000011101001111100001010011101110000101001101111000011101011101100001010000111110000101011001111000011101011101100001010000101110000101000100111000011101001101100001010011001110000101011001111000011101011111100001010111110110000101001111111000011101011111100001010111110110000101000100011000011101011101100001010000010110000101011101001010111 c3a7c29dc29bc3aec287c2b3c3aec285c289c3a6c299c2b3c3afc2bec29fc3afc2bec288c3aec282c2ba57

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)