To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??碎??巍ル‘揖????韋?? 001111110011111100111111111010001110100000111111001111111110000111101010001111110011111110011011110110011000001110001011100000010110010110010111010010110011111100111111001111110011111111101000111010000011111100111111 3f3f3fe8e83f3fe1ea3f3f9bd9838b8165974b3f3f3f3fe8e83f3f
EUC-JP ???韋??碎??巍ル‘揖????韋?? 001111110011111100111111111100001110101000111111001111111110001011101100001111110011111111010110110110111010010111101011101000011100011011001101101011000011111100111111001111110011111111110000111010100011111100111111 3f3f3ff0ea3f3fe2ec3f3fd6dba5eba1c6cdac3f3f3f3ff0ea3f3f
UTF-8 嶺뚮뱪韋됬뙼碎ⓦ럷巍ル‘揖촪嶺뚮뱪韋됬뙼 111011111010011010101011111010111001101010101110111010111011000110101010111010011001111110001011111010111001000010101100111010111001100110111100111001111010001010001110111000101001001110100110111010111001111110110111111001011011011110001101111000111000001110101011111000101000000010011000111001101000111110010110111011001011010010101010111011111010011010101011111010111001101010101110111010111011000110101010111010011001111110001011111010111001000010101100111010111001100110111100 efa6abeb9aaeebb1aae99f8beb90aceb99bce7a28ee293a6eb9fb7e5b78de383abe28098e68f96ecb4aaefa6abeb9aaeebb1aae99f8beb90aceb99bc
UHC 嶺뚮뱪韋됬뙼碎ⓦ럷巍ル‘揖촪嶺뚮뱪韋됬뙼 11100111101011011000110011101011100100111001000011101010110111111000100111100111100011001011111111100001111011111010100011100011100011101001011011101000111001001010101111101011101000011010111011101011111001111010110001101000111001111010110110001100111010111001001110010000111010101101111110001001111001111000110010111111 e7ad8ceb9390eadf89e78cbfe1efa8e38e96e8e4abeba1aeebe7ac68e7ad8ceb9390eadf89e78cbf

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
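
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: it assumes that Python's codec names `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets labelled ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC above. The function name `encode_table` and the `CHARSETS` mapping are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("ル"):
        print(label, bits, hexstr)
```

For the single character "ル" this should give e383ab for UTF-8, 838b for SJIS-WIN, a5eb for EUC-JP, and abeb for UHC, which are the byte sequences that appear for that character in the rows above.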