To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????^????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f3f3f3f
SJIS-WIN ???渦o????橈??^???要????? 0011111100111111001111111000100101010001100000101000111100111111001111110011111100111111100111101111010000111111001111110101111000111111001111110011111110010111011101100011111100111111001111110011111100111111 3f3f3f8951828f3f3f3f3f9ef43f3f5e3f3f3f97763f3f3f3f3f
EUC-JP 縕??渦o?縕??橈??^縕??要??縕?? 10001111110101001100001000111111001111111011000110110010101000111110111100111111100011111101010011000010001111110011111111011100111101100011111100111111010111101000111111010100110000100011111100111111110011011101011100111111001111111000111111010100110000100011111100111111 8fd4c23f3fb1b2a3ef3f8fd4c23f3fdcf63f3f5e8fd4c23f3fcdd73f3f8fd4c23f3f
UTF-8 縕됵슴渦o쉰縕됵슴橈놅슛^縕됵슴要뺧쉰縕됵슴 11100111101110001001010111101011100100001011010111101100100010101011010011100110101110001010011011101111101111011000111111101100100010011011000011100111101110001001010111101011100100001011010111101100100010101011010011100110101010011000100011101011100001101000010111101100100010101001101101011110111001111011100010010101111010111001000010110101111011001000101010110100111010001010011010000001111010111011101010100111111011001000100110110000111001111011100010010101111010111001000010110101111011001000101010110100 e7b895eb90b5ec8ab4e6b8a6efbd8fec89b0e7b895eb90b5ec8ab4e6a988eb8685ec8a9b5ee7b895eb90b5ec8ab4e8a681ebbaa7ec89b0e7b895eb90b5ec8ab4
UHC 縕됵슴渦o쉰縕됵슴橈놅슛^縕됵슴要뺧쉰縕됵슴 11101000101100101000100111101111101111011011111111101000101111101010001111101111101111011010111011101000101100101000100111101111101111011011111111101000111110101000011011101111101111011011100001011110111010001011001010001001111011111011110110111111111010011010100110010101111011111011110110101110111010001011001010001001111011111011110110111111 e8b289efbdbfe8bea3efbdaee8b289efbdbfe8fa86efbdb85ee8b289efbdbfe9a995efbdaee8b289efbdbf

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
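
The kind of conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the `encode_table` helper are assumptions made for illustration, and Python's codecs may differ from this tool's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


# Print one line per charset for a two-character sample input.
for label, bits, hex_string in encode_table("渦^"):
    print(label, bits, hex_string)
```

For the sample input, the UTF-8 row is `e6b8a65e` and the ISO-8859-1 row is `3f5e`, because the kanji has no ISO-8859-1 representation.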