What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???要?????倭??節??渦o????渦??B 00111111001111110011111110010111011101100011111100111111001111110011111100111111100110000110000000111111001111111001000011011111001111110011111110001001010100011000001010001111001111110011111100111111001111111000100101010001001111110011111101000010 3f3f3f97763f3f3f3f3f98603f3f90df3f3f8951828f3f3f3f3f89513f3f42
EUC-JP 縕??要??縕??倭??節??渦o?縕??渦??B 10001111110101001100001000111111001111111100110111010111001111110011111110001111110101001100001000111111001111111100111111000001001111110011111111000000111000010011111100111111101100011011001010100011111011110011111110001111110101001100001000111111001111111011000110110010001111110011111101000010 8fd4c23f3fcdd73f3f8fd4c23f3fcfc13f3fc0e13f3fb1b2a3ef3f8fd4c23f3fb1b23f3f42
UTF-8 縕됵슴要뺧쉰縕됵슴倭롡댌節뱄슴渦o쉰縕됵슴渦뤄슉B 11100111101110001001010111101011100100001011010111101100100010101011010011101000101001101000000111101011101110101010011111101100100010011011000011100111101110001001010111101011100100001011010111101100100010101011010011100101100000001010110111101011101000011010000111101011100011001000110011100111101011111000000011101011101100011000010011101100100010101011010011100110101110001010011011101111101111011000111111101100100010011011000011100111101110001001010111101011100100001011010111101100100010101011010011100110101110001010011011101011101001001000010011101100100010101000100101000010 e7b895eb90b5ec8ab4e8a681ebbaa7ec89b0e7b895eb90b5ec8ab4e580adeba1a1eb8c8ce7af80ebb184ec8ab4e6b8a6efbd8fec89b0e7b895eb90b5ec8ab4e6b8a6eba484ec8a8942
UHC 縕됵슴要뺧쉰縕됵슴倭롡댌節뱄슴渦o쉰縕됵슴渦뤄슉B 11101000101100101000100111101111101111011011111111101001101010011001010111101111101111011010111011101000101100101000100111101111101111011011111111101000110111101000111011100010100010001011010111101111101111011011100111101111101111011011111111101000101111101010001111101111101111011010111011101000101100101000100111101111101111011011111111101000101111101011011111101111101111011011010101000010 e8b289efbdbfe9a995efbdaee8b289efbdbfe8de8ee288b5efbdb9efbdbfe8bea3efbdaee8b289efbdbfe8beb7efbdb542

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
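
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), as in the table above. Results for some characters may differ from this page, because each implementation's conversion tables vary slightly.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("B").items():
        print(label, bits, hex_string)
```

For the ASCII letter "B", every charset listed yields the single byte 42 (binary 01000010), which matches the last byte of each row in the table.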