To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????????щ??k?鶯?????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111100001001000101100111111001111111000001010001011001111111110100111110010001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f848b3f3f828b3fe9f23f3f3f3f3f5e
EUC-JP ?????????馹щ??k?鶯?????^ 001111110011111100111111001111110011111100111111001111110011111100111111100011111110100110100001101001111110101100111111001111111010001111101011001111111111001011110100001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f8fe9a1a7eb3f3fa3eb3ff2f43f3f3f3f3f5e
UTF-8 溜삳젫溜삣짍溜잙졎馹щ졎溜k졎鶯숇젧溜삵벉^ 111011111010011110001011111011001000001010110011111011001010000010101011111011111010011110001011111011001000001010100011111011001010011110001101111011111010011110001011111011001001111010011001111011001010000110001110111010011010011010111001110100011000100111101100101000011000111011101111101001111000101111101111101111011000101111101100101000011000111011101001101101101010111111101100100010001000011111101100101000001010011111101111101001111000101111101100100000101011010111101011101100101000100101011110 efa78bec82b3eca0abefa78bec82a3eca78defa78bec9e99eca18ee9a6b9d189eca18eefa78befbd8beca18ee9b6afec8887eca0a7efa78bec82b5ebb2895e
UHC 溜삳젫溜삣짍溜잙졎馹щ졎溜k졎鶯숇젧溜삵벉^ 11101010111111101011101111101011101000001010001111101010111111101011101111100101101000111001100111101010111111101001111111101011101000001011101111101100111100011010110011101011101000001011101111101010111111101010001111101011101000001011101111100101101000111001100111101011101000001001111111101010111111101011101111101101100100111010110001011110 eafebbeba0a3eafebbe5a399eafe9feba0bbecf1aceba0bbeafea3eba0bbe5a399eba09feafebbed93ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)