To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ???梧??釗??????????????\ 001111110011111100111111100011001110011000111111001111111111101110111011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f8ce63f3ffbbb3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
EUC-JP ???梧??釗??????????????\ 00111111001111110011111110111000111010000011111100111111100011111110001110100110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3fb8e83f3f8fe3a63f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
UTF-8 溜삠뀛梧노졎釗숇젔溜삠뀛溜붾졎栒붷뵜栒붾젧\ 11101111101001111000101111101100100000101010000011101011100000001001101111100110101000101010011111101011100001011011100011101100101000011000111011101001100001111001011111101100100010001000011111101100101000001001010011101111101001111000101111101100100000101010000011101011100000001001101111101111101001111000101111101011101101101011111011101100101000011000111011100110101000001001001011101011101101101011011111101011101101011001110011100110101000001001001011101011101101101011111011101100101000001010011101011100 efa78bec82a0eb809be6a2a7eb85b8eca18ee98797ec8887eca094efa78bec82a0eb809befa78bebb6beeca18ee6a092ebb6b7ebb59ce6a092ebb6beeca0a75c
UHC 溜삠뀛梧노졎釗숇젔溜삠뀛溜붾졎栒붷뵜栒붾젧\ 11101010111111101011101111100011100001011001010011100111111111001011001111101011101000001011101111100001111100101001100111101011101000001001001011101010111111101011101111100011100001011001010011101010111111101001010011101011101000001011101111100010111000111001010011100101100101001001110011100010111000111001010011101011101000001001111101011100 eafebbe38594e7fcb3eba0bbe1f299eba092eafebbe38594eafe94eba0bbe2e394e5949ce2e394eba09f5c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)