To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??z?????????z???????^ 0011111100111111100000101001101000111111001111110011111100111111001111110011111100111111001111110011111110000010100110100011111100111111001111110011111100111111001111110011111101011110 3f3f829a3f3f3f3f3f3f3f3f3f829a3f3f3f3f3f3f3f5e
EUC-JP 獐?z???獐???獐?z???獐???^ 10001111110010111011101000111111101000111111101000111111001111110011111110001111110010111011101000111111001111110011111110001111110010111011101000111111101000111111101000111111001111110011111110001111110010111011101000111111001111110011111101011110 8fcbba3fa3fa3f3f3f8fcbba3f3f3f8fcbba3fa3fa3f3f3f8fcbba3f3f3f5e
UTF-8 獐숀z欌쾡땄獐렜狀렢獐숀z欌쾡땄獐렜狀렢^ 11100111100011011001000011101100100010001000000011101111101111011001101011100110101011001000110011101100101111101010000111101011100101011000010011100111100011011001000011101011101000001001110011101111101001111011101011101011101000001010001011100111100011011001000011101100100010001000000011101111101111011001101011100110101011001000110011101100101111101010000111101011100101011000010011100111100011011001000011101011101000001001110011101111101001111011101011101011101000001010001001011110 e78d90ec8880efbd9ae6ac8cecbea1eb9584e78d90eba09cefa7baeba0a2e78d90ec8880efbd9ae6ac8cecbea1eb9584e78d90eba09cefa7baeba0a25e
UHC 獐숀z欌쾡땄獐렜狀렢獐숀z欌쾡땄獐렜狀렢^ 1110110111101111101111001111000010100011111110101110110111101011110001001110100110110110101001001110110111101111100011101010111011101101111011101000111010110011111011011110111110111100111100001010001111111010111011011110101111000100111010011011011010100100111011011110111110001110101011101110110111101110100011101011001101011110 edefbcf0a3faedebc4e9b6a4edef8eaeedee8eb3edefbcf0a3faedebc4e9b6a4edef8eaeedee8eb35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)