To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 堤?兆???耳??釜堤?兆???耳??釜^ 1001001011100111001111111001001010011011001111110011111100111111100011101010100000111111001111111000101010011000100100101110011100111111100100101001101100111111001111110011111110001110101010000011111100111111100010101001100001011110 92e73f929b3f3f3f8ea83f3f8a9892e73f929b3f3f3f8ea83f3f8a985e
EUC-JP 堤?兆???耳??釜堤?兆???耳??釜^ 1100010011101001001111111100001111111011001111110011111100111111101111001010101000111111001111111011001111111000110001001110100100111111110000111111101100111111001111110011111110111100101010100011111100111111101100111111100001011110 c4e93fc3fb3f3f3fbcaa3f3fb3f8c4e93fc3fb3f3f3fbcaa3f3fb3f85e
UTF-8 堤렞兆닻렗렢耳렰渽釜堤렞兆닻렗렢耳렰渽釜^ 11100101101000001010010011101011101000001001111011100101100001011000011011101011100010111011101111101011101000001001011111101011101000001010001011101000100000001011001111101011101000001011000011100110101110001011110111101001100001111001110011100101101000001010010011101011101000001001111011100101100001011000011011101011100010111011101111101011101000001001011111101011101000001010001011101000100000001011001111101011101000001011000011100110101110001011110111101001100001111001110001011110 e5a0a4eba09ee58586eb8bbbeba097eba0a2e880b3eba0b0e6b8bde9879ce5a0a4eba09ee58586eb8bbbeba097eba0a2e880b3eba0b0e6b8bde9879c5e
UHC 堤렞兆닻렗렢耳렰渽釜堤렞兆닻렗렢耳렰渽釜^ 1111000010100111100011101010111111110000101111001011010011101001100011101010110010001110101100111110110010111100100011101011110111101110101010101101110110111100111100001010011110001110101011111111000010111100101101001110100110001110101011001000111010110011111011001011110010001110101111011110111010101010110111011011110001011110 f0a78eaff0bcb4e98eac8eb3ecbc8ebdeeaaddbcf0a78eaff0bcb4e98eac8eb3ecbc8ebdeeaaddbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)