What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????n}???????????n{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 上゚痔濵汐シエ礁スシァn}上゚痔濵汐シエ礁スシァn{^ 10001111111000111101111110001110101001001111101101001101100011101010110010111100101101001000111111001010101111011011110010100111011011100111110110001111111000111101111110001110101001001111101101001101100011101010110010111100101101001000111111001010101111011011110010100111011011100111101101011110 8fe3df8ea4fb4d8eacbcb48fcabdbca76e7d8fe3df8ea4fb4d8eacbcb48fcabdbca76e7b5e
EUC-JP 上゚痔濵汐シエ礁スシァn}上゚痔濵汐シエ礁スシァn{^ 101111101110010110001110110111111011110010100110100011111100100110100110101111001010111010001110101111001000111010110100101111101100110010001110101111011000111010111100100011101010011101101110011111011011111011100101100011101101111110111100101001101000111111001001101001101011110010101110100011101011110010001110101101001011111011001100100011101011110110001110101111001000111010100111011011100111101101011110 bee58edfbca68fc9a6bcae8ebc8eb4becc8ebd8ebc8ea76e7dbee58edfbca68fc9a6bcae8ebc8eb4becc8ebd8ebc8ea76e7b5e
UTF-8 上゚痔濵汐シエ礁スシァn}上゚痔濵汐シエ礁スシァn{^ 1110010010111000100010101110111110111110100111111110011110010111100101001110011010111111101101011110011010110001100100001110111110111101101111001110111110111101101101001110011110100100100000011110111110111101101111011110111110111101101111001110111110111101101001110110111001111101111001001011100010001010111011111011111010011111111001111001011110010100111001101011111110110101111001101011000110010000111011111011110110111100111011111011110110110100111001111010010010000001111011111011110110111101111011111011110110111100111011111011110110100111011011100111101101011110 e4b88aefbe9fe79794e6bfb5e6b190efbdbcefbdb4e7a481efbdbdefbdbcefbda76e7de4b88aefbe9fe79794e6bfb5e6b190efbdbcefbdb4e7a481efbdbdefbdbcefbda76e7b5e
UHC 上?痔?汐??礁???n}上?痔?汐??礁???n{^ 1101111110111110001111111111011011000000001111111110000010110001001111110011111111110101101001110011111100111111001111110110111001111101110111111011111000111111111101101100000000111111111000001011000100111111001111111111010110100111001111110011111100111111011011100111101101011110 dfbe3ff6c03fe0b13f3ff5a73f3f3f6e7ddfbe3ff6c03fe0b13f3ff5a73f3f3f6e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
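
The conversion shown in the table can be approximated offline with a short Python sketch. The codec names in `CODECS` are an assumed mapping from the page's charset labels to Python's standard codecs, and the helper names (`encode_bytes`, `to_hex`, `to_bits`) are hypothetical. The converter behind this page may handle some edge cases differently.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bytes(text, codec):
    # errors="replace" emits "?" (0x3f) for characters the charset
    # cannot represent, which is what the ISO-8859-1 row shows.
    return text.encode(codec, errors="replace")


def to_hex(text, codec):
    # Hexadecimal form of the encoded bytes, two digits per byte.
    return encode_bytes(text, codec).hex()


def to_bits(text, codec):
    # Binary form of the encoded bytes, eight bits per byte.
    return "".join(f"{b:08b}" for b in encode_bytes(text, codec))


if __name__ == "__main__":
    for label, codec in CODECS.items():
        print(label, to_bits("上", codec), to_hex("上", codec))
```

For the single character 上, this gives `e4b88a` in UTF-8, `8fe3` in CP932, `bee5` in EUC-JP, and `3f` in ISO-8859-1, matching the leading bytes of the corresponding rows above.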