To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 宵ホ?貔将。娼ーW}宵ホ?貔将。娼ーW{^ 1000111110101010110011100011111111100110101111101000111110101011101000011000111110101001101100000101011101111101100011111010101011001110001111111110011010111110100011111010101110100001100011111010100110110000010101110111101101011110 8faace3fe6be8faba18fa9b0577d8faace3fe6be8faba18fa9b0577b5e
EUC-JP 宵ホ鑃貔将。娼ーW}宵ホ鑃貔将。娼ーW{^ 101111101010110010001110110011101000111111100101111010011110110011000000101111101010110110001110101000011011111010101011100011101011000001010111011111011011111010101100100011101100111010001111111001011110100111101100110000001011111010101101100011101010000110111110101010111000111010110000010101110111101101011110 beac8ece8fe5e9ecc0bead8ea1beab8eb0577dbeac8ece8fe5e9ecc0bead8ea1beab8eb0577b5e
UTF-8 宵ホ鑃貔将。娼ーW}宵ホ鑃貔将。娼ーW{^ 1110010110101110101101011110111110111110100011101110100110010001100000111110100010110010100101001110010110110000100001101110111110111101101000011110010110101000101111001110111110111101101100000101011101111101111001011010111010110101111011111011111010001110111010011001000110000011111010001011001010010100111001011011000010000110111011111011110110100001111001011010100010111100111011111011110110110000010101110111101101011110 e5aeb5efbe8ee99183e8b294e5b086efbda1e5a8bcefbdb0577de5aeb5efbe8ee99183e8b294e5b086efbda1e5a8bcefbdb0577b5e
UHC 宵?????娼?W}宵?????娼?W{^ 11100001101100100011111100111111001111110011111100111111111100111101111000111111010101110111110111100001101100100011111100111111001111110011111100111111111100111101111000111111010101110111101101011110 e1b23f3f3f3f3ff3de3f577de1b23f3f3f3f3ff3de3f577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)