To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 額?ぉ??????}額?ぉ??????{^ 10001010011110100011111110000010101001110011111100111111001111110011111100111111001111110111110110001010011110100011111110000010101001110011111100111111001111110011111100111111001111110111101101011110 8a7a3f82a73f3f3f3f3f3f7d8a7a3f82a73f3f3f3f3f3f7b5e
EUC-JP 額?ぉ??????}額?ぉ??????{^ 10110011110110110011111110100100101010010011111100111111001111110011111100111111001111110111110110110011110110110011111110100100101010010011111100111111001111110011111100111111001111110111101101011110 b3db3fa4a93f3f3f3f3f3f7db3db3fa4a93f3f3f3f3f3f7b5e
UTF-8 額ㅻぉ溜깍쪚溜깊꽩}額ㅻぉ溜깍쪚溜깊꽩{^ 111010011010000110001101111000111000010110111011111000111000000110001001111011111010011110001011111010101011100110001101111011001010101010011010111011111010011110001011111010101011100110001010111010101011110110101001011111011110100110100001100011011110001110000101101110111110001110000001100010011110111110100111100010111110101010111001100011011110110010101010100110101110111110100111100010111110101010111001100010101110101010111101101010010111101101011110 e9a18de385bbe38189efa78beab98decaa9aefa78beab98aeabda97de9a18de385bbe38189efa78beab98decaa9aefa78beab98aeabda97b5e
UHC 額ㅻぉ溜깍쪚溜깊꽩}額ㅻぉ溜깍쪚溜깊꽩{^ 111001001111111010100100111010111010101010101001111010101111111010110001111011111010010110010011111010101111111010110001111011011000010010110100011111011110010011111110101001001110101110101010101010011110101011111110101100011110111110100101100100111110101011111110101100011110110110000100101101000111101101011110 e4fea4ebaaa9eafeb1efa593eafeb1ed84b47de4fea4ebaaa9eafeb1efa593eafeb1ed84b47b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)