What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 墺?ぉ??????}墺?ぉ??????{^ 10011010110100100011111110000010101001110011111100111111001111110011111100111111001111110111110110011010110100100011111110000010101001110011111100111111001111110011111100111111001111110111101101011110 9ad23f82a73f3f3f3f3f3f7d9ad23f82a73f3f3f3f3f3f7b5e
EUC-JP 墺?ぉ??????}墺?ぉ??????{^ 11010100110101000011111110100100101010010011111100111111001111110011111100111111001111110111110111010100110101000011111110100100101010010011111100111111001111110011111100111111001111110111101101011110 d4d43fa4a93f3f3f3f3f3f7dd4d43fa4a93f3f3f3f3f3f7b5e
UTF-8 墺싲ぉ溜볧쓼溜욌젳}墺싲ぉ溜볧쓼溜욌젳{^ 111001011010001010111010111011001000101110110010111000111000000110001001111011111010011110001011111010111011001110100111111011001001001110111100111011111010011110001011111011001001101010001100111011001010000010110011011111011110010110100010101110101110110010001011101100101110001110000001100010011110111110100111100010111110101110110011101001111110110010010011101111001110111110100111100010111110110010011010100011001110110010100000101100110111101101011110 e5a2baec8bb2e38189efa78bebb3a7ec93bcefa78bec9a8ceca0b37de5a2baec8bb2e38189efa78bebb3a7ec93bcefa78bec9a8ceca0b37b5e
UHC 墺싲ぉ溜볧쓼溜욌젳}墺싲ぉ溜볧쓼溜욌젳{^ 111001111111001010011010111010111010101010101001111010101111111010010011111011011001110110010111111010101111111010011110111010111010000010100111011111011110011111110010100110101110101110101010101010011110101011111110100100111110110110011101100101111110101011111110100111101110101110100000101001110111101101011110 e7f29aebaaa9eafe93ed9d97eafe9eeba0a77de7f29aebaaa9eafe93ed9d97eafe9eeba0a77b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
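
A conversion of this kind can be sketched in Python as follows. This is only an illustration, not the code behind this page: the function name encode_bits is hypothetical, and the mapping of the table's charset names to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for name, codec in CODECS.items():
        binary, hexadecimal = encode_bits("墺", codec)
        print(name, binary, hexadecimal)
```

For the single character 墺, this gives e5a2ba in UTF-8, 9ad2 in cp932, and d4d4 in EUC-JP, which agrees with the leading bytes of the corresponding rows in the table.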