What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鵝?????矮??}鵝?????矮??{^ 11101010010000000011111100111111001111110011111100111111111000011110001000111111001111110111110111101010010000000011111100111111001111110011111100111111111000011110001000111111001111110111101101011110 ea403f3f3f3f3fe1e23f3f7dea403f3f3f3f3fe1e23f3f7b5e
EUC-JP 鵝?????矮??}鵝?????矮??{^ 11110011101000010011111100111111001111110011111100111111111000101110010000111111001111110111110111110011101000010011111100111111001111110011111100111111111000101110010000111111001111110111101101011110 f3a13f3f3f3f3fe2e43f3f7df3a13f3f3f3f3fe2e43f3f7b5e
UTF-8 鵝롧쎊溜싲젷矮듬젎}鵝롧쎊溜싲젷矮듬젎{^ 111010011011010110011101111010111010000110100111111011001000111010001010111011111010011110001011111011001000101110110010111011001010000010110111111001111001111110101110111010111001001110101100111011001010000010001110011111011110100110110101100111011110101110100001101001111110110010001110100010101110111110100111100010111110110010001011101100101110110010100000101101111110011110011111101011101110101110010011101011001110110010100000100011100111101101011110 e9b59deba1a7ec8e8aefa78bec8bb2eca0b7e79faeeb93aceca08e7de9b59deba1a7ec8e8aefa78bec8bb2eca0b7e79faeeb93aceca08e7b5e
UHC 鵝롧쎊溜싲젷矮듬젎}鵝롧쎊溜싲젷矮듬젎{^ 111001001011110110001110111001111001101110110010111010101111111010011010111010111010000010101011111010001110000110110101111010111010000010001111011111011110010010111101100011101110011110011011101100101110101011111110100110101110101110100000101010111110100011100001101101011110101110100000100011110111101101011110 e4bd8ee79bb2eafe9aeba0abe8e1b5eba08f7de4bd8ee79bb2eafe9aeba0abe8e1b5eba08f7b5e
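A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the function name `encode_table` is hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table.

```python
# Sketch: encode one string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumed label-to-codec mapping: SJIS-WIN -> cp932, UHC -> cp949.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) rows for text."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("{^"):
        print(name, bits, hex_string)
```

For ASCII input such as `{^`, every charset listed here yields the same bytes (`7b5e`), matching the final bytes of each row in the table. The rows differ only for characters outside ASCII.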

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).