To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鼇?????語⑤?}鼇?????語⑤?{^ 111010101000011100111111001111110011111100111111001111111000110011101010100001110100010000111111011111011110101010000111001111110011111100111111001111110011111110001100111010101000011101000100001111110111101101011110 ea873f3f3f3f3f8cea87443f7dea873f3f3f3f3f8cea87443f7b5e
EUC-JP 鼇?????語??}鼇?????語??{^ 11110011111001110011111100111111001111110011111100111111101110001110110000111111001111110111110111110011111001110011111100111111001111110011111100111111101110001110110000111111001111110111101101011110 f3e73f3f3f3f3fb8ec3f3f7df3e73f3f3f3f3fb8ec3f3f7b5e
UTF-8 鼇귞쭅溜롫젎語⑤젛}鼇귞쭅溜롫젎語⑤젛{^ 111010011011110010000111111010101011011110011110111011001010110110000101111011111010011110001011111010111010000110101011111011001010000010001110111010001010101010011110111000101001000110100100111011001010000010011011011111011110100110111100100001111110101010110111100111101110110010101101100001011110111110100111100010111110101110100001101010111110110010100000100011101110100010101010100111101110001010010001101001001110110010100000100110110111101101011110 e9bc87eab79eecad85efa78beba1abeca08ee8aa9ee291a4eca09b7de9bc87eab79eecad85efa78beba1abeca08ee8aa9ee291a4eca09b7b5e
UHC 鼇귞쭅溜롫젎語⑤젛}鼇귞쭅溜롫젎語⑤젛{^ 111010001010100010000010111001111010011110000001111010101111111010001110111010111010000010001111111001011101111010101000111010111010000010010111011111011110100010101000100000101110011110100111100000011110101011111110100011101110101110100000100011111110010111011110101010001110101110100000100101110111101101011110 e8a882e7a781eafe8eeba08fe5dea8eba0977de8a882e7a781eafe8eeba08fe5dea8eba0977b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)