To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鼇??由ュ?喩??}鼇??由ュ?喩??{^ 1110101010000111001111110011111110010111010100101000001110000101001111111001101001100111001111110011111101111101111010101000011100111111001111111001011101010010100000111000010100111111100110100110011100111111001111110111101101011110 ea873f3f975283853f9a673f3f7dea873f3f975283853f9a673f3f7b5e
EUC-JP 鼇??由ュ?喩??}鼇??由ュ?喩??{^ 1111001111100111001111110011111111001101101100111010010111100101001111111101001111001000001111110011111101111101111100111110011100111111001111111100110110110011101001011110010100111111110100111100100000111111001111110111101101011110 f3e73f3fcdb3a5e53fd3c83f3f7df3e73f3fcdb3a5e53fd3c83f3f7b5e
UTF-8 鼇앸갑由ュㄾ喩붽굡}鼇앸갑由ュㄾ喩붽굡{^ 111010011011110010000111111011001001010110111000111010101011000010010001111001111001010010110001111000111000001110100101111000111000010010111110111001011001011010101001111010111011011010111101111010101011010110100001011111011110100110111100100001111110110010010101101110001110101010110000100100011110011110010100101100011110001110000011101001011110001110000100101111101110010110010110101010011110101110110110101111011110101010110101101000010111101101011110 e9bc87ec95b8eab091e794b1e383a5e384bee596a9ebb6bdeab5a17de9bc87ec95b8eab091e794b1e383a5e384bee596a9ebb6bdeab5a17b5e
UHC 鼇앸갑由ュㄾ喩붽굡}鼇앸갑由ュㄾ喩붽굡{^ 111010001010100010011101111010111011000010101001111010111010011010101011111001011010010010101110111010101110011110010100111010101011000110110110011111011110100010101000100111011110101110110000101010011110101110100110101010111110010110100100101011101110101011100111100101001110101010110001101101100111101101011110 e8a89debb0a9eba6abe5a4aeeae794eab1b67de8a89debb0a9eba6abe5a4aeeae794eab1b67b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)