To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 詣??徇?????}詣??徇?????{^ 10001100011101110011111100111111100111000110110100111111001111110011111100111111001111110111110110001100011101110011111100111111100111000110110100111111001111110011111100111111001111110111101101011110 8c773f3f9c6d3f3f3f3f3f7d8c773f3f9c6d3f3f3f3f3f7b5e
EUC-JP 詣??徇?????}詣??徇?????{^ 10110111110110000011111100111111110101111100111000111111001111110011111100111111001111110111110110110111110110000011111100111111110101111100111000111111001111110011111100111111001111110111101101011110 b7d83f3fd7ce3f3f3f3f3f7db7d83f3fd7ce3f3f3f3f3f7b5e
UTF-8 詣꾡툧徇욆겏凉긱걨}詣꾡툧徇욆겏凉긱걨{^ 111010001010100110100011111010101011111010100001111011011000100010100111111001011011111010000111111011001001101010000110111010101011001010001111111011111010010110111001111010101011100010110001111010101011000110101000011111011110100010101001101000111110101010111110101000011110110110001000101001111110010110111110100001111110110010011010100001101110101010110010100011111110111110100101101110011110101010111000101100011110101010110001101010000111101101011110 e8a9a3eabea1ed88a7e5be87ec9a86eab28fefa5b9eab8b1eab1a87de8a9a3eabea1ed88a7e5be87ec9a86eab28fefa5b9eab8b1eab1a87b5e
UHC 詣꾡툧徇욆겏凉긱걨}詣꾡툧徇욆겏凉긱걨{^ 111001111110000110000100111001001011100010011110111000101101111110011110111010001000000110101000111001011011110010110001111000111000000110010001011111011110011111100001100001001110010010111000100111101110001011011111100111101110100010000001101010001110010110111100101100011110001110000001100100010111101101011110 e7e184e4b89ee2df9ee881a8e5bcb1e381917de7e184e4b89ee2df9ee881a8e5bcb1e381917b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
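
The rows in the table above can be approximated offline. The following is a minimal sketch, not the implementation behind this page. It assumes that Python's `cp932` codec stands in for SJIS-WIN and `cp949` for UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a binary bit string and as hexadecimal.
# Assumption: the Python codec names below correspond to the page's labels.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def encode_table(text):
    """Return (label, displayed text, bit string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        shown = data.decode(codec)  # what survives the round trip
        rows.append((label, shown, bits, data.hex()))
    return rows


# "\u8a63" is the character 詣, the first character of the sample input.
for label, shown, bits, hex_string in encode_table("\u8a63}"):
    print(label, bits, hex_string)
```

A character missing from a charset shows up as `3f` in the hexadecimal column, which is why the ISO-8859-1 row consists almost entirely of question marks.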