To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????h???????? 001111110011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f
SJIS-WIN 鱈湛遜誰遜賊誰遜属h鱈湛遜誰遜賊誰遜 1001001001001100100100100101100010010001101110111001001001001110100100011011101110010001101011111001001001001110100100011011101110010001101011100110100010010010010011001001001001011000100100011011101110010010010011101001000110111011100100011010111110010010010011101001000110111011 924c925891bb924e91bb91af924e91bb91ae68924c925891bb924e91bb91af924e91bb
EUC-JP 鱈湛遜誰遜賊誰遜属h鱈湛遜誰遜賊誰遜 1100001110101101110000111011100111000010101111011100001110101111110000101011110111000010101100011100001110101111110000101011110111000010101100000110100011000011101011011100001110111001110000101011110111000011101011111100001010111101110000101011000111000011101011111100001010111101 c3adc3b9c2bdc3afc2bdc2b1c3afc2bdc2b068c3adc3b9c2bdc3afc2bdc2b1c3afc2bd
UTF-8 鱈湛遜誰遜賊誰遜属h鱈湛遜誰遜賊誰遜 11101001101100011000100011100110101110011001101111101001100000011001110011101000101010101011000011101001100000011001110011101000101100111000101011101000101010101011000011101001100000011001110011100101101100011001111001101000111010011011000110001000111001101011100110011011111010011000000110011100111010001010101010110000111010011000000110011100111010001011001110001010111010001010101010110000111010011000000110011100 e9b188e6b99be9819ce8aab0e9819ce8b38ae8aab0e9819ce5b19e68e9b188e6b99be9819ce8aab0e9819ce8b38ae8aab0e9819c
UHC ?湛遜誰遜賊誰遜?h?湛遜誰遜賊誰遜 0011111111010011110000001110000111100001111000101100000111100001111000011110111011100100111000101100000111100001111000010011111101101000001111111101001111000000111000011110000111100010110000011110000111100001111011101110010011100010110000011110000111100001 3fd3c0e1e1e2c1e1e1eee4e2c1e1e13f683fd3c0e1e1e2c1e1e1eee4e2c1e1e1

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
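
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_lossy`, `to_hex`, and `to_bits` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (byte 3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_lossy(text: str, codec: str) -> bytes:
    """Encode text, substituting '?' for characters the codec cannot represent."""
    return text.encode(codec, errors="replace")


def to_hex(text: str, codec: str) -> str:
    """Return the encoded bytes as a lowercase hexadecimal string."""
    return encode_lossy(text, codec).hex()


def to_bits(text: str, codec: str) -> str:
    """Return the encoded bytes as a bit string, 8 bits per byte."""
    return "".join(f"{byte:08b}" for byte in encode_lossy(text, codec))


if __name__ == "__main__":
    sample = "h"  # any character or short string
    for label, codec in CODECS.items():
        print(label, sample, to_bits(sample, codec), to_hex(sample, codec))
```

For example, `to_hex("鱈", "utf-8")` gives `e9b188` and `to_hex("鱈", "euc_jp")` gives `c3ad`, matching the leading bytes of the UTF-8 and EUC-JP rows in the table.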