To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????R??????o??????]^ 00111111001111110011111100111111001111110011111101010010001111110011111100111111001111110011111100111111011011110011111100111111001111110011111100111111001111110101110101011110 3f3f3f3f3f3f523f3f3f3f3f3f6f3f3f3f3f3f3f5d5e
SJIS-WIN 誰遜賊誰遜束R誰遜賊誰遜束o誰遜賊誰遜束]^ 10010010010011101001000110111011100100011010111110010010010011101001000110111011100100011010100101010010100100100100111010010001101110111001000110101111100100100100111010010001101110111001000110101001011011111001001001001110100100011011101110010001101011111001001001001110100100011011101110010001101010010101110101011110 924e91bb91af924e91bb91a952924e91bb91af924e91bb91a96f924e91bb91af924e91bb91a95d5e
EUC-JP 誰遜賊誰遜束R誰遜賊誰遜束o誰遜賊誰遜束]^ 11000011101011111100001010111101110000101011000111000011101011111100001010111101110000101010101101010010110000111010111111000010101111011100001010110001110000111010111111000010101111011100001010101011011011111100001110101111110000101011110111000010101100011100001110101111110000101011110111000010101010110101110101011110 c3afc2bdc2b1c3afc2bdc2ab52c3afc2bdc2b1c3afc2bdc2ab6fc3afc2bdc2b1c3afc2bdc2ab5d5e
UTF-8 誰遜賊誰遜束R誰遜賊誰遜束o誰遜賊誰遜束]^ 11101000101010101011000011101001100000011001110011101000101100111000101011101000101010101011000011101001100000011001110011100110100111011001111101010010111010001010101010110000111010011000000110011100111010001011001110001010111010001010101010110000111010011000000110011100111001101001110110011111011011111110100010101010101100001110100110000001100111001110100010110011100010101110100010101010101100001110100110000001100111001110011010011101100111110101110101011110 e8aab0e9819ce8b38ae8aab0e9819ce69d9f52e8aab0e9819ce8b38ae8aab0e9819ce69d9f6fe8aab0e9819ce8b38ae8aab0e9819ce69d9f5d5e
UHC 誰遜賊誰遜束R誰遜賊誰遜束o誰遜賊誰遜束]^ 11100010110000011110000111100001111011101110010011100010110000011110000111100001111000011101011001010010111000101100000111100001111000011110111011100100111000101100000111100001111000011110000111010110011011111110001011000001111000011110000111101110111001001110001011000001111000011110000111100001110101100101110101011110 e2c1e1e1eee4e2c1e1e1e1d652e2c1e1e1eee4e2c1e1e1e1d66fe2c1e1e1eee4e2c1e1e1e1d65d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)