To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 巽其遜脱属卒脱袖存z巽其遜脱属卒脱袖存zB 100100100100011010010001101101001001000110111011100100100100010110010001101011101001000110110010100100100100010110010001101100111001000110110110011110101001001001000110100100011011010010010001101110111001001001000101100100011010111010010001101100101001001001000101100100011011001110010001101101100111101001000010 924691b491bb924591ae91b2924591b391b67a924691b491bb924591ae91b2924591b391b67a42
EUC-JP 巽其遜脱属卒脱袖存z巽其遜脱属卒脱袖存zB 110000111010011111000010101101101100001010111101110000111010011011000010101100001100001010110100110000111010011011000010101101011100001010111000011110101100001110100111110000101011011011000010101111011100001110100110110000101011000011000010101101001100001110100110110000101011010111000010101110000111101001000010 c3a7c2b6c2bdc3a6c2b0c2b4c3a6c2b5c2b87ac3a7c2b6c2bdc3a6c2b0c2b4c3a6c2b5c2b87a42
UTF-8 巽其遜脱属卒脱袖存z巽其遜脱属卒脱袖存zB 111001011011011110111101111001011000010110110110111010011000000110011100111010001000010010110001111001011011000110011110111001011000110110010010111010001000010010110001111010001010001010010110111001011010110110011000011110101110010110110111101111011110010110000101101101101110100110000001100111001110100010000100101100011110010110110001100111101110010110001101100100101110100010000100101100011110100010100010100101101110010110101101100110000111101001000010 e5b7bde585b6e9819ce884b1e5b19ee58d92e884b1e8a296e5ad987ae5b7bde585b6e9819ce884b1e5b19ee58d92e884b1e8a296e5ad987a42
UHC 巽其遜??卒?袖存z巽其遜??卒?袖存zB 111000011101111011010000111011001110000111100001001111110011111111110000111011110011111111100010110000001111000011101101011110101110000111011110110100001110110011100001111000010011111100111111111100001110111100111111111000101100000011110000111011010111101001000010 e1ded0ece1e13f3ff0ef3fe2c0f0ed7ae1ded0ece1e13f3ff0ef3fe2c0f0ed7a42

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
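
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the helper name encode_bits. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Each byte is rendered as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "巽z"
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Running the script prints one line per charset in the same order as the table: the charset label, the binary bit string, and the hexadecimal bit string.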