To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????A?? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111010000010011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f413f3f
SJIS-WIN 竪端贈巽造他巽属遜奪属尊奪村側A竪端 1001001001000111100100100101101110010001101000011001001001000110100100011010001010010001101111001001001001000110100100011010111010010001101110111001001001000100100100011010111010010001101110001001001001000100100100011011101010010001101001000100000110010010010001111001001001011011 9247925b91a1924691a291bc924691ae91bb924491ae91b8924491ba91a4419247925b
EUC-JP 竪端贈巽造他巽属遜奪属尊奪村側A竪端 1100001110101000110000111011110011000010101000111100001110100111110000101010010011000010101111101100001110100111110000101011000011000010101111011100001110100101110000101011000011000010101110101100001110100101110000101011110011000010101001100100000111000011101010001100001110111100 c3a8c3bcc2a3c3a7c2a4c2bec3a7c2b0c2bdc3a5c2b0c2bac3a5c2bcc2a641c3a8c3bc
UTF-8 竪端贈巽造他巽属遜奪属尊奪村側A竪端 11100111101010111010101011100111101010111010111111101000101101001000100011100101101101111011110111101001100000001010000011100100101110111001011011100101101101111011110111100101101100011001111011101001100000011001110011100101101001011010101011100101101100011001111011100101101100001000101011100101101001011010101011100110100111011001000111100101100000011011010001000001111001111010101110101010111001111010101110101111 e7abaae7abafe8b488e5b7bde980a0e4bb96e5b7bde5b19ee9819ce5a5aae5b19ee5b08ae5a5aae69d91e581b441e7abaae7abaf
UHC 竪端贈巽造他巽?遜奪?尊奪村側A竪端 111000101011010111010011101011101111000111111100111000011101111011110000111000111111011011100010111000011101111000111111111000011110000111110111101011000011111111110000111011101111011110101100111101011011110111110110101100000100000111100010101101011101001110101110 e2b5d3aef1fce1def0e3f6e2e1de3fe1e1f7ac3ff0eef7acf5bdf6b041e2b5d3ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)