To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 耶??以??淫??D耶??以??淫??D^ 100101101110101100111111001111111000100011001000001111110011111110001000111110100011111100111111010001001001011011101011001111110011111110001000110010000011111100111111100010001111101000111111001111110100010001011110 96eb3f3f88c83f3f88fa3f3f4496eb3f3f88c83f3f88fa3f3f445e
EUC-JP 耶??以??淫??D耶??以??淫??D^ 110011001110110100111111001111111011000011001010001111110011111110110000111111000011111100111111010001001100110011101101001111110011111110110000110010100011111100111111101100001111110000111111001111110100010001011110 cced3f3fb0ca3f3fb0fc3f3f44cced3f3fb0ca3f3fb0fc3f3f445e
UTF-8 耶껁굥以귝에淫뉙뫛D耶껁굥以귝에淫뉙뫛D^ 111010001000000010110110111010101011101110000001111010101011010110100101111001001011101110100101111010101011011110011101111011001001011110010000111001101011011110101011111010111000100110011001111010111010101110011011010001001110100010000000101101101110101010111011100000011110101010110101101001011110010010111011101001011110101010110111100111011110110010010111100100001110011010110111101010111110101110001001100110011110101110101011100110110100010001011110 e880b6eabb81eab5a5e4bba5eab79dec9790e6b7abeb8999ebab9b44e880b6eabb81eab5a5e4bba5eab79dec9790e6b7abeb8999ebab9b445e
UHC 耶껁굥以귝에淫뉙뫛D耶껁굥以귝에淫뉙뫛D^ 111001011010110110000011111000111000001010001011111011001010010010000010111001101011111110100001111010111110001010000111111011011001000110111011010001001110010110101101100000111110001110000010100010111110110010100100100000101110011010111111101000011110101111100010100001111110110110010001101110110100010001011110 e5ad83e3828beca482e6bfa1ebe287ed91bb44e5ad83e3828beca482e6bfa1ebe287ed91bb445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)