To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???以??淫??n}???以??淫??n{^ 001111110011111100111111100010001100100000111111001111111000100011111010001111110011111101101110011111010011111100111111001111111000100011001000001111110011111110001000111110100011111100111111011011100111101101011110 3f3f3f88c83f3f88fa3f3f6e7d3f3f3f88c83f3f88fa3f3f6e7b5e
EUC-JP ???以??淫??n}???以??淫??n{^ 001111110011111100111111101100001100101000111111001111111011000011111100001111110011111101101110011111010011111100111111001111111011000011001010001111110011111110110000111111000011111100111111011011100111101101011110 3f3f3fb0ca3f3fb0fc3f3f6e7d3f3f3fb0ca3f3fb0fc3f3f6e7b5e
UTF-8 聯륁늾以귝에淫됯뭄n}聯륁늾以귝에淫됯뭄n{^ 1110111110100110100101111110101110100101100000011110101110001010101111101110010010111011101001011110101010110111100111011110110010010111100100001110011010110111101010111110101110010000101011111110101110101101100001000110111001111101111011111010011010010111111010111010010110000001111010111000101010111110111001001011101110100101111010101011011110011101111011001001011110010000111001101011011110101011111010111001000010101111111010111010110110000100011011100111101101011110 efa697eba581eb8abee4bba5eab79dec9790e6b7abeb90afebad846e7defa697eba581eb8abee4bba5eab79dec9790e6b7abeb90afebad846e7b5e
UHC 聯륁늾以귝에淫됯뭄n}聯륁늾以귝에淫됯뭄n{^ 1110011011100001100011111110110010001000100001111110110010100100100000101110011010111111101000011110101111100010100010011110101010111001101100110110111001111101111001101110000110001111111011001000100010000111111011001010010010000010111001101011111110100001111010111110001010001001111010101011100110110011011011100111101101011110 e6e18fec8887eca482e6bfa1ebe289eab9b36e7de6e18fec8887eca482e6bfa1ebe289eab9b36e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)