To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 巽息息谷息即谷脱賊巽息息谷息賊谷村即B 10010010010001101001000110100111100100011010011110010010010010101001000110100111100100011010011010010010010010101001001001000101100100011010111110010010010001101001000110100111100100011010011110010010010010101001000110100111100100011010111110010010010010101001000110111010100100011010011001000010 924691a791a7924a91a791a6924a924591af924691a791a7924a91a791af924a91ba91a642
EUC-JP 巽息息谷息即谷脱賊巽息息谷息賊谷村即B 11000011101001111100001010101001110000101010100111000011101010111100001010101001110000101010100011000011101010111100001110100110110000101011000111000011101001111100001010101001110000101010100111000011101010111100001010101001110000101011000111000011101010111100001010111100110000101010100001000010 c3a7c2a9c2a9c3abc2a9c2a8c3abc3a6c2b1c3a7c2a9c2a9c3abc2a9c2b1c3abc2bcc2a842
UTF-8 巽息息谷息即谷脱賊巽息息谷息賊谷村即B 11100101101101111011110111100110100000011010111111100110100000011010111111101000101100001011011111100110100000011010111111100101100011011011001111101000101100001011011111101000100001001011000111101000101100111000101011100101101101111011110111100110100000011010111111100110100000011010111111101000101100001011011111100110100000011010111111101000101100111000101011101000101100001011011111100110100111011001000111100101100011011011001101000010 e5b7bde681afe681afe8b0b7e681afe58db3e8b0b7e884b1e8b38ae5b7bde681afe681afe8b0b7e681afe8b38ae8b0b7e69d91e58db342
UHC 巽息息谷息?谷?賊巽息息谷息賊谷村?B 11100001110111101110001111010011111000111101001111001101110110111110001111010011001111111100110111011011001111111110111011100100111000011101111011100011110100111110001111010011110011011101101111100011110100111110111011100100110011011101101111110101101111010011111101000010 e1dee3d3e3d3cddbe3d33fcddb3feee4e1dee3d3e3d3cddbe3d3eee4cddbf5bd3f42

SJIS-WIN, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP) respectively.
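
The conversion shown in the table can be approximated with a minimal Python sketch. It is not this tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names to_bit_strings and convert. Characters a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed above, keyed by table label."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hexadecimal string.
    for label, (binary, hexadecimal) in convert("巽B").items():
        print(label, binary, hexadecimal)
```

Other Python codecs may differ from the tool's conversion tables for some characters, so results for rare characters may not match the table exactly.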