To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 曜???????”n}曜???????”n{^ 100101110110101000111111001111110011111100111111001111110011111100111111100000010110100001101110011111011001011101101010001111110011111100111111001111110011111100111111001111111000000101101000011011100111101101011110 976a3f3f3f3f3f3f3f81686e7d976a3f3f3f3f3f3f3f81686e7b5e
EUC-JP 曜???????”n}曜???????”n{^ 110011011100101100111111001111110011111100111111001111110011111100111111101000011100100101101110011111011100110111001011001111110011111100111111001111110011111100111111001111111010000111001001011011100111101101011110 cdcb3f3f3f3f3f3f3fa1c96e7dcdcb3f3f3f3f3f3f3fa1c96e7b5e
UTF-8 曜쒐쉸溜쒖뼏溜잛”n}曜쒐쉸溜쒖뼏溜잛”n{^ 1110011010011011100111001110110010010010100100001110110010001001101110001110111110100111100010111110110010010010100101101110101110111100100011111110111110100111100010111110110010011110100110111110001010000000100111010110111001111101111001101001101110011100111011001001001010010000111011001000100110111000111011111010011110001011111011001001001010010110111010111011110010001111111011111010011110001011111011001001111010011011111000101000000010011101011011100111101101011110 e69b9cec9290ec89b8efa78bec9296ebbc8fefa78bec9e9be2809d6e7de69b9cec9290ec89b8efa78bec9296ebbc8fefa78bec9e9be2809d6e7b5e
UHC 曜쒐쉸溜쒖뼏溜잛”n}曜쒐쉸溜쒖뼏溜잛”n{^ 1110100011111000100111001110011110011010100011101110101011111110100111001110110010010110100101111110101011111110100111111110110010100001101100010110111001111101111010001111100010011100111001111001101010001110111010101111111010011100111011001001011010010111111010101111111010011111111011001010000110110001011011100111101101011110 e8f89ce79a8eeafe9cec9697eafe9feca1b16e7de8f89ce79a8eeafe9cec9697eafe9feca1b16e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)