To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 節??厓???ユ?弱??節??厓???ユ?弱??^ 100100001101111100111111001111111111101010001101001111110011111100111111100000111000011000111111100011101110001100111111001111111001000011011111001111110011111111111010100011010011111100111111001111111000001110000110001111111000111011100011001111110011111101011110 90df3f3ffa8d3f3f3f83863f8ee33f3f90df3f3ffa8d3f3f3f83863f8ee33f3f5e
EUC-JP 節??厓???ユ?弱??節??厓???ユ?弱??^ 1100000011100001001111110011111110001111101101001100011100111111001111110011111110100101111001100011111110111100111001010011111100111111110000001110000100111111001111111000111110110100110001110011111100111111001111111010010111100110001111111011110011100101001111110011111101011110 c0e13f3f8fb4c73f3f3fa5e63fbce53f3fc0e13f3f8fb4c73f3f3fa5e63fbce53f3f5e
UTF-8 節쇽푶厓김뒡樂ユ윺弱꾦뀒節쇽푶厓김뒡樂ユ윺弱꾤겮^ 11100111101011111000000011101100100001111011110111101101100100011011011011100101100011101001001111101010101110011000000011101011100100101010000111101111101001101011111111100011100000111010011011101100100111001011101011100101101111001011000111101010101111101010011011101011100000001001001011100111101011111000000011101100100001111011110111101101100100011011011011100101100011101001001111101010101110011000000011101011100100101010000111101111101001101011111111100011100000111010011011101100100111001011101011100101101111001011000111101010101111101010010011101010101100101010111001011110 e7af80ec87bded91b6e58e93eab980eb92a1efa6bfe383a6ec9cbae5bcb1eabea6eb8092e7af80ec87bded91b6e58e93eab980eb92a1efa6bfe383a6ec9cbae5bcb1eabea4eab2ae5e
UHC 節쇽푶厓김뒡樂ユ윺弱꾦뀒節쇽푶厓김뒡樂ユ윺弱꾤겮^ 11101111101111011011110011101111101111101000010011100100111011011011000111101000100010101001110111101000111110011010101111100110100111111011010011100101101100001000010011101001100001011000110011101111101111011011110011101111101111101000010011100100111011011011000111101000100010101001110111101000111110011010101111100110100111111011010011100101101100001000010011100111100000011011110001011110 efbdbcefbe84e4edb1e88a9de8f9abe69fb4e5b084e9858cefbdbcefbe84e4edb1e88a9de8f9abe69fb4e5b084e781bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)