What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???^h???^fN}???^h???^fN{^ 00111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111110100111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111101101011110 3f3f3f5e683f3f3f5e664e7d3f3f3f5e683f3f3f5e664e7b5e
SJIS-WIN 厭??^h厭??^fN}厭??^h厭??^fN{^ 1000100101111101001111110011111101011110011010001000100101111101001111110011111101011110011001100100111001111101100010010111110100111111001111110101111001101000100010010111110100111111001111110101111001100110010011100111101101011110 897d3f3f5e68897d3f3f5e664e7d897d3f3f5e68897d3f3f5e664e7b5e
EUC-JP 厭??^h厭??^fN}厭??^h厭??^fN{^ 1011000111011110001111110011111101011110011010001011000111011110001111110011111101011110011001100100111001111101101100011101111000111111001111110101111001101000101100011101111000111111001111110101111001100110010011100111101101011110 b1de3f3f5e68b1de3f3f5e664e7db1de3f3f5e68b1de3f3f5e664e7b5e
UTF-8 厭깒돦^h厭깒돦^fN}厭깒돦^h厭깒돦^fN{^ 11100101100011101010110111101010101110011001001011101011100011111010011001011110011010001110010110001110101011011110101010111001100100101110101110001111101001100101111001100110010011100111110111100101100011101010110111101010101110011001001011101011100011111010011001011110011010001110010110001110101011011110101010111001100100101110101110001111101001100101111001100110010011100111101101011110 e58eadeab992eb8fa65e68e58eadeab992eb8fa65e664e7de58eadeab992eb8fa65e68e58eadeab992eb8fa65e664e7b5e
UHC 厭깒돦^h厭깒돦^fN}厭깒돦^h厭깒돦^fN{^ 11100110111101001000001110001100100010011010101001011110011010001110011011110100100000111000110010001001101010100101111001100110010011100111110111100110111101001000001110001100100010011010101001011110011010001110011011110100100000111000110010001001101010100101111001100110010011100111101101011110 e6f4838c89aa5e68e6f4838c89aa5e664e7de6f4838c89aa5e68e6f4838c89aa5e664e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
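
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The `CODECS` mapping and the `to_bit_strings` helper are hypothetical names, and the pairing of each charset label with a Python codec (for example UHC with `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("厭", codec)
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.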