To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 訝??節ф?預э?嵬??炎?????絶??^ 111001100110001000111111001111111001000011011111100001001000011000111111100101110110000110000100100011110011111110011011110010100011111100111111100010011000101000111111001111110011111100111111001111111001000011100010001111110011111101011110 e6623f3f90df84863f9761848f3f9bca3f3f898a3f3f3f3f3f90e23f3f5e
EUC-JP 訝??節ф?預э?嵬??炎?????絶??^ 111010111100001100111111001111111100000011100001101001111110011000111111110011011100001010100111111011110011111111010110110011000011111100111111101100011110101000111111001111110011111100111111001111111100000011100100001111110011111101011110 ebc33f3fc0e1a7e63fcdc2a7ef3fd6cc3f3fb1ea3f3f3f3f3fc0e43f3f5e
UTF-8 訝딁쐢節ф씭預э풌嵬뚥툓炎덌쉬呂얕뒳絶묊겮^ 1110100010101000100111011110101110010100100000011110110010010000101000101110011110101111100000001101000110000100111011001001010010101101111010011010000010010000110100011000110111101101100100101000110011100101101101011010110011101011100110101010010111101101100010001001001111100111100000101000111011101011100011011000110011101100100010011010110011101111101001101000000011101100100101101001010111101011100100101011001111100111101101011011011011101011101011001000101011101010101100101010111001011110 e8a89deb9481ec90a2e7af80d184ec94ade9a090d18ded928ce5b5aceb9aa5ed8893e7828eeb8d8cec89acefa680ec9695eb92b3e7b5b6ebac8aeab2ae5e
UHC 訝딁쐢節ф씭預э풌嵬뚥툓炎덌쉬呂얕뒳絶묊겮^ 11100100101110001000101011100111100111001000100011101111101111011010110011100110100111011011111011100111111010001010110011101111101111101001000111101000111000111000110011100100101110001000101011100110111110101000100011101111101111011010110011100101111110111011111011101000100010101010110011101111101111101001000111100111100000011011110001011110 e4b88ae79c88efbdace69dbee7e8acefbe91e8e38ce4b88ae6fa88efbdace5fbbee88aacefbe91e781bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)