To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 絶??絶??菴?????節??節??暗??孃?B 10010000111000100011111100111111100100001110001000111111001111111110010010111101001111110011111100111111001111110011111110010000110111110011111100111111100100001101111100111111001111111000100011000011001111110011111110011011011011110011111101000010 90e23f3f90e23f3fe4bd3f3f3f3f3f90df3f3f90df3f3f88c33f3f9b6f3f42
EUC-JP 絶??絶??菴?????節??節??暗??孃?B 11000000111001000011111100111111110000001110010000111111001111111110100010111111001111110011111100111111001111110011111111000000111000010011111100111111110000001110000100111111001111111011000011000101001111110011111111010101110100000011111101000010 c0e43f3fc0e43f3fe8bf3f3f3f3f3fc0e13f3fc0e13f3fb0c53f3fd5d03f42
UTF-8 絶껓숴絶귝툒菴득툒溫먲푵節ㅿ푴節쇽푴暗뗦툒孃뱘B 11100111101101011011011011101010101110111001001111101100100010001011010011100111101101011011011011101010101101111001110111101101100010001001001011101000100011111011010011101011100100111001110111101101100010001001001011100110101110101010101111101011101010001011001011101101100100011011010111100111101011111000000011100011100001011011111111101101100100011011010011100111101011111000000011101100100001111011110111101101100100011011010011100110100110101001011111101011100101111010011011101101100010001001001011100101101011011000001111101011101100011001100001000010 e7b5b6eabb93ec88b4e7b5b6eab79ded8892e88fb4eb939ded8892e6baabeba8b2ed91b5e7af80e385bfed91b4e7af80ec87bded91b4e69a97eb97a6ed8892e5ad83ebb19842
UHC 絶껓숴絶귝툒菴득툒溫먲푵節ㅿ푴節쇽푴暗뗦툒孃뱘B 1110111110111110100000111110111110111101101001001110111110111110100000101110011010111000100010011110010011100000101101011110011010111000100010011110100010101110100100001110111110111110100000111110111110111101101001001110111110111110100000101110111110111101101111001110111110111110100000101110010011011110100010111110011010111000100010011110010110111110100100110111100101000010 efbe83efbda4efbe82e6b889e4e0b5e6b889e8ae90efbe83efbda4efbe82efbdbcefbe82e4de8be6b889e5be937942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)