To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 顫磯、捺、狗イ撰スコ髴郁諸ァ郢厄スコ 111010001111101010001000111010011010010010010011111001101010010010001011111001111011001010010000111011111011110110111010111010011001110010001000111010001111100110000101111110111010100110100111111001111011100110010110111011111011110110111010 e8fa88e9a493e6a48be7b290efbdbae99c88e8f985fba9a7e7b996efbdba
EUC-JP 顫磯、捺、狗イ撰スコ髴郁??ァ郢厄スコ 111100001111110010110000111010111000111010100100110001101110100010001110101001001011011011101001100011101011001011000000111100011000111010111101100011101011101011110001111111001011000011101010001111110011111110001110101001111110111010111011110011001111000110001110101111011000111010111010 f0fcb0eb8ea4c6e88ea4b6e98eb2c0f18ebd8ebaf1fcb0ea3f3f8ea7eebbccf18ebd8eba
UTF-8 顫磯、捺、狗イ撰スコ髴郁諸ァ郢厄スコ 111010011010000110101011111001111010001110101111111011111011110110100100111001101000110110111010111011111011110110100100111001111000101110010111111011111011110110110010111001101001001010110000111011111011110110111101111011111011110110111010111010011010101110110100111010011000001110000001111011101001101110100000111011111010100010100010111011111011110110100111111010011000001110100010111001011000111010000100111011111011110110111101111011111011110110111010 e9a1abe7a3afefbda4e68dbaefbda4e78b97efbdb2e692b0efbdbdefbdbae9abb4e98381ee9ba0efa8a2efbda7e983a2e58e84efbdbdefbdba
UHC 顫磯?捺?狗?撰???郁????厄?? 1110111110110101110100011011010000111111110100011111010000111111110011111011011100111111111100111011110000111111001111110011111111101001111101000011111100111111001111110011111111100100111110000011111100111111 efb5d1b43fd1f43fcfb73ff3bc3f3f3fe9f43f3f3f3fe4f83f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)