To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 櫻????Ⅵ矣??嚴??意??循??罌?? 10011111010011100011111100111111001111110011111110000111010110011110000111100001001111110011111110011010100011100011111100111111100010001101001100111111001111111000111101111010001111110011111111100011101000000011111100111111 9f4e3f3f3f3f8759e1e13f3f9a8e3f3f88d33f3f8f7a3f3fe3a03f3f
EUC-JP 櫻??佾??矣??嚴??意??循??罌?? 1101110110101111001111110011111110001111101100001111101100111111001111111110001011100011001111110011111111010011111011100011111100111111101100001101010100111111001111111011110111011011001111110011111111100110101000100011111100111111 ddaf3f3f8fb0fb3f3fe2e33f3fd3ee3f3fb0d53f3fbddb3f3fe6a23f3f
UTF-8 櫻뗣굜佾잞Ⅵ矣꺿끃嚴곷뀛意뉏튋循됯굡罌블굷 111001101010101110111011111010111001011110100011111010101011010110011100111001001011110110111110111011001001111010011110111000101000010110100101111001111001111110100011111010101011101010111111111010111000000110000011111001011001101010110100111010101011001110110111111010111000000010011011111001101000010010001111111010111000100110001111111011011000101010001011111001011011111010101010111010111001000010101111111010101011010110100001111001111011110110001100111010111011100010010100111010101011010110110111 e6abbbeb97a3eab59ce4bdbeec9e9ee285a5e79fa3eababfeb8183e59ab4eab3b7eb809be6848feb898fed8a8be5beaaeb90afeab5a1e7bd8cebb894eab5b7
UHC 櫻뗣굜佾잞Ⅵ矣꺿끃嚴곷뀛意뉏튋循됯굡罌블굷 111001011010000110001011111000111000001010000100111011001110101110011111111011111010010110110101111010111111100010000011111000101000010110111001111001011111000110000001111010111000010110010100111010111111001010000111111001001011100110011111111000101110000010001001111010101011000110110110111001011010001010111010111011011000001010010110 e5a18be38284eceb9fefa5b5ebf883e285b9e5f181eb8594ebf287e4b99fe2e089eab1b6e5a2baed8296

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)