Which bit string is a character (or a string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 櫻?????恂μ?n}櫻?????恂μ?n{^ 1001111101001110001111110011111100111111001111110011111110011100100101101000001111001010001111110110111001111101100111110100111000111111001111110011111100111111001111111001110010010110100000111100101000111111011011100111101101011110 9f4e3f3f3f3f3f9c9683ca3f6e7d9f4e3f3f3f3f3f9c9683ca3f6e7b5e
EUC-JP 櫻?????恂μ?n}櫻?????恂μ?n{^ 1101110110101111001111110011111100111111001111110011111111010111111101101010011011001100001111110110111001111101110111011010111100111111001111110011111100111111001111111101011111110110101001101100110000111111011011100111101101011110 ddaf3f3f3f3f3fd7f6a6cc3f6e7dddaf3f3f3f3f3fd7f6a6cc3f6e7b5e
UTF-8 櫻뗭엺痢믦ㄵ恂μ툛n}櫻뗭엺痢믦ㄵ恂μ툛n{^ 111001101010101110111011111010111001011110101101111011001001011110111010111011111010011110100101111010111010111110100110111000111000010010110101111001101000000110000010110011101011110011101101100010001001101101101110011111011110011010101011101110111110101110010111101011011110110010010111101110101110111110100111101001011110101110101111101001101110001110000100101101011110011010000001100000101100111010111100111011011000100010011011011011100111101101011110 e6abbbeb97adec97baefa7a5ebafa6e384b5e68182cebced889b6e7de6abbbeb97adec97baefa7a5ebafa6e384b5e68182cebced889b6e7b5e
UHC 櫻뗭엺痢믦ㄵ恂μ툛n}櫻뗭엺痢믦ㄵ恂μ툛n{^ 1110010110100001100010111110110010011110100011001110110010111000100100101110100010100100101001011110001011100001101001011110110010111000100100100110111001111101111001011010000110001011111011001001111010001100111011001011100010010010111010001010010010100101111000101110000110100101111011001011100010010010011011100111101101011110 e5a18bec9e8cecb892e8a4a5e2e1a5ecb8926e7de5a18bec9e8cecb892e8a4a5e2e1a5ecb8926e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
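
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, bit string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    hex_string = data.hex()
    # Each byte becomes eight binary digits, most significant bit first.
    bit_string = "".join(f"{byte:08b}" for byte in data)
    return hex_string, bit_string


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (hex_string, bit_string) in encode_table("櫻").items():
        print(label, hex_string, bit_string)
```

For the single character 櫻, the UTF-8 row of this sketch gives e6abbb, which matches the first three bytes of the UTF-8 row in the table, and ISO-8859-1 gives 3f because the character has no Latin-1 equivalent.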