What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????n}???????????n{^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 障?吟?醍???制絅?n}障?吟?醍???制絅?n{^ 10001111111000010011111110001011111000010011111110010001111001110011111100111111001111111001000010100111111000110100010000111111011011100111110110001111111000010011111110001011111000010011111110010001111001110011111100111111001111111001000010100111111000110100010000111111011011100111101101011110 8fe13f8be13f91e73f3f3f90a7e3443f6e7d8fe13f8be13f91e73f3f3f90a7e3443f6e7b5e
EUC-JP 障?吟?醍???制絅?n}障?吟?醍???制絅?n{^ 10111110111000110011111110110110111000110011111111000010111010010011111100111111001111111100000010101001111001011010010100111111011011100111110110111110111000110011111110110110111000110011111111000010111010010011111100111111001111111100000010101001111001011010010100111111011011100111101101011110 bee33fb6e33fc2e93f3f3fc0a9e5a53f6e7dbee33fb6e33fc2e93f3f3fc0a9e5a53f6e7b5e
UTF-8 障렚吟렞醍닻렖렕制絅긺n}障렚吟렞醍닻렖렕制絅긺n{^ 1110100110011010100111001110101110100000100110101110010110010000100111111110101110100000100111101110100110000110100011011110101110001011101110111110101110100000100101101110101110100000100101011110010110001000101101101110011110110101100001011110101010111000101110100110111001111101111010011001101010011100111010111010000010011010111001011001000010011111111010111010000010011110111010011000011010001101111010111000101110111011111010111010000010010110111010111010000010010101111001011000100010110110111001111011010110000101111010101011100010111010011011100111101101011110 e99a9ceba09ae5909feba09ee9868deb8bbbeba096eba095e588b6e7b585eab8ba6e7de99a9ceba09ae5909feba09ee9868deb8bbbeba096eba095e588b6e7b585eab8ba6e7b5e
UHC 障렚吟렞醍닻렖렕制絅긺n}障렚吟렞醍닻렖렕制絅긺n{^ 11101110101000011000111010101101111010111110000110001110101011111111000010110101101101001110100110001110101010111000111010101010111100001010010011001100111001111011000111100111011011100111110111101110101000011000111010101101111010111110000110001110101011111111000010110101101101001110100110001110101010111000111010101010111100001010010011001100111001111011000111100111011011100111101101011110 eea18eadebe18eaff0b5b4e98eab8eaaf0a4cce7b1e76e7deea18eadebe18eaff0b5b4e98eab8eaaf0a4cce7b1e76e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
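
For readers who want to reproduce a table like the one above offline, here is a minimal Python sketch. It is not the converter's actual implementation, which is not shown on this page. The codec names are assumptions: Python's cp932 stands in for SJIS-WIN and cp949 for UHC, and results for a few vendor-specific characters may differ from the tool's. The function name encode_in_charsets is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # Unencodable characters become b"?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_in_charsets("n{^").items():
        print(label, bits, hexed)
```

Pure ASCII input such as "n{^" yields the same bytes (6e7b5e) in all five charsets, which is why every row of the table ends in the same three bytes.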