What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 丈ソ舎舎丈ソ舎蕊N}丈ソ舎舎丈ソ舎蕊N{^ 100011111110010010111111100011101100100110001110110010011000111111100100101111111000111011001001100011101100011101001110011111011000111111100100101111111000111011001001100011101100100110001111111001001011111110001110110010011000111011000111010011100111101101011110 8fe4bf8ec98ec98fe4bf8ec98ec74e7d8fe4bf8ec98ec98fe4bf8ec98ec74e7b5e
EUC-JP 丈ソ舎舎丈ソ舎蕊N}丈ソ舎舎丈ソ舎蕊N{^ 10111110111001101000111010111111101111001100101110111100110010111011111011100110100011101011111110111100110010111011110011001001010011100111110110111110111001101000111010111111101111001100101110111100110010111011111011100110100011101011111110111100110010111011110011001001010011100111101101011110 bee68ebfbccbbccbbee68ebfbccbbcc94e7dbee68ebfbccbbccbbee68ebfbccbbcc94e7b5e
UTF-8 丈ソ舎舎丈ソ舎蕊N}丈ソ舎舎丈ソ舎蕊N{^ 1110010010111000100010001110111110111101101111111110100010001000100011101110100010001000100011101110010010111000100010001110111110111101101111111110100010001000100011101110100010010101100010100100111001111101111001001011100010001000111011111011110110111111111010001000100010001110111010001000100010001110111001001011100010001000111011111011110110111111111010001000100010001110111010001001010110001010010011100111101101011110 e4b888efbdbfe8888ee8888ee4b888efbdbfe8888ee8958a4e7de4b888efbdbfe8888ee8888ee4b888efbdbfe8888ee8958a4e7b5e
UHC 丈???丈???N}丈???丈???N{^ 11101101110110110011111100111111001111111110110111011011001111110011111100111111010011100111110111101101110110110011111100111111001111111110110111011011001111110011111100111111010011100111101101011110 eddb3f3f3feddb3f3f3f4e7deddb3f3f3feddb3f3f3f4e7b5e

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
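
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that the Python codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), as the ISO-8859-1 and UHC rows suggest. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_table(text):
    """Return one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Decode again to show what actually survived the round trip.
        shown = data.decode(codec)
        # Each byte is written as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("あ"):
        print(*row)
```

For the single character "あ", this yields e38182 in UTF-8, 82a0 in SJIS-WIN, a4a2 in EUC-JP, and 3f (a replacement "?") in ISO-8859-1.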