To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 蔭??彦?????濡?蔭??彦?????濡?B 1000100011111100001111110011111110010101010001100011111100111111001111110011111100111111100101000100011100111111100010001111110000111111001111111001010101000110001111110011111100111111001111110011111110010100010001110011111101000010 88fc3f3f95463f3f3f3f3f94473f88fc3f3f95463f3f3f3f3f94473f42
EUC-JP 蔭??彦?????濡?蔭??彦?????濡?B 1011000011111110001111110011111111001001101001110011111100111111001111110011111100111111110001111010100000111111101100001111111000111111001111111100100110100111001111110011111100111111001111110011111111000111101010000011111101000010 b0fe3f3fc9a73f3f3f3f3fc7a83fb0fe3f3fc9a73f3f3f3f3fc7a83f42
UTF-8 蔭덉꽍彦쀫젽溜싲젽濡뻵蔭덉꽍彦쀫젽溜싲젽濡뻵B 11101000100101001010110111101011100011011000100111101010101111011000110111100101101111011010011011101100100000001010101111101100101000001011110111101111101001111000101111101100100010111011001011101100101000001011110111100110101111111010000111101011101110111011010111101000100101001010110111101011100011011000100111101010101111011000110111100101101111011010011011101100100000001010101111101100101000001011110111101111101001111000101111101100100010111011001011101100101000001011110111100110101111111010000111101011101110111011010101000010 e894adeb8d89eabd8de5bda6ec80abeca0bdefa78bec8bb2eca0bde6bfa1ebbbb5e894adeb8d89eabd8de5bda6ec80abeca0bdefa78bec8bb2eca0bde6bfa1ebbbb542
UHC 蔭덉꽍彦쀫젽溜싲젽濡뻵蔭덉꽍彦쀫젽溜싲젽濡뻵B 111010111110001110001000111011001000010010011101111001011110100110010111111010111010000010101111111010101111111010011010111010111010000010101111111010111010000110010110011110101110101111100011100010001110110010000100100111011110010111101001100101111110101110100000101011111110101011111110100110101110101110100000101011111110101110100001100101100111101001000010 ebe388ec849de5e997eba0afeafe9aeba0afeba1967aebe388ec849de5e997eba0afeafe9aeba0afeba1967a42
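A table like the one above can be approximated with a few lines of Python. This is only a sketch under stated assumptions, not the converter's actual implementation: the codec names `cp932`, `euc_jp` and `cp949` are Python's closest equivalents to SJIS-WIN, EUC-JP and UHC and may differ from the converter in a few vendor-specific mappings, and the names `CHARSETS`, `to_bit_strings` and `conversion_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" for every unencodable character.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("蔭B").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal one.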

SJIS-WIN, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
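Because both charsets cover the same Japanese characters with different byte values, text can be moved between them by decoding from one and re-encoding in the other. The sketch below illustrates this with the bytes for 蔭 taken from the table. It assumes Python's `cp932` and `euc_jp` codecs as stand-ins for SJIS-WIN and EUC-JP, and the helper name `transcode_hex` is hypothetical.

```python
def transcode_hex(hex_bytes, source_codec, target_codec):
    """Decode a hex byte string from one charset and re-encode it in another."""
    text = bytes.fromhex(hex_bytes).decode(source_codec)
    return text.encode(target_codec).hex()


if __name__ == "__main__":
    # 88fc is the SJIS-WIN (CP932) encoding of 蔭 shown in the table.
    print(transcode_hex("88fc", "cp932", "euc_jp"))
```

Unlike the replacement behaviour in the table, this helper raises an error when the input bytes are invalid or the character has no mapping in the target charset, which is usually the safer default when converting real data.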