To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 娃?????蘖??節??檍??阿??節よ?節 100010001010000100111111001111110011111100111111001111111001111101010000001111110011111110010000110111110011111100111111100111101111100000111111001111111000100010100010001111110011111110010000110111111000001011100110001111111001000011011111 88a13f3f3f3f3f9f503f3f90df3f3f9ef83f3f88a23f3f90df82e63f90df
EUC-JP 娃?????蘖??節??檍??阿??節よ?節 101100001010001100111111001111110011111100111111001111111101110110110001001111110011111111000000111000010011111100111111110111001111101000111111001111111011000010100100001111110011111111000000111000011010010011101000001111111100000011100001 b0a33f3f3f3f3fddb13f3fc0e13f3fdcfa3f3fb0a43f3fc0e1a4e83fc0e1
UTF-8 娃띰쉠樂됮젒蘖쀩슧節곈찕檍놅숱阿잞쉥節よ쾫節 111001011010100010000011111010111001110110110000111011001000100110100000111011111010011010111111111010111001000010101110111011001010000010010010111010001001100010010110111011001000000010101001111011001000101010100111111001111010111110000000111010101011001110001000111011001011000010010101111001101010101010001101111010111000011010000101111011001000100010110001111010011001100010111111111011001001111010011110111011001000100110100101111001111010111110000000111000111000001010001000111011001011111010101011111001111010111110000000 e5a883eb9db0ec89a0efa6bfeb90aeeca092e89896ec80a9ec8aa7e7af80eab388ecb095e6aa8deb8685ec88b1e998bfec9e9eec89a5e7af80e38288ecbeabe7af80
UHC 娃띰쉠樂됮젒蘖쀩슧節곈찕檍놅숱阿잞쉥節よ쾫節 1110100011011111101101101110111110111101101010101110100011111001100010011110100110100000100100011110010111101110100101111110100110011010101100011110111110111101101100001110100110101001100101011110010111100101100001101110111110111101101000101110010010111001100111111110111110111101101010111110111110111101101010101110100010110010100000101110111110111101 e8dfb6efbdaae8f989e9a091e5ee97e99ab1efbdb0e9a995e5e586efbda2e4b99fefbdabefbdaae8b282efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)