What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈??獄??蜈??節わ?燿?????節よ? 1001111011110100001111110011111110001101100101100011111100111111111001011000010100111111001111111001000011011111100000101110110100111111111000001010000000111111001111110011111100111111001111111001000011011111100000101110011000111111 9ef43f3f8d963f3fe5853f3f90df82ed3fe0a03f3f3f3f3f90df82e63f
EUC-JP 橈??獄??蜈??節わ?燿??孼??節よ? 11011100111101100011111100111111101110011111011000111111001111111110100111100101001111110011111111000000111000011010010011101111001111111110000010100010001111110011111110001111101110101100001100111111001111111100000011100001101001001110100000111111 dcf63f3fb9f63f3fe9e53f3fc0e1a4ef3fe0a23f3f8fbac33f3fc0e1a4e83f
UTF-8 橈롳슴獄깍쉴蜈졿변節わ쉼燿쒏눟孼닻겤節よ쾳 111001101010100110001000111010111010000110110011111011001000101010110100111001111000110110000100111010101011100110001101111011001000100110110100111010001001110010001000111011001010000110111111111010111011001110000000111001111010111110000000111000111000001010001111111011001000100110111100111001111000011110111111111011001001001010001111111010111000100010011111111001011010110110111100111010111000101110111011111010101011001010100100111001111010111110000000111000111000001010001000111011001011111010110011 e6a988eba1b3ec8ab4e78d84eab98dec89b4e89c88eca1bfebb380e7af80e3828fec89bce787bfec928feb889fe5adbceb8bbbeab2a4e7af80e38288ecbeb3
UHC 橈롳슴獄깍쉴蜈졿변節わ쉼燿쒏눟孼닻겤節よ쾳 111010001111101010001110111011111011110110111111111010001010101110110001111011111011110110101111111010001010010110100000111001101011101010101111111011111011110110101010111011111011110110110000111010001111110010011100111001101000011110110111111001011110110110110100111010011000000110110110111011111011110110101010111010001011001010001001 e8fa8eefbdbfe8abb1efbdafe8a5a0e6baafefbdaaefbdb0e8fc9ce687b7e5edb4e981b6efbdaae8b289
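A conversion like the one tabulated above can be approximated offline. The following is a minimal sketch, not the page's actual implementation: the function name `encode_table` and the `CODECS` mapping are hypothetical, the codec names are Python's standard-library names, and mapping UHC to `cp949` is an assumption. It also assumes that characters a charset cannot represent are replaced by `?` (0x3f), which is what the ISO-8859-1 row appears to show.

```python
# Map the labels used on this page to Python codec names.
# SJIS-WIN = CP932 is stated on the page; UHC -> cp949 is an assumption.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, bits, hexstr)
```

For the single character "あ", the UTF-8 row comes out as `e38182`, while the ISO-8859-1 row falls back to `3f` because that charset has no Japanese characters.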

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).