To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺?<節??榮???э?鶯??與??鈺?? 1111101111000100001111111000000110000011100100001101111100111111001111111001111011000100001111110011111100111111100001001000111100111111111010011111001000111111001111111110010001101111001111110011111111111011110001000011111100111111 fbc43f818390df3f3f9ec43f3f3f848f3fe9f23f3fe46f3f3ffbc43f3f
EUC-JP 鈺?<節??榮???э?鶯??與??鈺?? 10001111111000111101010100111111101000011110001111000000111000010011111100111111110111001100011000111111001111110011111110100111111011110011111111110010111101000011111100111111111001111101000000111111001111111000111111100011110101010011111100111111 8fe3d53fa1e3c0e13f3fdcc63f3f3fa7ef3ff2f43f3fe7d03f3f8fe3d53f3f
UTF-8 鈺썲<節길쮭榮붻춾寧э숲鶯쇽쉽與딁킈鈺싨쪡 1110100110001000101110101110110010001101101100101110111110111100100111001110011110101111100000001110101010111000101110001110110010101110101011011110011010100110101011101110101110110110101110111110110010110110101111101110111110100110101010101101000110001101111011001000100010110010111010011011011010101111111011001000011110111101111011001000100110111101111010001000100010000111111010111001010010000001111011011000001010001000111010011000100010111010111011001000101110101000111011001010101010100001 e988baec8db2efbc9ce7af80eab8b8ecaeade6a6aeebb6bbecb6beefa6aad18dec88b2e9b6afec87bdec89bde88887eb9481ed8288e988baec8ba8ecaaa1
UHC 鈺썲<節길쮭榮붻춾寧э숲鶯쇽쉽與딁킈鈺싨쪡 111010001010110110111101111001011010001110111100111011111011110110110001111001101010100010001010111001111011010010010100111010001010110110011010111001111010110010101100111011111011110110100011111001011010001110111100111011111011110110110001111001101010100010001010111001111011010010010100111010001010110110011010111001101010010110011010 e8adbde5a3bcefbdb1e6a88ae7b494e8ad9ae7acacefbda3e5a3bcefbdb1e6a88ae7b494e8ad9ae6a59a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)