To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弱??夜??厓э?弱??節√?張????? 1000111011100011001111110011111110010110111010010011111100111111111110101000110110000100100011110011111110001110111000110011111100111111100100001101111110000001111000110011111110010010101000110011111100111111001111110011111100111111 8ee33f3f96e93f3ffa8d848f3f8ee33f3f90df81e33f92a33f3f3f3f3f
EUC-JP 弱??夜??厓э?弱??節√?張????? 101111001110010100111111001111111100110011101011001111110011111110001111101101001100011110100111111011110011111110111100111001010011111100111111110000001110000110100010111001010011111111000100101001010011111100111111001111110011111100111111 bce53f3fcceb3f3f8fb4c7a7ef3fbce53f3fc0e1a2e53fc4a53f3f3f3f3f
UTF-8 弱놅쉿夜쇽쉼厓э푴弱놅쉰節√꽌張놅쉥簾쇽쉥 1110010110111100101100011110101110000110100001011110110010001001101111111110010110100100100111001110110010000111101111011110110010001001101111001110010110001110100100111101000110001101111011011001000110110100111001011011110010110001111010111000011010000101111011001000100110110000111001111010111110000000111000101000100010011010111010101011110110001100111001011011110010110101111010111000011010000101111011001000100110100101111011111010011010100110111011001000011110111101111011001000100110100101 e5bcb1eb8685ec89bfe5a49cec87bdec89bce58e93d18ded91b4e5bcb1eb8685ec89b0e7af80e2889aeabd8ce5bcb5eb8685ec89a5efa6a6ec87bdec89a5
UHC 弱놅쉿夜쇽쉼厓э푴弱놅쉰節√꽌張놅쉥簾쇽쉥 111001011011000010000110111011111011110110110010111001011010100010111100111011111011110110110000111001001110110110101100111011111011111010000010111001011011000010000110111011111011110110101110111011111011110110100001111011101000010010011100111011011110010110000110111011111011110110101011111001111010000110111100111011111011110110101011 e5b086efbdb2e5a8bcefbdb0e4edacefbe82e5b086efbdaeefbda1ee849cede586efbdabe7a1bcefbdab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)