To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ??????伎彦?}??????伎彦?{^ 00111111001111110011111100111111001111110011111110001010111010101001010101000110001111110111110100111111001111110011111100111111001111110011111110001010111010101001010101000110001111110111101101011110 3f3f3f3f3f3f8aea95463f7d3f3f3f3f3f3f8aea95463f7b5e
EUC-JP ??????伎彦?}??????伎彦?{^ 00111111001111110011111100111111001111110011111110110100111011001100100110100111001111110111110100111111001111110011111100111111001111110011111110110100111011001100100110100111001111110111101101011110 3f3f3f3f3f3fb4ecc9a73f7d3f3f3f3f3f3fb4ecc9a73f7b5e
UTF-8 吳녶콉樂녻뙬伎彦푟}吳녶콉樂녻뙬伎彦푟{^ 111001011001000010110011111010111000010110110110111011001011110110001001111011111010011010111111111010111000010110111011111010111001100110101100111001001011110010001110111001011011110110100110111011011001000110011111011111011110010110010000101100111110101110000101101101101110110010111101100010011110111110100110101111111110101110000101101110111110101110011001101011001110010010111100100011101110010110111101101001101110110110010001100111110111101101011110 e590b3eb85b6ecbd89efa6bfeb85bbeb99ace4bc8ee5bda6ed919f7de590b3eb85b6ecbd89efa6bfeb85bbeb99ace4bc8ee5bda6ed919f7b5e
UHC 吳녶콉樂녻뙬伎彦푟}吳녶콉樂녻뙬伎彦푟{^ 111001111110111110000110111001011011000110000101111010001111100110000110111010001000110010101111110100001110101111100101111010011011111001101011011111011110011111101111100001101110010110110001100001011110100011111001100001101110100010001100101011111101000011101011111001011110100110111110011010110111101101011110 e7ef86e5b185e8f986e88cafd0ebe5e9be6b7de7ef86e5b185e8f986e88cafd0ebe5e9be6b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)