What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 業??宥??儒??筌????? 10001011110001100011111100111111100101110100011100111111001111111000111011110010001111110011111111100010101000110011111100111111001111110011111100111111 8bc63f3f97473f3f8ef23f3fe2a33f3f3f3f3f
EUC-JP 業??宥??儒??筌????? 10110110110010000011111100111111110011011010100000111111001111111011110011110100001111110011111111100100101001010011111100111111001111110011111100111111 b6c83f3fcda83f3fbcf43f3fe4a53f3f3f3f3f
UTF-8 業좊쉠宥딁뫀儒뺤퐵筌귣틶痢욃쭦 111001101010010110101101111011001010001010001010111011001000100110100000111001011010111010100101111010111001010010000001111010111010101110000000111001011000010010010010111010111011101010100100111011011001000010110101111001111010110110001100111010101011011110100011111011011000101110110110111011111010011110100101111011001001101010000011111011001010110110100110 e6a5adeca28aec89a0e5aea5eb9481ebab80e58492ebbaa4ed90b5e7ad8ceab7a3ed8bb6efa7a5ec9a83ecada6
UHC 業좊쉠宥딁뫀儒뺤퐵筌귣틶痢욃쭦 111001011111011010100000111010111011110110101010111010101110100110001010111001111001000110100100111010101110001110010101111011001011110110011110111011111010011110000010111010111011101010011101111011001011100010011110111001011010011110011010 e5f6a0ebbdaaeae98ae791a4eae395ecbd9eefa782ebba9decb89ee5a79a
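The conversion behind a table like the one above can be approximated in a few lines of Python. This is only a minimal sketch, not the page's actual implementation. The `CHARSETS` mapping from the page's labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the 3f bytes in the table would arise.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters are replaced with "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("業").items():
        print(label, bits, hex_string)
```

For the single character 業, this yields `e6a5ad` for UTF-8, `8bc6` for SJIS-WIN, and `b6c8` for EUC-JP, matching the leading bytes of the corresponding rows in the table.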

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).