To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ????????????鴉???????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111101001111010110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3fe9eb3f3f3f3f3f3f3f68
EUC-JP ????????????鴉???????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111110010111011010011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3ff2ed3f3f3f3f3f3f3f68
UTF-8 溜븐뵽溜뽯졋溜뺣졋溜딅졎鴉쇱뵽溜뽯졋溜둅h 11101111101001111000101111101011101110001001000011101011101101011011110111101111101001111000101111101011101111011010111111101100101000011000101111101111101001111000101111101011101110101010001111101100101000011000101111101111101001111000101111101011100101001000010111101100101000011000111011101001101101001000100111101100100001111011000111101011101101011011110111101111101001111000101111101011101111011010111111101100101000011000101111101111101001111000101111101011100100011000010101101000 efa78bebb890ebb5bdefa78bebbdafeca18befa78bebbaa3eca18befa78beb9485eca18ee9b489ec87b1ebb5bdefa78bebbdafeca18befa78beb918568
UHC 溜븐뵽溜뽯졋溜뺣졋溜딅졎鴉쇱뵽溜뽯졋溜둅h 1110101011111110101110101110110010010100101110111110101011111110100101101110101110100000101110101110101011111110100101011110101110100000101110101110101011111110100010101110101110100000101110111110010010111100101111001110110010010100101110111110101011111110100101101110101110100000101110101110101011111110100010100100000101101000 eafebaec94bbeafe96eba0baeafe95eba0baeafe8aeba0bbe4bcbcec94bbeafe96eba0baeafe8a4168

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)