To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 鰲???琳R鰲???琳^[鰲???琳R鰲???琳^[^ 1110100111100000001111110011111100111111100101111101010001010010111010011110000000111111001111110011111110010111110101000101111001011011111010011110000000111111001111110011111110010111110101000101001011101001111000000011111100111111001111111001011111010100010111100101101101011110 e9e03f3f3f97d452e9e03f3f3f97d45e5be9e03f3f3f97d452e9e03f3f3f97d45e5b5e
EUC-JP 鰲???琳R鰲???琳^[鰲???琳R鰲???琳^[^ 1111001011100010001111110011111100111111110011101101011001010010111100101110001000111111001111110011111111001110110101100101111001011011111100101110001000111111001111110011111111001110110101100101001011110010111000100011111100111111001111111100111011010110010111100101101101011110 f2e23f3f3fced652f2e23f3f3fced65e5bf2e23f3f3fced652f2e23f3f3fced65e5b5e
UTF-8 鰲쑈뤛슭琳R鰲쑈뤛슭琳^[鰲쑈뤛슭琳R鰲쑈뤛슭琳^[^ 11101001101100001011001011101100100100011000100011101011101001001001101111101100100010101010110111100111100100001011001101010010111010011011000010110010111011001001000110001000111010111010010010011011111011001000101010101101111001111001000010110011010111100101101111101001101100001011001011101100100100011000100011101011101001001001101111101100100010101010110111100111100100001011001101010010111010011011000010110010111011001001000110001000111010111010010010011011111011001000101010101101111001111001000010110011010111100101101101011110 e9b0b2ec9188eba49bec8aade790b352e9b0b2ec9188eba49bec8aade790b35e5be9b0b2ec9188eba49bec8aade790b352e9b0b2ec9188eba49bec8aade790b35e5b5e
UHC 鰲쑈뤛슭琳R鰲쑈뤛슭琳^[鰲쑈뤛슭琳R鰲쑈뤛슭琳^[^ 1110100010100111101111101010010010001111110010101011110110111110110101111111101101010010111010001010011110111110101001001000111111001010101111011011111011010111111110110101111001011011111010001010011110111110101001001000111111001010101111011011111011010111111110110101001011101000101001111011111010100100100011111100101010111101101111101101011111111011010111100101101101011110 e8a7bea48fcabdbed7fb52e8a7bea48fcabdbed7fb5e5be8a7bea48fcabdbed7fb52e8a7bea48fcabdbed7fb5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)