To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????o?敖??搖??????o?敖??搖??B 00111111001111110011111100111111100000101000111100111111100111011100001000111111001111111001110110001010001111110011111100111111001111110011111100111111100000101000111100111111100111011100001000111111001111111001110110001010001111110011111101000010 3f3f3f3f828f3f9dc23f3f9d8a3f3f3f3f3f3f828f3f9dc23f3f9d8a3f3f42
EUC-JP ???饔o?敖??搖?????饔o?敖??搖??B 0011111100111111001111111000111111101000111011111010001111101111001111111101101011000100001111110011111111011001111010100011111100111111001111110011111100111111100011111110100011101111101000111110111100111111110110101100010000111111001111111101100111101010001111110011111101000010 3f3f3f8fe8efa3ef3fdac43f3fd9ea3f3f3f3f3f8fe8efa3ef3fdac43f3fd9ea3f3f42
UTF-8 樂됵쉠饔o슨敖쏉쉿搖좄쮫樂됵쉠饔o슨敖쏉쉿搖좄쮫B 11101111101001101011111111101011100100001011010111101100100010011010000011101001101001011001010011101111101111011000111111101100100010101010100011100110100101011001011011101100100011111000100111101100100010011011111111100110100100001001011011101100101000101000010011101100101011101010101111101111101001101011111111101011100100001011010111101100100010011010000011101001101001011001010011101111101111011000111111101100100010101010100011100110100101011001011011101100100011111000100111101100100010011011111111100110100100001001011011101100101000101000010011101100101011101010101101000010 efa6bfeb90b5ec89a0e9a594efbd8fec8aa8e69596ec8f89ec89bfe69096eca284ecaeabefa6bfeb90b5ec89a0e9a594efbd8fec8aa8e69596ec8f89ec89bfe69096eca284ecaeab42
UHC 樂됵쉠饔o슨敖쏉쉿搖좄쮫樂됵쉠饔o슨敖쏉쉿搖좄쮫B 11101000111110011000100111101111101111011010101011101000101111011010001111101111101111011011110011100111111110011001101111101111101111011011001011101000111101001010000011101000101010001000100011101000111110011000100111101111101111011010101011101000101111011010001111101111101111011011110011100111111110011001101111101111101111011011001011101000111101001010000011101000101010001000100001000010 e8f989efbdaae8bda3efbdbce7f99befbdb2e8f4a0e8a888e8f989efbdaae8bda3efbdbce7f99befbdb2e8f4a0e8a88842

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
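
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's codec names `cp932` and `cp949` correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the rows above suggest. The names `to_bit_strings` and `CHARSETS` are hypothetical.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# cp932 for SJIS-Win and cp949 for UHC are assumptions about equivalence.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "B"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII letter `B`, every charset listed here yields the same single byte (`01000010`, hex `42`), which matches the final byte of each row in the table. Differences between charsets appear only for non-ASCII input.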