To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???恁l???????恁??姨??????? 001111110011111100111111100111001000110010000010100011000011111100111111001111110011111100111111001111110011111110011100100011000011111100111111100110110100100000111111001111110011111100111111001111110011111100111111 3f3f3f9c8c828c3f3f3f3f3f3f3f9c8c3f3f9b483f3f3f3f3f3f3f
EUC-JP ???恁l??????Ł恁??姨??????? 0011111100111111001111111101011111101100101000111110110000111111001111110011111100111111001111110011111110001111101010011010100011010111111011000011111100111111110101011010100100111111001111110011111100111111001111110011111100111111 3f3f3fd7eca3ec3f3f3f3f3f3f8fa9a8d7ec3f3fd5a93f3f3f3f3f3f3f
UTF-8 梨뺥삟恁l콐吏좏삏梨쀬Ł恁깆콐姨먯ℓ梨섑샍吏잹 1110111110100111101000101110101110111010101001011110110010000010100111111110011010000001100000011110111110111101100011001110110010111101100100001110111110100111100111101110110010100010100011111110110010000010100011111110111110100111101000101110110010000000101011001100010110000001111001101000000110000001111010101011100110000110111011001011110110010000111001011010011110101000111010111010100010101111111000101000010010010011111011111010011110100010111011001000010010010001111011001000001110001101111011111010011110011110111011001001111010111001 efa7a2ebbaa5ec829fe68181efbd8cecbd90efa79eeca28fec828fefa7a2ec80acc581e68181eab986ecbd90e5a7a8eba8afe28493efa7a2ec8491ec838defa79eec9eb9
UHC 梨뺥삟恁l콐吏좏삏梨쀬Ł恁깆콐姨먯ℓ梨섑샍吏잹 11101100101100011001010111101101100110001010001011101100111101101010001111101100101100011000110011101100101001111010000011101101100110001001011011101100101100011001011111101100101010001010100111101100111101101011000111101100101100011000110011101100101010011001000011101100101001111010010011101100101100011001100011101101100110001011101111101100101001111010000001000010 ecb195ed98a2ecf6a3ecb18ceca7a0ed9896ecb197eca8a9ecf6b1ecb18ceca990eca7a4ecb198ed98bbeca7a042

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
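
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for label, (bits, hex_string) in encode_table("恁").items():
        print(label, bits, hex_string)
```

With errors="replace", a character outside a charset's repertoire is encoded as a single "?" byte instead of raising an exception, so every row has a value even when the input cannot be represented.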