To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???濡??円???る?濡?????濡??^ 001111110011111100111111100101000100011100111111001111111000100101111110001111110011111100111111100000101110100100111111100101000100011100111111001111110011111100111111001111111001010001000111001111110011111101011110 3f3f3f94473f3f897e3f3f3f82e93f94473f3f3f3f3f94473f3f5e
EUC-JP ???濡??円???る?濡?????濡??^ 001111110011111100111111110001111010100000111111001111111011000111011111001111110011111100111111101001001110101100111111110001111010100000111111001111110011111100111111001111111100011110101000001111110011111101011110 3f3f3fc7a83f3fb1df3f3f3fa4eb3fc7a83f3f3f3f3fc7a83f3f5e
UTF-8 溜노죭濡싲젡円곷줁溜る졂濡덈줃溜싨셼濡섎젶^ 11101111101001111000101111101011100001011011100011101100101000111010110111100110101111111010000111101100100010111011001011101100101000001010000111100101100001101000011011101010101100111011011111101100101001001000000111101111101001111000101111100011100000101000101111101100101000011000001011100110101111111010000111101011100011011000100011101100101001001000001111101111101001111000101111101100100010111010100011101100100001011011110011100110101111111010000111101100100001001000111011101100101000001011011001011110 efa78beb85b8eca3ade6bfa1ec8bb2eca0a1e58686eab3b7eca481efa78be3828beca182e6bfa1eb8d88eca483efa78bec8ba8ec85bce6bfa1ec848eeca0b65e
UHC 溜노죭濡싲젡円곷줁溜る졂濡덈줃溜싨셼濡섎젶^ 11101010111111101011001111101011101000011000100011101011101000011001101011101011101000001001101011100101111101111000000111101011101000011001100011101010111111101010101011101011101000001011001111101011101000011000100011101011101000011001101011101010111111101001101011100110100110011000000111101011101000011001100011101011101000001010101001011110 eafeb3eba188eba19aeba09ae5f781eba198eafeaaeba0b3eba188eba19aeafe9ae69981eba198eba0aa5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)