Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 烏l???????湲?。???泳??穩??^ 10001001010001111000001010001100001111110011111100111111001111110011111100111111001111111001111111010001001111111000000101000010001111110011111100111111100010010110101000111111001111111110001001110010001111110011111101011110 8947828c3f3f3f3f3f3f3f9fd13f81423f3f3f896a3f3fe2723f3f5e
EUC-JP 烏l???????湲?。???泳??穩??^ 10110001101010001010001111101100001111110011111100111111001111110011111100111111001111111101111011010011001111111010000110100011001111110011111100111111101100011100101100111111001111111110001111010011001111110011111101011110 b1a8a3ec3f3f3f3f3f3f3fded33fa1a33f3f3fb1cb3f3fe3d33f3f5e
UTF-8 烏l캈溜뗧텚溜띹콊湲븃。易붾젉泳볥젌穩녾맏^ 11100111100000111000111111101111101111011000110011101100101110101000100011101111101001111000101111101011100101111010011111101101100001011001101011101111101001111000101111101011100111011011100111101100101111011000101011100110101110011011001011101011101110001000001111100011100000001000001011101111101001111010000011101011101101101011111011101100101000001000100111100110101100111011001111101011101100111010010111101100101000001000110011100111101010011010100111101011100001011011111011101011101001111000111101011110 e7838fefbd8cecba88efa78beb97a7ed859aefa78beb9db9ecbd8ae6b9b2ebb883e38082efa7a0ebb6beeca089e6b3b3ebb3a5eca08ce7a9a9eb85beeba78f5e
UHC 烏l캈溜뗧텚溜띹콊湲븃。易붾젉泳볥젌穩녾맏^ 11101000101000011010001111101100101011111001010011101010111111101000101111100111101101101001001111101010111111101000110111101000101100011000011011101010101110001011101011101000101000011010001111101100101011111001010011101011101000001000101111100111101101101001001111101011101000001000110111101000101100011000011011101010101110001011101001011110 e8a1a3ecaf94eafe8be7b693eafe8de8b186eab8bae8a1a3ecaf94eba08be7b693eba08de8b186eab8ba5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
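
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (in particular `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, as are the helper names `to_bits` and `convert`. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` runs in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def to_bits(data):
    """Render bytes as a string of 0s and 1s, eight bits per byte."""
    return "".join(f"{byte:08b}" for byte in data)


def convert(text):
    """Return {charset label: (binary string, hex string)} for the text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        rows[label] = (to_bits(data), data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("烏^").items():
        print(label, bits, hexstr)
```

Calling `convert` with a different string prints one row per charset in the same column order as the table (charset, binary, hexadecimal).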