To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?訥イ??あ??ぁ□?慇?訥イ??あ??ぁ□?垠^ 00111111111001100110001110000011010000110011111100111111100000101010000000111111001111111000001010011111100000011010000000111111100111001011111000111111111001100110001110000011010000110011111100111111100000101010000000111111001111111000001010011111100000011010000000111111100110101011010101011110 3fe66383433f3f82a03f3f829f81a03f9cbe3fe66383433f3f82a03f3f829f81a03f9ab55e
EUC-JP ?訥イ??あ??ぁ□?慇?訥イ??あ??ぁ□?垠^ 00111111111010111100010010100101101001000011111100111111101001001010001000111111001111111010010010100001101000101010001000111111110110001100000000111111111010111100010010100101101001000011111100111111101001001010001000111111001111111010010010100001101000101010001000111111110101001011011101011110 3febc4a5a43f3fa4a23f3fa4a1a2a23fd8c03febc4a5a43f3fa4a23f3fa4a1a2a23fd4b75e
UTF-8 룶訥イ룶쨵あ룶쨵ぁ□룫慇룶訥イ룶쨵あ룶쨵ぁ□룫垠^ 11101011101000111011011011101000101010001010010111100011100000101010010011101011101000111011011011101100101010001011010111100011100000011000001011101011101000111011011011101100101010001011010111100011100000011000000111100010100101101010000111101011101000111010101111100110100001011000011111101011101000111011011011101000101010001010010111100011100000101010010011101011101000111011011011101100101010001011010111100011100000011000001011101011101000111011011011101100101010001011010111100011100000011000000111100010100101101010000111101011101000111010101111100101100111101010000001011110 eba3b6e8a8a5e382a4eba3b6eca8b5e38182eba3b6eca8b5e38181e296a1eba3abe68587eba3b6e8a8a5e382a4eba3b6eca8b5e38182eba3b6eca8b5e38181e296a1eba3abe59ea05e
UHC 룶訥イ룶쨵あ룶쨵ぁ□룫慇룶訥イ룶쨵あ룶쨵ぁ□룫垠^ 10001111101010111101001011101101101010111010010010001111101010111010010010001111101010101010001010001111101010111010010010001111101010101010000110100001111000001000111110100010111010111101101110001111101010111101001011101101101010111010010010001111101010111010010010001111101010101010001010001111101010111010010010001111101010101010000110100001111000001000111110100010111010111101100101011110 8fabd2edaba48faba48faaa28faba48faaa1a1e08fa2ebdb8fabd2edaba48faba48faaa28faba48faaa1a1e08fa2ebd95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)