To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????{N}?????????{N{^ 00111111001111110011111100111111001111110011111100111111001111110011111101111011010011100111110100111111001111110011111100111111001111110011111100111111001111110011111101111011010011100111101101011110 3f3f3f3f3f3f3f3f3f7b4e7d3f3f3f3f3f3f3f3f3f7b4e7b5e
SJIS-WIN 鸚??逸??音??{N}鸚??逸??音??{N{^ 11101010010111110011111100111111100010001110110100111111001111111000100110111001001111110011111101111011010011100111110111101010010111110011111100111111100010001110110100111111001111111000100110111001001111110011111101111011010011100111101101011110 ea5f3f3f88ed3f3f89b93f3f7b4e7dea5f3f3f88ed3f3f89b93f3f7b4e7b5e
EUC-JP 鸚??逸??音??{N}鸚??逸??音??{N{^ 11110011110000000011111100111111101100001110111100111111001111111011001010111011001111110011111101111011010011100111110111110011110000000011111100111111101100001110111100111111001111111011001010111011001111110011111101111011010011100111101101011110 f3c03f3fb0ef3f3fb2bb3f3f7b4e7df3c03f3fb0ef3f3fb2bb3f3f7b4e7b5e
UTF-8 鸚쒖눦逸드쩂音쏀뭻{N}鸚쒖눦逸드쩂音쏀뭻{N{^ 11101001101110001001101011101100100100101001011011101011100010001010011011101001100000001011100011101011100100111001110011101100101010011000001011101001100111111011001111101100100011111000000011101011101011011011101101111011010011100111110111101001101110001001101011101100100100101001011011101011100010001010011011101001100000001011100011101011100100111001110011101100101010011000001011101001100111111011001111101100100011111000000011101011101011011011101101111011010011100111101101011110 e9b89aec9296eb88a6e980b8eb939ceca982e99fb3ec8f80ebadbb7b4e7de9b89aec9296eb88a6e980b8eb939ceca982e99fb3ec8f80ebadbb7b4e7b5e
UHC 鸚쒖눦逸드쩂音쏀뭻{N}鸚쒖눦逸드쩂音쏀뭻{N{^ 11100101101001001001110011101100100001111011110111101100111011111011010111100101101001001001110011101011111001011011110111101101100100101000101001111011010011100111110111100101101001001001110011101100100001111011110111101100111011111011010111100101101001001001110011101011111001011011110111101101100100101000101001111011010011100111101101011110 e5a49cec87bdecefb5e5a49cebe5bded928a7b4e7de5a49cec87bdecefb5e5a49cebe5bded928a7b4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)