What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鼇??鎰??醫??渦??????④?與??裕 1110101010000111001111110011111111101000010011000011111100111111111001111100111000111111001111111000100101010001001111110011111100111111001111110011111100111111100001110100001100111111111001000110111100111111001111111001011101010100 ea873f3fe84c3f3fe7ce3f3f89513f3f3f3f3f3f87433fe46f3f3f9754
EUC-JP 鼇??鎰??醫??渦????????與??裕 11110011111001110011111100111111111011111010110100111111001111111110111011010000001111110011111110110001101100100011111100111111001111110011111100111111001111110011111100111111111001111101000000111111001111111100110110110101 f3e73f3fefad3f3feed03f3fb1b23f3f3f3f3f3f3f3fe7d03f3fcdb5
UTF-8 鼇앸뜉鎰쇿쉽醫꾩탿渦긱꺇杻뗩굲硫④같與잂넀裕 111010011011110010000111111011001001010110111000111010111001110010001001111010011000111010110000111011001000011110111111111011001000100110111101111010011000011010101011111010101011111010101001111011011000001110111111111001101011100010100110111010101011100010110001111010101011101010000111111011111010011110001000111010111001011110101001111010101011010110110010111011111010011110001110111000101001000110100011111010101011000010011001111010001000100010000111111011001001111010000010111010111000010010000000111010001010001110010101 e9bc87ec95b8eb9c89e98eb0ec87bfec89bde986abeabea9ed83bfe6b8a6eab8b1eaba87efa788eb97a9eab5b2efa78ee291a3eab099e88887ec9e82eb8480e8a395
UHC 鼇앸뜉鎰쇿쉽醫꾩탿渦긱꺇杻뗩굲硫④같與잂넀裕 1110100010101000100111011110101110001101100011001110110011110000100110011110010110111101101100011110110010100010100001001110110010110101100110111110100010111110101100011110001110000011101011101110101011110100100010111110100110000010100101011110101110101001101010001110101010110000101100001110011010101000100111111110001010000110100100001110101110101110 e8a89deb8d8cecf099e5bdb1eca284ecb59be8beb1e383aeeaf48be98295eba9a8eab0b0e6a89fe28690ebae

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
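
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names (`encode_row`, `convert`) are hypothetical, and the mapping from the table's labels to Python codec names is an assumption (CP932 for SJIS-WIN as noted above, and CP949 for UHC). Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become b"?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("A").items():
        print(label, binary, hexa)
```

Calling `convert` with a non-ASCII string shows how the byte sequences diverge between charsets, and where a charset falls back to 3f because it has no code for the character.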