What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖??悠?ぜ循??筌??議??乙??沃 100111110101000000111111001111111001011101001001001111111000001010111010100011110111101000111111001111111110001010100011001111110011111110001011011000110011111100111111100010011011001100111111001111111001011110000000 9f503f3f97493f82ba8f7a3f3fe2a33f3f8b633f3f89b33f3f9780
EUC-JP 蘖??悠?ぜ循??筌??議??乙??沃 110111011011000100111111001111111100110110101010001111111010010010111100101111011101101100111111001111111110010010100101001111110011111110110101110001000011111100111111101100101011010100111111001111111100110111100000 ddb13f3fcdaa3fa4bcbddb3f3fe4a53f3fb5c43f3fb2b53f3fcde0
UTF-8 蘖뽰궠悠껇ぜ循낃샹筌뚯슦議녷릸乙노룂沃 111010001001100010010110111010111011110110110000111010101011011010100000111001101000001010100000111010101011101110000111111000111000000110011100111001011011111010101010111010111000001010000011111011001000001110111001111001111010110110001100111010111001101010101111111011001000101010100110111010001010110110110000111010111000010110110111111010111010011010111000111001001011100110011001111010111000010110111000111010111010001110000010111001101011001010000011 e89896ebbdb0eab6a0e682a0eabb87e3819ce5beaaeb8283ec83b9e7ad8ceb9aafec8aa6e8adb0eb85b7eba6b8e4b999eb85b8eba382e6b283
UHC 蘖뽰궠悠껇ぜ循낃샹筌뚯슦議녷릸乙노룂沃 1110010111101110100101101110110010000010101100111110101011101101100000111110100010101010101111001110001011100000100001011110101010111100101001111110111110100111100011001110110010011010101100001110110010100001100001101110011010010000100101101110101111100000101100111110101110001111100000111110100010101010 e5ee96ec82b3eaed83e8aabce2e085eabca7efa78cec9ab0eca186e69096ebe0b3eb8f83e8aa

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
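
The rows of the table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the helper name encode_row and the CHARSETS mapping are hypothetical, and the choice of Python codec for each label (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table shows for unmappable characters.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# The codec chosen for each label is an assumption, not taken from the tool.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "悠"  # one of the characters that appears in the table
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For the single character 悠, the UTF-8 row produced by this sketch is e682a0, the SJIS-WIN row is 9749, and the EUC-JP row is cdaa, which agrees with the corresponding byte sequences in the table.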