To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 雍??唯?ぜ鷹??壓??議??怨??沃?Ⅱ 111010001011010000111111001111111001011101000010001111111000001010111010100100011110100100111111001111111001101011011000001111110011111110001011011000110011111100111111100010011000010100111111001111111001011110000000001111111000011101010101 e8b43f3f97423f82ba91e93f3f9ad83f3f8b633f3f89853f3f97803f8755
EUC-JP 雍??唯?ぜ鷹??壓??議??怨??沃?? 1111000010110110001111110011111111001101101000110011111110100100101111001100001011101011001111110011111111010100110110100011111100111111101101011100010000111111001111111011000111100101001111110011111111001101111000000011111100111111 f0b63f3fcda33fa4bcc2eb3f3fd4da3f3fb5c43f3fb1e53f3fcde03f3f
UTF-8 雍우궡唯뽬ぜ鷹귥겭壓믩챷議묕쫫怨뚯뫊沃쇱Ⅱ 111010011001101110001101111011001001101010110000111010101011011010100001111001011001010010101111111010111011110110101100111000111000000110011100111010011011011110111001111010101011011110100101111010101011001010101101111001011010001110010011111010111010111110101001111011001011000110110111111010001010110110110000111010111010110010010101111011001010101110101011111001101000000010101000111010111001101010101111111010111010101110001010111001101011001010000011111011001000011110110001111000101000010110100001 e99b8dec9ab0eab6a1e594afebbdace3819ce9b7b9eab7a5eab2ade5a393ebafa9ecb1b7e8adb0ebac95ecababe680a8eb9aafebab8ae6b283ec87b1e285a1
UHC 雍우궡唯뽬ぜ鷹귥겭壓믩챷議묕쫫怨뚯뫊沃쇱Ⅱ 111010001011110010111111111011001000001010110100111010101110011010010110111010001010101010111100111010111110110110000010111011001000000110111011111001001110001010010010111010111010101010000100111011001010000110010001111011111010011010000100111010101011001110001100111011001001000110101100111010001010101010111100111011001010010110110001 e8bcbfec82b4eae696e8aabcebed82ec81bbe4e292ebaa84eca191efa684eab38cec91ace8aabceca5b1
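The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation. The codec names below (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions about which Python codecs correspond to the charsets listed, and the helper names `encode_bits` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {name: encode_bits(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

For example, `encode_bits("あ", "utf-8")` yields the hexadecimal string `e38182`, while the same character in ISO-8859-1 falls back to `3f` because that charset has no Japanese characters.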

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).