To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????±? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13f
SJIS-WIN 雍??肉??議??歪??踰??飮??域?±誼 11101000101101000011111100111111100100111111011100111111001111111000101101100011001111110011111110011000011000110011111100111111111001101111101000111111001111111001111101011010001111110011111110001000111001100011111110000001011111011000101101100010 e8b43f3f93f73f3f8b633f3f98633f3fe6fa3f3f9f5a3f3f88e63f817d8b62
EUC-JP 雍??肉??議??歪??踰??飮??域?±誼 11110000101101100011111100111111110001101111100100111111001111111011010111000100001111110011111111001111110001000011111100111111111011001111110000111111001111111101110110111011001111110011111110110000111010000011111110100001110111101011010111000011 f0b63f3fc6f93f3fb5c43f3fcfc43f3fecfc3f3fddbb3f3fb0e83fa1deb5c3
UTF-8 雍우궠肉경벚議우쒜歪묅뫁踰묌샒飮뉖걙域듭±誼 1110100110011011100011011110110010011010101100001110101010110110101000001110100010000010100010011110101010110010101111011110101110110010100110101110100010101101101100001110110010011010101100001110110010010010100111001110011010101101101010101110101110101100100001011110101110101011100000011110100010111000101100001110101110101100100011001110110010000011100100101110100110100011101011101110101110001001100101101110101010110001100110011110010110011111100111111110101110010011101011011100001010110001111010001010101010111100 e99b8dec9ab0eab6a0e88289eab2bdebb29ae8adb0ec9ab0ec929ce6adaaebac85ebab81e8b8b0ebac8cec8392e9a3aeeb8996eab199e59f9feb93adc2b1e8aabc
UHC 雍우궠肉경벚議우쒜歪묅뫁踰묌샒飮뉖걙域듭±誼 1110100010111100101111111110110010000010101100111110101110111111101100001110011010111010101000101110110010100001101111111110110010111110101011101110100011100000100100011110001010010001101001011110101110110010100100011110100110011000101111111110101111100110100001111110101110000001100000111110011010110100101101011110110010100001101111101110101111111110 e8bcbfec82b3ebbfb0e6baa2eca1bfecbeaee8e091e291a5ebb291e998bfebe687eb8183e6b4b5eca1beebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)