What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????k}?????????k{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110101101111101001111110011111100111111001111110011111100111111001111110011111100111111011010110111101101011110 3f3f3f3f3f3f3f3f3f6b7d3f3f3f3f3f3f3f3f3f6b7b5e
SJIS-WIN 五??燁??仰??k}五??燁??仰??k{^ 1000110011011100001111110011111111111011010110010011111100111111100010111100001000111111001111110110101101111101100011001101110000111111001111111111101101011001001111110011111110001011110000100011111100111111011010110111101101011110 8cdc3f3ffb593f3f8bc23f3f6b7d8cdc3f3ffb593f3f8bc23f3f6b7b5e
EUC-JP 五??燁??仰??k}五??燁??仰??k{^ 10111000110111100011111100111111100011111100101010110011001111110011111110110110110001000011111100111111011010110111110110111000110111100011111100111111100011111100101010110011001111110011111110110110110001000011111100111111011010110111101101011110 b8de3f3f8fcab33f3fb6c43f3f6b7db8de3f3f8fcab33f3fb6c43f3f6b7b5e
UTF-8 五욇쉹燁뗨뵓仰삯뀶k}五욇쉹燁뗨뵓仰삯뀶k{^ 1110010010111010100101001110110010011010100001111110110010001001101110011110011110000111100000011110101110010111101010001110101110110101100100111110010010111011101100001110110010000010101011111110101110000000101101100110101101111101111001001011101010010100111011001001101010000111111011001000100110111001111001111000011110000001111010111001011110101000111010111011010110010011111001001011101110110000111011001000001010101111111010111000000010110110011010110111101101011110 e4ba94ec9a87ec89b9e78781eb97a8ebb593e4bbb0ec82afeb80b66b7de4ba94ec9a87ec89b9e78781eb97a8ebb593e4bbb0ec82afeb80b66b7b5e
UHC 五욇쉹燁뗨뵓仰삯뀶k}五욇쉹燁뗨뵓仰삯뀶k{^ 1110011111101001100111101110100110011010100011111110011110100111100010111110100010010100100101011110010011100110101110111110100110000101101011000110101101111101111001111110100110011110111010011001101010001111111001111010011110001011111010001001010010010101111001001110011010111011111010011000010110101100011010110111101101011110 e7e99ee99a8fe7a78be89495e4e6bbe985ac6b7de7e99ee99a8fe7a78be89495e4e6bbe985ac6b7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
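
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's codec names `cp932` and `cp949` correspond to the SJIS-WIN and UHC rows, and that characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the table. The function name `encode_table` is hypothetical.

```python
# Sketch: encode one string in several charsets and show the resulting bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names stand in for the page's charset labels
# (cp932 for SJIS-WIN, cp949 for UHC).
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples for text."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hex_string in encode_table("五k}"):
        print(name, bits, hex_string)
```

For example, `encode_table("五")` yields `e4ba94` for UTF-8 and `3f` for ISO-8859-1, matching the leading bytes of the corresponding rows above.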