To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壤??鎰??癒④?遙κ????幽??癲?? 1001101011011111001111110011111111101000010011000011111100111111100101101111110010000111010000110011111111101010101000011000001111001000001111110011111100111111001111111001011101001000001111110011111111100001100111110011111100111111 9adf3f3fe84c3f3f96fc87433feaa183c83f3f3f3f97483f3fe19f3f3f
EUC-JP 壤??鎰??癒??遙κ????幽??癲?? 11010100111000010011111100111111111011111010110100111111001111111100110011111110001111110011111111110100101000111010011011001010001111110011111100111111001111111100110110101001001111110011111111100010101000010011111100111111 d4e13f3fefad3f3fccfe3f3ff4a3a6ca3f3f3f3fcda93f3fe2a13f3f
UTF-8 壤깆쥜鎰섊독癒④쿆遙κ난罹숂독幽뚰뫕癲싴뫐 1110010110100011101001001110101010111001100001101110110010100101100111001110100110001110101100001110110010000100100010101110101110001111100001011110011110011001100100101110001010010001101000111110110010111111100001101110100110000001100110011100111010111010111010111000001010011100111011111010011110100110111011001000100010000010111010111000111110000101111001011011100110111101111010111001101010110000111010111010101110010101111001111001100110110010111011001000101110110100111010111010101110010000 e5a3a4eab986eca59ce98eb0ec848aeb8f85e79992e291a3ecbf86e98199cebaeb829cefa7a6ec8882eb8f85e5b9bdeb9ab0ebab95e799b2ec8bb4ebab90
UHC 壤깆쥜鎰섊독癒④쿆遙κ난罹숂독幽뚰뫕癲싴뫐 111001011011110110110001111011001010001010010001111011001111000010011000111001111011010110110110111010111010100010101000111010101011001010011011111010011010101110100101111010101011001110101101111011001011101010011001111001111011010110110110111010101110101110001100111011011001000110110111111011111010011010011010111011011001000110110010 e5bdb1eca291ecf098e7b5b6eba8a8eab29be9aba5eab3adecba99e7b5b6eaeb8ced91b7efa69aed91b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)