To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 獄??榮??鸚??與??獄??榮??鸚?? 10001101100101100011111100111111100111101100010000111111001111111110101001011111001111110011111111100100011011110011111100111111100011011001011000111111001111111001111011000100001111110011111111101010010111110011111100111111 8d963f3f9ec43f3fea5f3f3fe46f3f3f8d963f3f9ec43f3fea5f3f3f
EUC-JP 獄??榮??鸚??與??獄??榮??鸚?? 10111001111101100011111100111111110111001100011000111111001111111111001111000000001111110011111111100111110100000011111100111111101110011111011000111111001111111101110011000110001111110011111111110011110000000011111100111111 b9f63f3fdcc63f3ff3c03f3fe7d03f3fb9f63f3fdcc63f3ff3c03f3f
UTF-8 獄기닖榮붺퇄鸚뀐쉑與뚦쪊獄기닖榮붺퇄鸚뀐쉑 111001111000110110000100111010101011100010110000111010111000101110010110111001101010011010101110111010111011011010111010111011011000011110000100111010011011100010011010111010111000000010010000111011001000100110010001111010001000100010000111111010111001101010100110111011001010101010001010111001111000110110000100111010101011100010110000111010111000101110010110111001101010011010101110111010111011011010111010111011011000011110000100111010011011100010011010111010111000000010010000111011001000100110010001 e78d84eab8b0eb8b96e6a6aeebb6baed8784e9b89aeb8090ec8991e88887eb9aa6ecaa8ae78d84eab8b0eb8b96e6a6aeebb6baed8784e9b89aeb8090ec8991
UHC 獄기닖榮붺퇄鸚뀐쉑與뚦쪊獄기닖榮붺퇄鸚뀐쉑 111010001010101110110001111000101000100010011010111001111011010010010100111001111011011110010101111001011010010010110010111011111011110110100111111001101010100010001100111001011010010110000100111010001010101110110001111000101000100010011010111001111011010010010100111001111011011110010101111001011010010010110010111011111011110110100111 e8abb1e2889ae7b494e7b795e5a4b2efbda7e6a88ce5a584e8abb1e2889ae7b494e7b795e5a4b2efbda7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)