To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??榮??鵝??與?????????ユ?? 111010100100000000111111001111111001111011000100001111110011111111101010010000000011111100111111111001000110111100111111001111110011111100111111001111110011111100111111001111110011111110000011100001100011111100111111 ea403f3f9ec43f3fea403f3fe46f3f3f3f3f3f3f3f3f3f83863f3f
EUC-JP 鵝??榮??鵝??與??旿??????ユ?? 1111001110100001001111110011111111011100110001100011111100111111111100111010000100111111001111111110011111010000001111110011111110001111110000011111010000111111001111110011111100111111001111110011111110100101111001100011111100111111 f3a13f3fdcc63f3ff3a13f3fe7d03f3f8fc1f43f3f3f3f3f3fa5e63f3f
UTF-8 鵝얜젷榮숇젧鵝얜젷與잏윘旿섊킍溜잋슭溜ユ쯃銳 111010011011010110011101111011001001011010011100111011001010000010110111111001101010011010101110111011001000100010000111111011001010000010100111111010011011010110011101111011001001011010011100111011001010000010110111111010001000100010000111111011001001111010001111111011001001110010011000111001101001011110111111111011001000010010001010111011011000001010001101111011111010011110001011111011001001111010001011111011001000101010101101111011111010011110001011111000111000001110100110111011001010111110000011111010011000101010110011 e9b59dec969ceca0b7e6a6aeec8887eca0a7e9b59dec969ceca0b7e88887ec9e8fec9c98e697bfec848aed828defa78bec9e8bec8aadefa78be383a6ecaf83e98ab3
UHC 鵝얜젷榮숇젧鵝얜젷與잏윘旿섊킍溜잋슭溜ユ쯃銳 1110010010111101101111101110101110100000101010111110011110110100100110011110101110100000100111111110010010111101101111101110101110100000101010111110011010101000100111111110011110011111100111001110011111111010100110001110011110110100100110011110101011111110100111111110010010111101101111101110101011111110101010111110011010101000100111111110011111100101 e4bdbeeba0abe7b499eba09fe4bdbeeba0abe6a89fe79f9ce7fa98e7b499eafe9fe4bdbeeafeabe6a89fe7e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)