To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 厭ル?誼?┃怨??厭ル?誼?┃怨??B 1000100101111101100000111000101100111111100010110110001000111111100001001010101110001001100001010011111100111111100010010111110110000011100010110011111110001011011000100011111110000100101010111000100110000101001111110011111101000010 897d838b3f8b623f84ab89853f3f897d838b3f8b623f84ab89853f3f42
EUC-JP 厭ル?誼?┃怨??厭ル?誼?┃怨??B 1011000111011110101001011110101100111111101101011100001100111111101010001010110110110001111001010011111100111111101100011101111010100101111010110011111110110101110000110011111110101000101011011011000111100101001111110011111101000010 b1dea5eb3fb5c33fa8adb1e53f3fb1dea5eb3fb5c33fa8adb1e53f3f42
UTF-8 厭ル쉴誼숋┃怨룹졆厭ル쉴誼숋┃怨룹졆B 11100101100011101010110111100011100000111010101111101100100010011011010011101000101010101011110011101100100010001000101111100010100101001000001111100110100000001010100011101011101000111011100111101100101000011000011011100101100011101010110111100011100000111010101111101100100010011011010011101000101010101011110011101100100010001000101111100010100101001000001111100110100000001010100011101011101000111011100111101100101000011000011001000010 e58eade383abec89b4e8aabcec888be29483e680a8eba3b9eca186e58eade383abec89b4e8aabcec888be29483e680a8eba3b9eca18642
UHC 厭ル쉴誼숋┃怨룹졆厭ル쉴誼숋┃怨룹졆B 11100110111101001010101111101011101111011010111111101011111111101001100111101111101001101010110111101010101100111011011111101100101000001011011111100110111101001010101111101011101111011010111111101011111111101001100111101111101001101010110111101010101100111011011111101100101000001011011101000010 e6f4abebbdafebfe99efa6adeab3b7eca0b7e6f4abebbdafebfe99efa6adeab3b7eca0b742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)